<?xml-model href='http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng' schematypens='http://relaxng.org/ns/structure/1.0'?><TEI xmlns="http://www.tei-c.org/ns/1.0">
	<teiHeader>
		<fileDesc>
			<titleStmt><title level='a'>aflow.org: A web ecosystem of databases, software and tools</title></titleStmt>
			<publicationStmt>
				<publisher></publisher>
				<date>01/01/2023</date>
			</publicationStmt>
			<sourceDesc>
				<bibl> 
					<idno type="par_id">10442668</idno>
					<idno type="doi">10.1016/j.commatsci.2022.111808</idno>
					<title level='j'>Computational Materials Science</title>
<idno>0927-0256</idno>
<biblScope unit="volume">216</biblScope>
<biblScope unit="issue">C</biblScope>					

					<author>Marco Esters</author><author>Corey Oses</author><author>Simon Divilov</author><author>Hagen Eckert</author><author>Rico Friedrich</author><author>David Hicks</author><author>Michael J. Mehl</author><author>Frisco Rose</author><author>Andriy Smolyanyuk</author><author>Arrigo Calzolari</author><author>Xiomara Campilongo</author><author>Cormac Toher</author><author>Stefano Curtarolo</author>
				</bibl>
			</sourceDesc>
		</fileDesc>
		<profileDesc>
			<abstract><ab><![CDATA[To enable materials databases supporting computational and experimental research, it is critical to develop platforms that both facilitate access to the data and provide the tools used to generate/analyze it -all while considering the diversity of users' experience levels and usage needs. The recently formulated FAIR principles (Findable, Accessible, Interoperable, and Reusable) establish a common framework to aid these efforts. This article describes aflow.org, a web ecosystem developed to provide FAIR-compliant access to the AFLOW databases. Graphical and programmatic retrieval methods are offered, ensuring accessibility for all experience levels and data needs. aflow.org goes beyond data-access by providing applications to important features of the AFLOW software [1], assisting users in their own calculations without the need to install the entire high-throughput framework. Outreach commitments to provide AFLOW tutorials and materials science education to a global and diverse audiences will also be presented.]]></ab></abstract>
		</profileDesc>
	</teiHeader>
	<text><body xmlns="http://www.tei-c.org/ns/1.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink">
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>The advantage of big data has long been recognized in computational materials science, resulting in large databases of experimental and hypothetical materials. The Automatic FLOW Framework for Materials Discovery (AFLOW) is the largest database for computationally investigated materials, with over 3.5 million materials entries <ref type="bibr">[2]</ref> and over 1,100 crystallographic prototypes to classify crystals and generate input structures.</p><p>The data should not just be publicly available, but also easily accessible to users with varying usage requirements, from casual singleentry access to high-throughput retrieval. To provide a common framework for such an infrastructure, the FAIR principles for data stewardship were developed, where FAIR stands for ''Findable'', ''Accessible'', ''Interoperable'', and ''Reusable'' <ref type="bibr">[3]</ref>.</p><p>This article introduces the AFLOW web ecosystem, which consists of the AFLOW databases and web interfaces to access features of the AFLOW software. We will start with the aflow.org repository, its data structure, and how it conforms with the FAIR data principles. hulls for thermodynamic analysis (AFLOW-CHULL), and use AFLOW machine learning applications (AFLOW-ML). These applications will be introduced after the discussion about the AFLOW database.</p><p>Lastly, we will present the AFLOW School and seminar series. Making software usable and accessible goes beyond just making it open source. Tutorials are an invaluable way to reduce the barrier to entry for new users. The Schools and seminars represent AFLOW's commitment to providing free education about its software and materials science to a global and diverse audience.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">The aflow.org repository</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1.">Database content and organization</head><p>The full data set generated by the high-throughput AFLOW process <ref type="bibr">[2]</ref> is currently over 40 TB. Stored data includes input criteria and other calculation details (to facilitate reproducibility and fulfill the ''Reusable'' criterion of the FAIR principles), calculated results, and derivative output. To ensure the robust, long-term fulfillment of the FAIR criteria of accessibility and findability for such a large data store, the AFLOW repository <ref type="bibr">[2]</ref> is organized in a human-navigable hierarchical directory structure, allowing specific entries to be accessed independently of the database engine. Specifically, the repository is organized into project, set and calculation layers, as illustrated in Fig. <ref type="figure">1</ref>. At the project layer, the calculations are divided into different catalogs based on the origin and composition of the entries <ref type="bibr">[11,</ref><ref type="bibr">12]</ref>. Within each catalog, entries are grouped into sets based on shared lattice type or alloy system. The entries within each set contain the results of density functional theory (DFT) calculated properties for particular structures.</p><p>The AFLOW-ICSD catalog contains the DFT-calculated properties for over 60,000 experimentally observed materials listed in the Inorganic Crystal Structure Database (ICSD) <ref type="bibr">[13,</ref><ref type="bibr">14]</ref>. Internally, this catalog is organized by lattice type, and then by individual materials entry. The entries in this catalog are grouped by Bravais lattice type into 14 sets: BCC, BCT, CUB, FCC, HEX, MCL, MCLC, ORC, ORCC, ORCF, ORCI, RHL, TET, and TRI. The name of each materials entry is generated using the format &lt;composition&gt;_ICSD_&lt;ICSD number&gt;. To aid reproducibility, the species labels in the composition part of the entry name are the full VASP pseudopotential (POTCAR) names used for that calculation. Since the materials in this catalog are already known to exist, the primary interest is in accurately calculating electronic structure and thermo-mechanical properties. Therefore, calculations for this catalog are generally performed using the Hubbard &#119880; correction to the DFT exchange-correlation functional <ref type="bibr">[15,</ref><ref type="bibr">16]</ref> where appropriate, using a set of standardized &#119880; values <ref type="bibr">[17]</ref>.</p><p>The entries in the other catalogs, such as LIB1, LIB2, and LIB3, are generated by decorating crystal structure prototypes to predict new hypothetical compounds, and contain unary, binary, and ternary materials, respectively. Within each catalog, the entries are grouped by element and exchange-correlation functional in the case of LIB1, and by alloy system in the cases of LIB2 and LIB3. LIB1 contains a total of 4k entries, while LIB2 currently has 364k entries and LIB3 has about 2.6 million. Within each alloy system, the individual materials entries are named according to the relevant crystal prototype. For example, the directory path for the composition AlCu 2 Mg in the Heusler structure is LIB3_WEB/AlCu_pvMg_pv/T0001.AB2C, where LIB3_WEB is the project layer corresponding to LIB3, AlCu_pvMg_pv is the set layer corresponding to the Al-Cu-Mg alloy system (using the Al, Cu_pv and Mg_pv pseudopotential POTCARs), and T0001.A2BC is the A2BC decoration of the Heusler prototype T0001. For these catalogs, the emphasis is on the discovery of new thermodynamically stable or metastable materials, and on their use to generate the thermodynamic density of states for the prediction of the formation of disordered materials such as metallic glasses <ref type="bibr">[18]</ref> or high entropy alloys <ref type="bibr">[19]</ref>. Therefore, calculations in these catalogs are performed using the GGA-PBE exchange-correlation functional <ref type="bibr">[20]</ref> without Hubbard &#119880; corrections <ref type="bibr">[17]</ref> so as to produce consistent energy differences, enabling the calculation of accurate formation enthalpies.</p><p>Categorizing data based on the number of species, composition, and crystal prototype is an intuitive, human-readable choice. However, this data structure introduces inefficiencies that make it unsuitable for machine reading. The number of possible element combinations and crystal prototypes increases with the number of species. This results in an unbalanced search tree as illustrated in Fig. <ref type="figure">1(a)</ref>. For instance, the catalog of binary (LIB2) and ternary entries (LIB3) may have 10 3 and 10 5 entries, respectively. In the alloy layer, the number of entries may be unevenly distributed as well. This data structure is inefficient to query and index: walking and searching through LIB3 is significantly slower than through LIB2, resulting in uneven performance and preventing efficient parallelization. Using other materials properties to structure the data would present similar challenges as no property is evenly distributed across all systems.</p><p>The AFLOW Unique IDentifier (AUID) system eliminates this computational inefficiency. It consists of the aflow namespace declaration, followed by a quasi-random 64-bit (8-byte) hash, for example: aflow:d3aa24a7307877b5.</p><p>This identifier has the capacity for 2 64 &#8776; 10 19 entries and can be further extended by using different namespaces. The AUIDs are generated by hashing the alphabetically ordered output files (OUTCAR) of the AFLOW run using a 64-bit cyclic redundancy check (CRC64) algorithm.</p><p>Since the content of these files depends on parameters specific to the calculation, such as the options of the VASP calculations or the input and final structure, the resulting AUID is globally unique to the entry. Changing the cut-off energy, for example, would result in a different identifier. Using OUTCAR files as the basis for the algorithm also makes the AUID reproducible and thus persistent, fulfilling the ''Findable'' criterion of the FAIR data sharing principles <ref type="bibr">[3]</ref>.</p><p>The byte expression of the AUID offers a new way to structure data where each layer is represented by an individual byte of the AUID. This scheme is shown in Fig. <ref type="figure">1(b</ref>). While this leads to a larger number of branches, they are smaller (&#8764; 10 2 entries) with an even distribution of AUIDs across each byte layer due to the quasi-randomness of the hash. These balanced search trees are fast to query and index and also enable efficient parallelization.</p><p>Due to this computational efficiency, AUIDs are used throughout the AFLOW software and web applications when interfacing with the AFLOW database. However, using a hash-based identifier has the major drawback that it is not human-readable as opposed to a material-based identifier such as:</p><p>The data structures presented in this section are thus complementary. The AFLOW database provides both methods of access to its materials entries.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2.">Accessing and searching the database</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2.1.">The AFLOW REST-API</head><p>To facilitate accessibility and interoperability to the 40 TB of data generated by the high-throughput AFLOW process <ref type="bibr">[2]</ref>, the backing store is exposed via the AFLOW Data REST-API <ref type="bibr">[2]</ref> in a hierarchical organization. This direct exposure of our results not only grants the end user a high degree of utility via direct access, but more importantly, guarantees data provenance that promotes reproducibility. The hierarchy of the AFLOW Data REST-API categorizes this abundance of information into meaningful high-level classifications, allowing for exploration of self-similar materials that are related by stoichiometric and/or crystallographic properties. Once a selection of materials has been determined, the full range of available properties and procedural data are retrievable.</p><p>The organizational hierarchy of both the underlying data store and the REST-API is project-dependent, as described in Section 2.1. Each project is equivalent to one of the catalogs listed in this section, and in the REST-API is denoted by the project layers ICSD_WEB, LIB1_WEB, LIB2_WEB, and LIB3_WEB. Each project layer contains multiple set layers, which correspond to specific alloy systems in the case of LIB1_WEB, LIB2_WEB, LIB3_WEB, etc. For instance: aflowlib.duke.edu/AFLOWDATA/LIB2_RAW exposes the set layer for binary entries, where each set corresponds to a different binary alloy system, allowing for pairwise atomic species examination. Within each set is the entry layer, consisting of decorated structural prototypes which provide stoichiometric and structural variation for each alloy system. Each entry contains the calculated results for a particular structure and composition, organized as keyword-value pairs. The calculated values of thermodynamic, mechanical, electronic, magnetic, chemical and crystallographic properties can be directly accessed by querying a Uniform Resource Identifier (URI) of the form &lt;server&gt;/&lt;project&gt;/&lt;set&gt;/&lt;entry&gt;/?&lt;keyword&gt;, where &lt;server&gt; is aflowlib.duke.edu/AFLOWDATA, &lt;project&gt; is the appropriate project layer, &lt;set&gt; is the alloy system, &lt;entry&gt; is the structural prototype, and &lt;keyword&gt; corresponds to the materials property of interest. For example, the formation enthalpy per atom of AlCu 2 Mg in the Heusler structure can be accessed at <ref type="url">https://aflow</ref>  set.</p><p>The ability to explore related entries predicated on a multitude of properties leads directly to novel materials discovery and use. The AFLOW Data REST-API disseminates our methods and results, without restriction, to a global research audience in order to promote scientific and engineering advancement.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2.2.">AFLUX: The AFLOW search API</head><p>The AFLUX API provides programmatic search functionality for the AFLOW database <ref type="bibr">[12]</ref>. Queries are submitted through a URI and can thus be conducted with commonly available tools such as wget, curl, Python's urllib module, or a web browser. Results are returned in easily parsed formats, such as JSON. This makes AFLUX platformindependent and enables integration into computational workflows written in all commonly used languages.</p><p>AFLUX is a domain-specific language, i.e., no prior knowledge on the underlying database structure or format is needed. It requires only a very small set of operators, shown in Table <ref type="table">1</ref>, which further reduces barriers to using it. An AFLUX call, or a summons, is written in the query portion of the URL: aflow .org/API/ aflux /?&lt; summons &gt;</p><p>The summons is composed of a matchbook and directives: aflow .org/API/ aflux /?&lt; matchbook &gt;,&lt;directive &#8618; &gt;</p><p>The matchbook consists of the search criteria whereas directives guide the form of the results. The AFLUX matchbook. To search for and filter by properties, they need to be added to the matchbook. The following summons lists the band gap (Egap of each returned entry:</p><p>aflow.org/API/aflux /? Egap Or in short:</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>&lt;matchbook &gt; = Egap</head><p>A full list of properties can be found in the Appendix. Multiple properties can be returned by using the AND (,) operator. For instance, &lt;matchbook &gt; = Egap , spin_atom returns both the band gap and the magnetic moment per atom of each material. Properties can be restricted to a specific value using the BLOCK (parentheses) operator:</p><p>&lt;matchbook &gt; = species (Cr)</p><p>This summons will only return entries containing chromium (Cr). These restrictions also work for numerical values. The matchbook &lt;matchbook &gt; = Egap <ref type="bibr">(1)</ref> returns all entries with a band gap of exactly 1 eV.</p><p>Set logic. AFLUX supports basic logical operators: NOT (!), AND (,), and OR (:). NOT can be used to exclude values from the results. For example, to show entries without Cr, the matchbook should be:</p><p>&lt;matchbook &gt; = species (! Cr)</p><p>The AND operator, previously used to retrieve multiple properties, can also be employed as an intra-property operator. The following summons will return all entries containing Cr and O:</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>&lt;matchbook &gt; = species (Cr ,O)</head><p>To get all entries that include Cr or Mn, but not necessarily both, the OR operator can be used:</p><p>&lt;matchbook &gt; = species (Cr:Mn)</p><p>When combined with the BLOCK operator, these logical queries can become arbitrarily complex. The following matchbook will return all materials with either Cr or Mn (or both) and at least one chalcogen:</p><p>&lt;matchbook &gt; = species (( Cr:Mn) ,(O:S:Se:Te))</p><p>Value ranges and substrings. Exact matches may not always be desired, especially for numeric quantities like the band gap (Egap). The To get a strictly greater than 1.6 eV match condition, the NOT operator can be combined with the LOOSE operator:</p><p>Using the LOOSE operator without any value will discard null results, which are returned by default. The matchbook &lt;matchbook &gt; = Egap (*)</p><p>will only return entries for which Egap has been calculated.</p><p>The LOOSE operator can also be used for substring searches. The position of the LOOSE operator determines where the substring ought to be located. The matchbooks</p><p>find entries with substrings at the beginning, in the middle, and at the end, respectively. Single quotes (STRING operator) are required for substring searches. They can also be used to avoid collisions with AFLUX operators as in &lt;matchbook &gt; = auid ( ' aflow :1 dcf91da07337d8d</p><p>&#8618; ' )</p><p>Without STRING, AFLUX would search for auid='aflow' OR auid='1dcf91da07337d8d', resulting in an empty response.</p><p>Removing redundancies. Properties in a search result may be ubiquitous and therefore of little use. For example, in the matchbook</p><p>each returned material will have three species, bloating the response with redundant information. AFLUX can suppress the output of a match criterion via the MUTE ($) operator:</p><p>This allows a property to be matched, but not displayed.</p><p>Aliases. Complex AFLUX summons can become cumbersome to type. For example, to search for materials containing a transition metal and a chalcogen, the matchbook needs to contain:</p><p>&lt;matchbook &gt; = species (( Sc:Ti:V :...: Hg) ,(O:S &#8618; :Se:Te))</p><p>where &#8230; contains 25 more species. AFLUX contains pre-defined aliases to simplify this summons to:</p><p>&lt;matchbook &gt; = species ( TransitionMetals , &#8618; Chalcogens )</p><p>A list of all supported aliases can be found at aflow.org/API/aflux/ ?help(aliases). AFLUX directives. Directives determine the size and format of an AFLUX response and provide documentation. The currently supported directives are as follows:</p><p>&#8226; format,</p><p>&#8226; paging,</p><p>&#8226; help.</p><p>The format directive. By default, AFLUX returns search results as minified JSON. The format directive can be used to change the output format:</p><p>Supported formats are json, aflow, and html. Using json formats the response as a human-readable JSON file. The aflow format returns the data in the aflowlib.out format, i.e., as key-value pairs separated by |.</p><p>Finally, html displays the results as an HTML table.</p><p>The paging directive: response size and sorting. The size of the AFLOW repository can lead to very large responses when the search criteria are broad. This can overwhelm systems communicating with AFLUX, especially when a web browser is used. As a result, AFLUX limits the number of entries it returns at once. The paging directive controls this response sizing and enumeration: &lt;directive &gt; = paging ( J ,K)</p><p>This returns the &#119869; th page with &#119870; results per page. For example, the directive:</p><p>&lt;directive &gt; = paging <ref type="bibr">(3 ,100)</ref> can be used to retrieve the results 201-300 (3 rd page with 100 entries per page). The default values are &#119869; = 1 and &#119870; = 64. If &#119869; is larger than the number of available pages, the response will be an empty array or an empty string for json and aflow format, respectively. This can be used as a stop condition in a while-loop for high-throughput retrieval.</p><p>Setting &#119869; or &#119870; to zero results in special behavior: &#119869; = 0 returns all results at once; &#119870; = 0 returns the number of results, regardless of the value of &#119869; . Paging can also be used to sort results. Positive and negative values for &#119869; sort results in ascending and descending order, respectively. This also works for &#119869; = 0 to return all results in descending order: &lt;directive &gt; = paging ( -0)</p><p>When the output format is JSON, using the paging directive returns the results as a dictionary. The MUTE operator can be used to format the data as an array instead:</p><p>&lt;directive &gt; = $paging <ref type="bibr">(3 ,100)</ref> This is the default behavior when paging is not added to the directive. The help directive: AFLUX documentation. Usage information on AFLUX can be obtained using help. The directive: &lt;directive &gt; = help( general ) produces a general help file that serves as the documentation for AFLUX.</p><p>The same output can be found when no argument to help is given or when both the matchbook and the directive are empty. Information on available properties (see Appendix) can be:</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>&lt;directive &gt; = help( properties )</head><p>Replacing ''properties'' with a list of keywords will limit the output to the selected properties. All available help files are listed in the following directive:</p><p>&lt;directive &gt; = help(help)</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2.3.">AFLOW-OPTIMADE integration</head><p>To facilitate interoperability with other databases in accordance with the FAIR criteria, as well as to broaden accessibility and findability, the OPTIMADE common API <ref type="bibr">[22]</ref> has been implemented within the AFLOW database. OPTIMADE is a common API for searching materials data across a range of repositories, including AFLOW, NOMAD <ref type="bibr">[23]</ref>, Materials Project <ref type="bibr">[24]</ref>, Materials Cloud <ref type="bibr">[25]</ref>, and OQMD <ref type="bibr">[26]</ref>. The full specification is available at <ref type="url">https://github.com/Materials-Consortia/ OPTIMADE/blob/master/optimade.rst</ref>. Similar to AFLUX, searches are encoded within the query part of URL, and can include logic operations such as AND, OR and NOT; searches for different element combinations using the operators HAS ALL, HAS ANY and HAS ONLY; substring comparisons using CONTAINS, STARTS WITH and ENDS WITH; and searches for property values within specific ranges using &gt;, &lt; and =.</p><p>Data is returned in as a JSON object, in a standardized format to maximize interoperability.</p><p>The OPTIMADE API for AFLOW can be accessed at aflow.org/API/ optimade. Available queryable endpoints include ones for structures and calculations. The server takes the OPTIMADE query, converts it to an AFLUX query for the AFLOW database, then reads the AFLUX response and translates it into the standardized format, as illustrated in Fig. <ref type="figure">2</ref>. AFLOW keywords can be included in Optimade queries of the AFLOW database by appending the prefix ''_aflow_'' to the front of the keyword. For example, lead-free halide cubic perovskites with a band gap exceeding 3 eV can be searched for in the AFLOW database using optimade with the following query: aflow.org/API/optimade/ v1/structures?filter=elements HAS ANY ''F'',''Cl'',''Br'',''I'' AND NOT elements HAS ''Pb'' AND _aflow_aflow_prototype_label_relax CONTAI NS AB3C_cP5_221_a_c_b. Full lists of the keywords available to use with OPTIMADE to query AFLOW are available at the info endpoints aflow.org/API/optimade/v1/info/structures and aflow.org/API/ optimade/v1/info/calculations.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2.4.">The AFLOW search GUI</head><p>Programmatic access to data is essential for integrating the AFLOW data into workflows, but is not suitable for casual searches or for users unfamiliar with coding or the AFLUX language. To fulfill these usage requirements, aflow.org provides a search application, which can be accessed at aflow.org/search/. It consists of four components: the search bar, an element filter, a properties filter, and the search results. The search bar, at the top of the page (see Figs. <ref type="figure">3(a</ref>) and (b)), contains a text field for the elements of interest, the button to conduct a search, and the ''Display'' slider. The text field also contains buttons that can limit the search to the ICSD catalog or extend it to the entire AFLOW database. The ''Display'' slider toggles between having the element and property filters on separate slides and to have them both displayed on a single page, depending on the preferences of the user.</p><p>The element filter is shown in Fig. <ref type="figure">3</ref>(a) and takes shape of the periodic table. Elements can be added to the query by clicking on its chemical symbol or by directly typing them into the search bar. Groups of elements can also be selected using the element group selector. Hovering over each button reveals which elements belong to each group, as demonstrated for the chalcogens in the figure. Next to the element group selectors is a set of logical operators. Elements or element groups can be excluded using the NOT button or combined using AND, OR, or XOR (exclusive or). AND is automatically added when clicking on two elements successively. Parentheses can be used to make these combinations arbitrarily complex. Using the AFLUX <ref type="bibr">[12]</ref> syntax, these logic statements can also be directly entered into the search bar. However, using the buttons can prevent syntax errors because they only accept valid operators, e.g., by disallowing using OR immediately after AND. In either case, the syntax is validated before submitting a search.</p><p>Lastly, the number of species can be set with ''# of species'' selector. Both ranges or an exact number are possible.</p><p>The property selectors can be displayed by clicking on the arrow next to the ''Property Filters'' label or by flipping the ''Display'' slider and is shown in Fig. <ref type="figure">3</ref>. Available properties are organized into different categories, such as electronic, mechanical, or symmetry properties of the relaxed structure. A property can be added to the search query by clicking the ''Add'' button. The text field above the categories and descriptions serves as a search field to provide a convenient way to add a property to the list of filters. Selections can be removed via the ''&#215;'' button at the right-hand side of its card. In the example shown in Fig. <ref type="figure">3</ref>(b), ''Bravais lattice'' and ''VRH bulk modulus (AEL)'' were added to the search query (''space group'' and ''Pearson symbols'' are added by default). The figure also shows that the values for these two properties were set to restricted to retrieve only face-centered cubic compounds with a VRH bulk modulus of 0 -100 GPa. Additionally, the ''Bravais lattice'' property has ''Display column'' unchecked, which will result in it not being displayed in the results table. Since the Bravais lattice will be the same value for every material, this will help focus the search output.</p><p>Fig. <ref type="figure">3</ref>(c) shows the results of this query. The driver of the search is AFLUX <ref type="bibr">[12]</ref> -its summons is shown in the field below the ''Reset Search'' button. The results table is divided into pages and always contains the ENTRY and DATA columns. ENTRY consists of the empirical formula and the AUID of the entry. The link points to the material's entry page, which will be discussed later. The DATA column contains links to the REST-API page of the entry and its aflowlib.out and alowlib.json files. The remaining columns contain the requested properties for which ''Display column'' is switched on -column visibility can be toggled after the search as well. The tables can be sorted by each visible column except DATA and array properties.</p><p>The entry page contains the calculated properties and downloadable files for the material <ref type="bibr">[27]</ref><ref type="bibr">[28]</ref><ref type="bibr">[29]</ref><ref type="bibr">[30]</ref><ref type="bibr">[31]</ref>. As for the search page, properties are organized into categories to facilitate navigation. Two example sectionsthermal properties and mechanical properties -are shown in Fig. <ref type="figure">3(d)</ref>. Each property has its own card containing its value and unit. The property cards also contain links to their REST-API entries <ref type="bibr">[11]</ref>. The ''Downloadable Files'' section at the bottom of the page is structured the same way and contains various input and output files such as VASP inputs, Bader charges, and outputs of the AFLOW modules used for the entry. The input files and calculation parameter sections fulfill the FAIR ''Reusable'' criterion and help reproduce the results calculated by AFLOW.</p><p>There are also two interactive applets on this page: one for a crystal structure visualization and another one for electronic structures. Fig. <ref type="figure">3</ref>(e) shows the structure viewer tool. It is available for both the conventional and primitive cell of the material and is based on JSmol <ref type="bibr">[32,</ref><ref type="bibr">33]</ref>. Apart from basic structure viewing options, it offers visualizations for the symmetry elements of the structure calculated by AFLOW-SYM <ref type="bibr">[7]</ref>. The ''Symmetry'' tab shows all available operations and displays their types, symbols in the Schoenflies and Hermann-Mauguin notation, whether they contain a translation, their symmetry plane or axis in fractional coordinates, and their rotation angles. The axes and planes, as well as the inversion points (for inversions and rotoinversions) can be displayed, as shown in Fig. <ref type="figure">3(e</ref>). The ''Apply'' buttons start animations that visualize the selected symmetry operation -a supercell of the structure can be displayed as a visual aid. Where available, the applet can also display Bader isosurfaces for each atom in the primitive cell for different cut-off values.</p><p>The other interactive applet shows the electronic structure of the material (see Fig. <ref type="figure">3(f)</ref>). Bands of both spin channels can be traced with a mouse, showing the energy of the band at every step. Zooming in and dragging along the &#119896;-path axis is possible as well. The lines of the density of states (DOS) can also be traced, but the value of the DOS is shown instead of the energy. By default, the total DOS and the projections along the orbitals are shown -the projected DOS of each symmetrically-inequivalent atom is accessible using the ''Switch DOS'' drop-down menu. Static high-quality images of both the total and projected DOS are available through the button underneath the applet.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">The prototype encyclopedia</head><p>Beginning researchers in computational materials science often find it difficult to determine exactly which structure is to be studied. Take as an example a study of the behavior of the ''high-temperature'' (as classified in 1969 <ref type="bibr">[34]</ref>) superconductor Nb 3 Al. Performing computations for this material requires knowledge of its crystal structure, including both the unit cell dimensions and the position of each atom in that cell, but novices in the field may have difficulty finding this information in the literature. The original Nb 3 Al article refers to the &#120573;-W structure and hints that the structure is similar to that of V 3 Sn and Nb 3 Sn. Searching further, our hypothetical researcher might find references to Cr 3 Si as the ''prototype'' of this structure, or it might simply be referred to as a mysterious ''A15'' superconductor. These sources assume that the structure is so well known that it is not necessary to specify the atomic positions or the space group of the structure, and might not even mention that the crystal is cubic. This problem persists across the literature, obfuscating the relevant crystal structure information needed to perform computational studies.</p><p>To address this challenge, the AFLOW Encyclopedia of Crystallographic Prototypes (or Prototype Encyclopedia for brevity) serves as an extensive catalog of crystalline prototypes hosted at aflow.org/ prototype-encyclopedia/. Based on the ''Crystal Lattice Structures'' website started by researchers at the U.S. Naval Research Laboratory's (NRL) Complex Systems Theory Branch (now the Center for Computational Materials Science), the Curtarolo Group at Duke University has continued and extended this effort via the Prototype Encyclopedia. A web page for each structure can be found online, detailing its space group, lattice constants, and atomic (Wyckoff) positions. To assist new researchers and students, each page expanded this basic information by explicitly describing the position of each atom in the structure's primitive cell given the Wyckoff positions of its site. It also lists compounds with the same structure as the prototype, and included many of the common Strukturbericht symbols <ref type="bibr">[35]</ref>. A simple search of the website would then reveal the structure of Nb 3 Al <ref type="bibr">[36]</ref>. In addition to online search capabilities, users can generate geometry files based on these structures with underlying AFLOW prototyping functionality.</p><p>There are currently three parts to the AFLOW Prototype Encyclopedia. Each release resulted in a major update to the website -adding a page for each new prototype -and is accompanied with an article in Computational Materials Science. A summary of each part is as follows:  &#8226; Part 1 <ref type="bibr">[37]</ref>: comprised of 288 structures found on the original ''Crystal Lattice Structures'' page. It also includes a discussion of topics in crystallography (found both online and in the article), namely three-dimensional systems, including crystal systems, lattice types, symmetry operations, and Wyckoff positions. &#8226; Part 2 <ref type="bibr">[38]</ref>: added 302 new structures, including at least one structure from each of the 230 three-dimensional space groups. Additional topics in crystallography are discussed, including enantiomorphic space groups, the Wigner-Seitz description of primitive cells in a periodic system, and descriptions of the seventeen two-dimensional plane groups. &#8226; Part 3 <ref type="bibr">[39]</ref>: added 510 new structures, including the structure of each inorganic Strukturbericht prototype found in the original German publications from 1937 through 1943 and later additions made by Smithells <ref type="bibr">[40,</ref><ref type="bibr">41]</ref> and Pearson <ref type="bibr">[42]</ref>. The article concluded with a list of the Strukturbericht symbols and the first complete index of the inorganic Strukturbericht prototypes.</p><p>To date, there are 1,100 structure prototypes in the Prototype Encyclopedia, and it is expected to continue to grow. Prototype page content. Our hypothetical new user can now easily search for the mysterious Nb 3 Al or ''A15'' structure. Once on the Library's home page (Fig. <ref type="figure">4</ref>(a)), the user enters ''Nb3Al'' in the search box, revealing the Cr 3 Si (A15) prototype (Fig. <ref type="figure">4(b)</ref>). Clicking on the link reveals the page shown in Figs. <ref type="figure">4(c-l</ref>). This page describes the structure in detail:</p><p>&#8226; The title includes the AFLOW label for the structure. The label also serves as the basis for the web page URL (Fig. <ref type="figure">4(c)</ref>). &#8226; A rendering of the structure, using Jmol <ref type="bibr">[32,</ref><ref type="bibr">33]</ref>. Users can rotate and zoom in on the structure, as shown in Fig. <ref type="figure">4(d)</ref>. They may also change the window to see a supercell of the structure. Rightclicking on the image brings up the standard Jmol menu, and the user can even access the Jmol console for more options. &#8226; The chemical formula of the prototype of the structure (Fig. <ref type="figure">4(e)</ref>).</p><p>&#8226; The AFLOW prototype label (Fig. <ref type="figure">4(e)</ref>). The main web page with a different methods to find specific prototypes, including a search bar. (b) An example response page when searching for ''Nb3Al''. The prototype page for Cr 3 Al, the ''A15'' prototype of Nb 3 Al described in the text can be found at aflow.org/prototypeencyclopedia/A3B_cP8_223_c_a.html. Each page contains the following: (c) title, (d) Jmol viewer, (e) prototype designations, (f) a list of compounds/elements exhibiting this structure, (g) notable comments, (h) primitive vectors, (i) basis vectors, (j) references, (k) additional references where the structure was found, and (l) an automatic prototype generator.</p><p>&#8226; The Strukturbericht designation, if any (Fig. <ref type="figure">4(e)</ref>).</p><p>&#8226; The Pearson symbol <ref type="bibr">[43]</ref>, showing the crystal class and the number of atoms in the unit cell (Fig. <ref type="figure">4(e)</ref>).</p><p>&#8226; The space group symbol (Fig. <ref type="figure">4(e)</ref>).</p><p>&#8226; The format of the AFLOW prototype command which was used to generate the structure (Fig. <ref type="figure">4(e)</ref>).</p><p>&#8226; A list of known compounds which have this structure, if any. This list is updated when similar compounds are found in the literature (Fig. <ref type="figure">4(f)</ref>).</p><p>&#8226; Any comments about the structure (Fig. <ref type="figure">4(g)</ref>). This may include links to other structures associated with the prototype or related to the current crystal structure. &#8226; The vectors defining the primitive unit cell of the structure (Fig. <ref type="figure">4(h)</ref>). &#8226; The basis vectors: the position of each atom in the cell is described in terms of the primitive vectors and the Cartesian coordinates, as a function of the lattice parameters and the Wyckoff positions (Fig. <ref type="figure">4</ref>(i)). The atoms are grouped by Wyckoff position and atomic species.</p><p>&#8226; The reference to the structure in the literature, if any, including a link to the publication, if available (Fig. <ref type="figure">4(j)</ref>). &#8226; If the above reference was found in another publication or website, that reference and its associated link are given (Fig. <ref type="figure">4</ref>(k)).</p><p>Finally, each page contains a link to the AFLOW prototype generator for this structure (Fig. <ref type="figure">4(l)</ref>). The user can change the chemistry of the prototype, and enter new lattice constants and Wyckoff coordinates appropriate to that structure. Once entered, this information can be used to update the page showing the new structure. Information about the structure can be obtained in a variety of formats, including popular electronic structure programs and the standard Crystallographic Information File (CIF). Searching structures by crystallographic descriptors. The Prototype Encyclopedia website includes resources which serve as a useful reference for beginning students in crystallography, adapted from published articles <ref type="bibr">[37]</ref><ref type="bibr">[38]</ref><ref type="bibr">[39]</ref>.</p><p>All periodic systems in three-dimensional space belong to one of seven crystal systems, each with the same holohedry (rotational symmetry). Different possible translational symmetries in each class lead to the fourteen Bravais lattices. This information is summarized under the ''Crystal Info'' tab on the home page. This tab provides a brief description of each crystal class, the Bravais lattices included in the class, and the space groups associated with each Bravais lattice. Although each lattice can be described by a large (infinite) variety of primitive vectors, AFLOW makes a standard choice for each lattice <ref type="bibr">[4]</ref>, which we show here. So, for example, if a structure is known to have a body-centered orthorhombic lattice, clicking the ''Orthorhombic'' tag under ''Crystal Info'' and scrolling down to ''Body-Centered Orthorhombic'' will show:</p><p>&#8226; the standard primitive vectors of the lattice, &#8226; an image of the primitive unit cell and the associated conventional unit cell, &#8226; and list the nine space groups which are associated with this lattice.</p><p>Each crystal structure is associated with a Pearson Symbol, which summarizes the crystal system, the Bravais lattice, and the number of atoms in the conventional unit cell of the structure. The ''Pearson Symbol'' tab on the home page is an index of all the crystal structures by symbol. If the body-centered orthorhombic system described above has eight atoms in the conventional cell, then the user finds and selects the ''Orthorhombic Structures'' link on the Pearson Symbol page. The resulting page lists all of the orthorhombic structures currently in the Prototype Encyclopedia. Searching for ''oI8'' (orthorhombic class ''o'', body-centered lattice ''I'', and eight atoms for unit cell) leads to the three crystal structures with this Pearson symbol found in the current database.</p><p>The ''Space Group'' tab indexes the structure by space group. To continue with the above example, the oI8 structured ferroelectric NaNO 2 is in space group &#119868;&#119898;&#119898;2 #44. This structure can be found under the Orthorhombic Space Group link of the Space Group page, along with the other four structures currently in this space group that are currently in the database.</p><p>Similarly, the Strukturbericht tab lists all of the structures which have been given a Strukturbericht designation. The NaNO 2 structure described above is labeled &#119865; 5 5 , and can be found by clicking on the ''Strukturbericht Type F'' link on this web page and scrolling down to this label.</p><p>The ''Prototype Index'' tab lists every structure in the Encyclopedia. This index can be sorted by prototype formula, the number of atomic species, the number of atoms in the primitive cell, the Pearson Symbol, Strukturbericht designation (if any), AFLOW prototype label, space group label, space group number, and structure name (if available). The default formula chemical ordering on this page is alphabetical by element, so we can find our sample structure by ordering by prototype structure and scrolling down to NNaO 2 .</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Generic search functionality.</head><p>The prototypes within the Encyclopedia can be searched online via different criteria. In the navigation bar, the users can type the chemical formula, mineral, or common name to search for specific materials/prototypes. This function also searches through the ''similar elements/compounds'' to consider atom decorations beyond the ''prototype material'' for a given prototype. Matching prototypes are returned on a separate page, with hyperlinks to navigate the corresponding entries. By selecting ''Search'' in either the navigation bar or the main web page, users will be redirected to a separate webpage, enabling searches via ''Name'' (same functionality as with the navigation bar), ''Space Group'', ''Pearson Symbol'', and/or ''Strukturbericht Designation''. To further narrow down results, users can type in each search field to perform compound queries. Comparing to the Encyclopedia with AFLOW-XtalFinder. In the ''AFLOW-XtalFinder'' tab, users can upload structures to determine if they match to one of the prototypes in the Encyclopedia. Leveraging routines in the AFLOW-XtalFinder module <ref type="bibr">[8]</ref>, this functionality automatically determines i. the input structure's ideal prototype designation (i.e., label and degrees of freedom), ii. structurally similar prototypes (isoconfigurational) in the Encyclopedia, and iii. symmetrically similar prototypes (isopointal) in the Encyclopedia. Uploaded geometries can be in a variety of common file formats (auto-detected), namely, VASP <ref type="bibr">[44]</ref><ref type="bibr">[45]</ref><ref type="bibr">[46]</ref>, Quantum ESPRESSO <ref type="bibr">[47]</ref>, FHI-AIMS <ref type="bibr">[48]</ref>, ABINIT <ref type="bibr">[49]</ref>, WYCCAR <ref type="bibr">[7]</ref>, CIF, and ELK <ref type="bibr">[50]</ref>. If an uploaded structure does not match to an existing AFLOW prototype, a message will appear on the web page. Users are encouraged to fill out the displayed submission form providing references/citations to the structure, any notable comments about the structure, and user information (optional). Submission of new structures will be considered for possible inclusion into future iterations of the Prototype Encyclopedia. Future outlook. The Prototype Encyclopedia continues to be expanded, with updates to the database posted periodically. Over 600 new structures are already prepared for future release. We invite users to submit new structures for inclusion in the database. See The AFLOW Prototype Encyclopedia: Part 3, <ref type="bibr">[39]</ref> and AFLOW-XtalFinder at aflow. org/prototype-encyclopedia/xtal-finder.html for more information.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Web applications</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.1.">AFLOW Online</head><p>Autonomous computational frameworks such as AFLOW are powerful tools with a rich diversity of features. However, the size of the codebase presents a large barrier to entry to users who wish to use only specific functionality, only use these functions occasionally, or are not experienced with using a command line. A graphical user interface, ideally available online, is more suitable for these use cases. AFLOW Online is such a platform and implements many commonlyused features of the AFLOW software. It can be accessed at aflow.org/ aflow-online/.</p><p>The interface consists of a structure input section, a section containing select AFLOW features, and an output section. The structure input section (Fig. <ref type="figure">5(a)</ref>) contains the structure that is used to perform the AFLOW calculation. It supports all formats that the AFLOW software supports, which includes VASP <ref type="bibr">[44]</ref><ref type="bibr">[45]</ref><ref type="bibr">[46]</ref>, Quantum ESPRESSO <ref type="bibr">[47]</ref>, ABINIT <ref type="bibr">[49]</ref>, ELK <ref type="bibr">[50]</ref>, FHI-AIMS <ref type="bibr">[48]</ref>, and Crystallographic Information Files (CIF). In a separate tab, a PARTCAR file for the features of AFLOW's Partial OCCupation module (AFLOW-POCC) can be entered <ref type="bibr">[6]</ref>. Magnetic moments can be added to the structure by entering the spins as a comma-separated list into the ''Magnetic moments'' field after activating the ''Magnetic Structure'' button. There are two other settings in this section: the ''Output Type'' drop-down menu can toggle between free text and JSON output (where available), and ''Symmetry Tol''. sets the symmetry tolerance to either ''tight'' or ''loose'', as defined in the AFLOW-SYM module <ref type="bibr">[7]</ref>. Below the structure input section are all AFLOW modules that are available via AFLOW Online. They are categorized based on the type of tasks they perform. Fig. <ref type="figure">5</ref>(b) shows two example categories: symmetry and structure comparison <ref type="bibr">[7,</ref><ref type="bibr">8]</ref>. The symmetry section contains all features provided by AFLOW-SYM: it calculates space groups, Wyckoff positions, Pearson symbols, prototype labels, symmetry groups, information on both direct and reciprocal lattice, and equivalency between atoms.</p><p>The structure comparison section is the interface to AFLOW-XtalFinder <ref type="bibr">[8]</ref>. It provides the options to compare the input structure to the AFLOW database and the prototype encyclopedia, as well as determining unique atom decorations, in a simple buttondrop-down setup. The application for comparing multiple structures is different: the button and drop down select the comparison mode (structure and material), but the structure is not taken from the input structure section. Instead, it has its own section where multiple inputs can be added in separate tabs. Magnetic moments can also be added for each structure using the ''Structures are magnetic'' button nearby.</p><p>To submit a calculation, the button for the feature of interest needs to be clicked -tooltips are available to help users select the appropriate functionality. Some options require additional input, e.g., through a drop-down menu. In the example shown in Fig. <ref type="figure">5</ref>(b) (''Lattice information''), users can choose between real space, reciprocal space, and a superlattice with equal decorations. Pressing the ''Submit'' button will start the calculation and the screen is scrolled to the output section where the results will be displayed once available. Fig. <ref type="figure">5(c</ref>) shows such a submission with the real space lattice information calculated by AFLOW for KFeO 2 . Output is stored for each submitted calculation and can be accessed by clicking on the corresponding tab -in this case, the prior calculation is a conversion of the input structure to the conventional unit cell. Submissions can be removed using the ''&#215;'' buttons.</p><p>AFLOW Online also provides tools to prepare ab-initio calculations. Its structure file manipulation tools allow conversion between different structure file formats, coordinate systems (Cartesian and fractional), and unit cell formats (primitive, AFLOW standard primitive <ref type="bibr">[5]</ref>, standard conventional, Niggli, and Minkowksi). It also has modules for calculating the dimensions of a k-point mesh based on k-points per reciprocal atom or a maximum distance in reciprocal space, and to output the standard k-point path for band structure calculations based on the AFLOW standard <ref type="bibr">[4]</ref>. The format for the k-points are in VASP's KPOINTS file format. Disordered materials can also be set up using AFLOW-POCC <ref type="bibr">[6]</ref>. Based on an input PARTCAR, the ideal supercell size can be determined, and the list of representative structures can be generated in POSCAR format. The latter can be converted into the desired format using the structure file manipulation tools.</p><p>Finally, the AFLOW Coordination Corrected Enthalpies (AFLOW-CCE) module <ref type="bibr">[9,</ref><ref type="bibr">10]</ref> is available as well. It can calculate the corrections to the formation enthalpy for ionic materials calculated from density functional theory and supports the PBE, LDA, and SCAN functionals. It requires knowledge of the oxidation states in the material, which AFLOW can determine automatically. However, users also have the option to provide these numbers directly. The determined oxidation states and the coordination numbers of the cations can also be output separately through the appropriate buttons.</p><p>AFLOW Online is thus a user-friendly tool that provides a variety of features implemented in AFLOW for structural analysis, to support ab-initio calculations, and to correct energies from density functional theory. It is designed to be used in low-throughput settings without installing the entire AFLOW framework and without requiring a command line. Thus, it is particularly suited for new users and for users with little or no scripting experience.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.2.">AFLOW-CHULL Online</head><p>The AFLOW-CHULL web application (aflow.org/aflow-chull) was developed to provide an enhanced, command-line-free platform for the AFLOW-CHULL module and was introduced in Ref. <ref type="bibr">[51]</ref>. The application offers an interactive visualization of binary and ternary convex hulls, enabling the selection and investigation of entries of interest. The ternary convex hull allows mouse-enabled pan and zoom, and has been a useful instructional tool for understanding the stability analysis and grasping higher dimensional hulls with multi-component systems. Hulls can be selected from a responsive periodic table interface that reacts to the input based on the data availability. The number of entries is used as an indicator of the convergence of the analysis <ref type="bibr">[51]</ref><ref type="bibr">[52]</ref><ref type="bibr">[53]</ref><ref type="bibr">[54]</ref>: green (fully reliable, &#119873; entries &#8805; 200), orange (potentially reliable, 100 &#8804; &#119873; entries &lt; 200), red (unreliable, &#119873; entries &lt; 100), and gray (unavailable, &#119873; entries = 0).</p><p>To accelerate loading speeds for repeated queries, a caching layer has been added. It saves the results of each request and loads them up automatically. The caching layer is cleared periodically and fresh calculations employing the most up-to-date database are triggered with a new request. Longer wait times for non-cached queries should be expected. Once the results are retrieved, the application loads the visualization viewport, prompting a redirect to the URL endpoint of the selected hull, e.g., /hull/BeOsV. While the URL is ubiquitous, it is dependent on the cached state at the time the query is submitted.</p><p>Users can select and highlight points on both binary and ternary hulls. When a point is selected, its name will appear within the sidebar. If the selected point is above the convex hull (meta-stable), the hull visualization will highlight the distance to the hull (vertical line) and tie-line/tie-surface directly below formed by stable phases (Figs. <ref type="figure">6(a</ref>) and (b)), corresponding to the decomposition reaction in the thermodynamic limit. For example, in Fig. <ref type="figure">6</ref>(a), the reduced decomposition reaction is Ag 0.375 Au 0.625</p><p>where the distance to the hull is the energy gained from phase separation. If the selected point is on the convex hull (stable), the hull visualization will show the stability criterion <ref type="bibr">[51,</ref><ref type="bibr">55]</ref> (Figs. <ref type="figure">6(c</ref>) and (d)): the distance of the point from the pseudo-hull constructed without it, illustrated by the dotted lines and blue surfaces on binary and ternary hulls, respectively. This distance quantifies the effect of the phase on the convex hull, as well as its susceptibility to destabilization by a new phase that has yet to be explored. The stability criterion helped find two new magnetic phases, the first ever discovered with computation <ref type="bibr">[55]</ref>. Clicking on the name of one of the selected points on the sidebar will bring up the information component (Fig. <ref type="figure">6</ref>(e)), providing a selection of properties, such as the distance to the hull and stability criterion, with a link to the aflow.org entry page (''More Info'') offering the full set of properties for the entry. New to the stability analysis is the calculation of the &#119873; + 1 enthalpy gain <ref type="bibr">[56]</ref>: the distance to the hull for an &#119873;-compound (binaries are 2-compounds) from the pseudo-hull constructed with {1, &#8230; , &#119873; -1}compounds. The descriptor was used to establish the role of disorder in stabilizing multi-component systems <ref type="bibr">[56]</ref>. It was shown that, with an increasing number of species, the average enthalpy gained rapidly  diminishes with respect to the configurational entropy gained. The descriptor corresponds to the formation enthalpy for binaries, and for ternaries becomes the distance from the pseudo-hull constructed with the full set of unaries (&#119860;, &#119861;, and &#119862;) and binaries (&#119860; -&#119861;, &#119861; -&#119862;, and &#119860; -&#119862;). An example visualization of the analysis is provided in Fig. <ref type="figure">7</ref>, where the full AsCoTi convex hull and its {1, 2}-pseudo-hull are shown in Figs. <ref type="figure">7(a</ref>) and (b), respectively. A stable ternary, not included in the pseudo-hull construction, is highlighted in red. The pseudo-hull can be visualized from the Options menu.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.3.">AFLOW-ML Online</head><p>The AFLOW machine learning (AFLOW-ML) online application provides a user interface to leverage machine-learning models trained on AFLOW data, as shown in Fig. <ref type="figure">8</ref>. The application can be accessed through a web browser at aflow.org/aflow-ml, or through the AFLOW-ML API described below. The application takes in structural and/or compositional information as an input and outputs predictions, including electronic, thermomechanical, and superconducting properties.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>M. Esters et al.</head><p>This application provides an accessible medium to retrieve machine learning predictions without the need to install specialized software libraries or machine learning packages.</p><p>Currently, AFLOW-ML supports three different machine-learning models: i. The property-labeled materials fragments model [57], plmf, has been trained using data from the aflow.org repository, and predicts properties such as the electronic band gap, specific heat capacities, bulk and shear moduli, Debye temperature, and coefficient of thermal expansion. It is based on structural information, using a Voronoi tessellation to identify bonded or ''connected'' atoms to form a crystal graph, and thus requires a structure file describing the material. Atoms in the graph are labeled with their corresponding elemental properties, and pieces of the graph form the property-labeled materials fragments that form the feature vector for the model. ii. The molar fraction descriptor model <ref type="bibr">[58]</ref>, mfd, predicts vibrational properties such as vibrational free energy and entropy, and is based only on the chemical composition of the material. iii. The AFLOW Superconductor model [59], asc, predicts the superconducting critical temperature of materials based on their chemical composition, and accepts the chemical formula as input.</p><p>Using the online application involves, first, selecting the required model from the three listed and then, posting either a structure file (in the case of the plmf and mfd models) or a chemical formula (for asc) into the ''Input'' box, as illustrated in Fig. <ref type="figure">8(a)</ref>. Structures can be in any format that AFLOW can read, including VASP POSCAR <ref type="bibr">[44]</ref><ref type="bibr">[45]</ref><ref type="bibr">[46]</ref>, Quantum ESPRESSO <ref type="bibr">[47]</ref>, ABINIT <ref type="bibr">[49]</ref>, ELK <ref type="bibr">[50]</ref>, FHI-AIMS <ref type="bibr">[48]</ref>, and Crystallographic Information Files (CIF) <ref type="bibr">[60]</ref>. Additionally, structures within the aflow.org repository can be imported via the sidebar. The structure is then submitted and the model is run using the ''RUN PREDICTION'' button. When the prediction is complete (which usually takes several seconds, depending on the material, model, and the demand on the server), a card will be displayed containing the predicted property values, as shown in Fig. <ref type="figure">8(b)</ref>.</p><p>The AFLOW-ML API <ref type="bibr">[61]</ref> offers programmatic access to the AFLOW-ML online application, and provides a simplified abstraction that facilitates leveraging powerful machine-learning models. This distills the prediction process down to its essence: from a structure file, return a prediction. Using the API is a two-step process as illustrated in Fig. <ref type="figure">8(c</ref>): first a structure file is posted (i.e. uploaded) to the endpoint &lt;server&gt;/&lt;model&gt;/prediction on the aflow.org server using standard HTTP libraries or dedicated programs such curl or wget, where &lt;server&gt; is aflow.org/API/aflow-ml, and &lt;model&gt; specifies the machine-learning model to use in the prediction (current options: plmf and mfd). For example, the full endpoint for uploading for the plmf model would be aflow.org/API/aflowml/plmf/prediction. When a prediction is submitted, a JSON response object is returned that includes a task id. Second, the results of the prediction need to be retrieved from the /prediction/result/ endpoint on the aflow.org server by appending the task id to the end of the URL, i.e. /prediction/result/{id}/. This endpoint monitors the prediction task and responds with a JSON object that details its status. When complete, the endpoint responds with the results of the prediction, represented as a JSON object containing a key-value pair for each predicted property.</p><p>The AFLOW-ML API can also be accessed using the AFLOW-ML Python client, which is available for download at aflow.org/src/aflowml/. Upon installation, the Python client can be accessed through the command-line interface using the command aflow-ml. Predictions tasks can be submitted by specifying the structure file name and the machine-learning model. For example, a structure in the file test.poscar can be submitted to the plmf machine-learning model using the command aflowml test.poscar --model=plmf, where the model is specified using the flags --model or -m. The results can be saved to a file usng the flag -s or --save, and the output filename can be specified using the flag --outfile (the default filename is prediction.txt). The output format can be set using the -output flag, while the output can be limited to specific predicted fields using the --fields flag. Verbose mode can be activated using the flags -v or --verbose.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Education and outreach</head><p>In addition to the rational data dissemination discussed above, the AFLOW consortium is engaged in global education and outreach activities to train the next cadre of materials researchers. AFLOW Schools. We offer tutorials, virtual and in-person, teaching the structure of our repositories and the strategies to retrieve data to a diverse audience. Particular emphasis is put on the discussion of the features of the AFLOW C++ code and python interfaces, so that they can be easily adapted into workflows. These schools cover the AFLOW code <ref type="bibr">[1]</ref>, DFT, symmetry determination, prototypes, convex hulls, partial occupation, coordination corrected enthalpies, elastic and Gibbs library extensions, database integration, machine learning within AFLOW, prototype crystal finder, and harmonic and quasiharmonic phonon calculations. Videos, slides presentations, and other educational materials are available online at aflow.org/aflow-school/. AFLOW Seminars. We organize biweekly free virtual scientific seminars in the field of materials science and materials physics. Speakers are experts in experimental, theoretical or computational techniques, covering a variety of sub-fields. These seminars attract a global audience from over 50 countries on five continents, providing access to scientific talks that would otherwise require considerable resources to attend. More information can be found online at aflow.org/seminars/.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.">Summary</head><p>This article describes the aflow.org web presence, an ecosystem of databases and online applications for the AFLOW repository and software. Dissemination of the AFLOW data follows the FAIR principles for data stewardship. Access methods include graphical and programmatic as well as low-throughput and high-throughput retrieval methods, covering a wide range of user experience and usage requirements. aflow.org also aids the generation of new data through its Prototype Encyclopedia -a collection of over 1,100 crystallographic prototypes -and provides tools to create structure files in a variety of formats. aflow.org offers applications for interacting with many AFLOW software modules and machine learning methods, easing the use by students and researchers of all experience levels. Educational efforts such as AFLOW Schools and free seminars further support the effort to make AFLOW and materials science research accessible to a global audience.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>M. Esters et al.</head><p>Wyckoff_site_symmetries <ref type="bibr">[7]</ref> Returns the Wyckoff symmetry of each site for the relaxed structure.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Unit: none</head><p>Type: strings Example: Wyckoff_site_symmetries=m..,.-3.,m-3. Wyckoff_site_symmetries_orig <ref type="bibr">[7]</ref> Returns the Wyckoff symmetry of each site for the unrelaxed structure.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Unit: none</head><p>Type: strings Example: Wyckoff_site_symmetries_orig=m..,.-3.,m-3.</p></div></body>
		</text>
</TEI>
