User:Itub/Chembox property count 1
The following tables were obtained by parsing 4475 chemboxes (13 were excluded because my parser decided that they had syntax errors; see the "Syntax errors" section below).
Property count
This is a list of how many chemboxes provide a given property.
Property | Chemboxes |
Formula | 4205 |
MolarMass | 4159 |
CASNo | 3869 |
ImageFile | 3680 |
IUPACName | 3368 |
SMILES | 2750 |
ImageSize | 2490 |
OtherNames | 2487 |
MeltingPt | 2234 |
PubChem | 1954 |
Density | 1878 |
Appearance | 1800 |
Name | 1772 |
BoilingPt | 1537 |
Solubility | 1407 |
ImageName | 1189 |
RPhrases | 750 |
SPhrases | 738 |
ImageFile1 | 613 |
NFPA-H | 593 |
FlashPt | 589 |
MainHazards | 577 |
MeSHName | 506 |
RTECS | 497 |
NFPA-F | 495 |
EUClass | 494 |
OtherCpds | 494 |
NFPA-R | 433 |
ImageSize1 | 413 |
ExternalMSDS | 402 |
ImageFileL1 | 361 |
ImageFileR1 | 361 |
EINECS | 321 |
Reference | 316 |
ImageSizeR1 | 305 |
ImageSizeL1 | 301 |
Solvent | 295 |
SolubleOther | 292 |
Function | 290 |
ImageName1 | 271 |
OtherFunctn | 268 |
ImageFile2 | 232 |
CrystalStruct | 229 |
OtherCations | 228 |
OtherAnions | 227 |
InChI | 221 |
MeltingPtC | 198 |
C | 184 |
H | 182 |
Dipole | 176 |
ChemSpiderID | 173 |
O | 159 |
ImageSize2 | 152 |
Coordination | 149 |
BoilingPtC | 148 |
pKa | 139 |
MolShape | 134 |
Viscosity | 123 |
ImageNameL1 | 122 |
ImageNameR1 | 117 |
CASOther | 115 |
N | 111 |
Autoignition | 107 |
Abbreviations | 97 |
DeltaHf | 80 |
NFPA-O | 74 |
VaporPressure | 62 |
RefractIndex | 58 |
Entropy | 57 |
ImageName2 | 43 |
ChEBI | 42 |
ExploLimits | 35 |
ImageFileL2 | 35 |
ImageFileR2 | 35 |
KEGG | 35 |
ImageSizeR2 | 31 |
UNNumber | 31 |
Boiling_notes | 30 |
pKb | 30 |
ImageSizeL2 | 28 |
LogP | 28 |
RSPhrases | 28 |
Melting_notes | 24 |
Solubility1 | 24 |
Solvent1 | 24 |
Solvent2 | 23 |
S | 22 |
Solubility2 | 22 |
ExplosiveV | 21 |
ShockSens | 20 |
FrictionSens | 19 |
SystematicName | 18 |
DeltaHc | 17 |
LD50 | 17 |
Cl | 16 |
EUIndex | 15 |
Other | 15 |
AdminRoutes | 13 |
HenryConstant | 12 |
ATCCode_prefix | 10 |
ATCCode_suffix | 10 |
REFactor | 10 |
ImageNameR2 | 9 |
SolubilityOther | 9 |
ImageNameL2 | 8 |
P | 8 |
PEL | 8 |
Reactivity | 8 |
Structure | 8 |
Excretion | 7 |
HalfLife | 7 |
ImageFile3 | 7 |
MolarMass_notes | 7 |
Solubility in water | 7 |
Bioavail | 6 |
I | 6 |
SpaceGroup | 6 |
reference | 6 |
DrugBank | 5 |
F | 5 |
InChIKey | 5 |
K | 5 |
Legal_US | 5 |
MSDS | 5 |
MeltingPtCL | 5 |
Solubility3 | 5 |
Solvent3 | 5 |
SpecRotation | 5 |
Beilstein | 4 |
Br | 4 |
FEMA | 4 |
Flammability | 4 |
HenryLaw | 4 |
ImageSize3 | 4 |
MeltingPtCH | 4 |
MeltingPtK | 4 |
Metabolism | 4 |
Si | 4 |
AtmosphericOHRateConstant | 3 |
B | 3 |
CASNos | 3 |
CoE | 3 |
ECNo | 3 |
HeatCapacity | 3 |
JECFA | 3 |
Odor | 3 |
OtherCmpds | 3 |
References | 3 |
Solubility4 | 3 |
SolubilityProduct | 3 |
Solvent4 | 3 |
ThermalConductivity | 3 |
Acidity (pK<sub>a</sub>) | 2 |
BandGap | 2 |
BoilingPtCH | 2 |
BoilingPtCL | 2 |
BoilingPtK | 2 |
CAS | 2 |
Cd | 2 |
ChemSpider | 2 |
Contact | 2 |
Cr | 2 |
Cs | 2 |
ElectronMobility | 2 |
FlashPtC | 2 |
IUPAC | 2 |
ImageNamL1e | 2 |
ImageName3 | 2 |
ImageSzieR1 | 2 |
LambdaMax | 2 |
LattConst_a | 2 |
Legal_UK | 2 |
Legal_status | 2 |
MagSus | 2 |
Na | 2 |
Other names | 2 |
PregCat_US | 2 |
ProteinBound | 2 |
Refractive index | 2 |
U | 2 |
USAFNo | 2 |
pKsp | 2 |
PubChem | 1 |
3DMet | 1 |
ACXNumber | 1 |
AT<link rel | 1 |
ATCCode | 1 |
ATC_Supplemental | 1 |
ATC_prefix | 1 |
ATC_suffix | 1 |
Acidity (K<sub>a</sub>) | 1 |
Ag | 1 |
Bandgap | 1 |
BoilingPtF | 1 |
Boilingpt | 1 |
Boilink_notes | 1 |
C1( | 1 |
CASno | 1 |
CID | 1 |
Ca | 1 |
Canonical SMILES: C1 | 1 |
Canonical SMILES: CCC | 1 |
Carcinogenic | 1 |
ChemspiderID | 1 |
Co | 1 |
CoefficientThermalExpansion | 1 |
CommonName | 1 |
Cp | 1 |
Critical Field | 1 |
CriticalRelativeHumidity | 1 |
Dependency_liability | 1 |
Dielectric constant | 1 |
Diffusion coefficient | 1 |
DispInChI | 1 |
EINEC | 1 |
EINECS/ELINCS | 1 |
EINECSNo | 1 |
Enthalpy Change of Formation | 1 |
Enthalpy of Formation | 1 |
EvaporationRate | 1 |
External MSDS | 1 |
EyeHazard | 1 |
FDA | 1 |
Formual | 1 |
Friction_sensitivity | 1 |
Ga | 1 |
Glass trans Pt | 1 |
Gmelin | 1 |
Hazards | 1 |
IUPACNAME | 1 |
IUPACNAme | 1 |
IUPACname | 1 |
ImageNamel1 | 1 |
ImageSize:1 | 1 |
InC<link rel | 1 |
InChI: | 1 |
InChI: InChI | 1 |
InChi | 1 |
Index of Refraction | 1 |
IngestionHazard | 1 |
InhalationHazard | 1 |
Isomeric SMILES: C1 | 1 |
Isomeric SMILES: CC\C | 1 |
LattConst_b | 1 |
LattConst_beta | 1 |
LattConst_c | 1 |
Lattice Energy | 1 |
MagneticSusceptibility | 1 |
Main Hazard | 1 |
MainHazards<link rel | 1 |
Mass | 1 |
MassRound | 1 |
Melting point | 1 |
MeltingPtC_notes | 1 |
Meltingpt | 1 |
Molar Mass | 1 |
Molar mass | 1 |
Ni | 1 |
OpticalActivity | 1 |
OrbitalHybridisation | 1 |
Other Names | 1 |
Othernames | 1 |
PregCat | 1 |
Pt | 1 |
PubMed | 1 |
Pubchem | 1 |
Purity | 1 |
R-Phrases | 1 |
R-phrases | 1 |
R<link rel | 1 |
RPhrsaes | 1 |
RefractiveIndex | 1 |
S-Phrases | 1 |
S-phrases | 1 |
Section 2 | 1 |
Section 3 | 1 |
Section 4 | 1 |
Seection7 | 1 |
Shock_sensitivity | 1 |
SkinHazard | 1 |
SolubileOther | 1 |
Solubility Constant | 1 |
SolubilityProductAs | 1 |
Solubility_other | 1 |
SolubleOther2 | 1 |
Specific Heat | 1 |
Sr | 1 |
SublimationConditions | 1 |
Supplier | 1 |
Surface Tension | 1 |
Tb | 1 |
Te | 1 |
TriplePoint | 1 |
UNNo | 1 |
Vapor Density | 1 |
Vapor Pressure | 1 |
VapourPressure | 1 |
Water Solubility | 1 |
Wavelength Absorbtion | 1 |
Website | 1 |
Young's Modulus | 1 |
abbreviations | 1 |
align | 1 |
crystal structure | 1 |
density | 1 |
jizz in my eyes Name | 1 |
pH | 1 |
pKa1 | 1 |
pubchem | 1 |
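The counting behind the table above can be sketched roughly as follows. This is not the parser that produced this page. It assumes the chemboxes have already been parsed into a hypothetical mapping from article title to the property names found in that chembox; `count_properties`, `format_rows` and the example input are illustrative only.

```python
from collections import Counter


def count_properties(chemboxes):
    """Count how many chemboxes provide each property.

    `chemboxes` is a hypothetical mapping: article title -> property names.
    """
    counts = Counter()
    for names in chemboxes.values():
        # set() so a property repeated inside one box is counted only once
        counts.update(set(names))
    return counts


def format_rows(counts):
    """Rows sorted by descending count, then by name in plain string order."""
    ordered = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return [f"{name} | {n} |" for name, n in ordered]


# Made-up input for illustration; not data from the real parse.
example = {
    "Water": ["Formula", "MolarMass", "Density"],
    "Ethanol": ["Formula", "MolarMass", "CASNo"],
    "Ascofuranone": ["Formual", "CASNo"],
}
for row in format_rows(count_properties(example)):
    print(row)
```

Property names are compared exactly, so misspellings and case variants (Formual, Pubchem, pubchem) end up as separate rows, which is what makes them easy to spot in the table.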
Properties used by only one chembox
This is a list of all the articles that have a chembox property that is used only once (probably a mistake).
Article | Property | Count |
1,4,7-Triazacyclononane | ACXNumber | 1 |
Acrylic_glass | Young's Modulus (No analogous field?) | 1 |
Alpha-Parinaric_acid | Canonical SMILES: CCC (How do we handle multiple SMILES strings for a compound?) | 1 |
Alpha-Parinaric_acid | Isomeric SMILES: CC\C (How do we handle multiple SMILES strings for a compound?) | 1 |
Aluminium_nitride | CoefficientThermalExpansion | 1 |
Ammonium_sulfate | CriticalRelativeHumidity | 1 |
Arsenic_trioxide | ATC_prefix | 1 |
Arsenic_trioxide | ATC_suffix | 1 |
Ascofuranone | Formual | 1 |
Bentiromide | ATCCode | 1 |
Benzoic_acid | 3DMet | 1 |
Benzoic_acid | Gmelin | 1 |
Biuret | External MSDS | 1 |
Brilliant_Blue_FCF | IUPACname | 1 |
Butylated_hydroxyanisole | Section 4 | 1 |
Butyraldehyde | CommonName | 1 |
Cadmium_oxide | Bandgap | 1 |
Cadmium_telluride | Te | 1 |
Calcium_fluoride | EyeHazard | 1 |
Calcium_fluoride | IngestionHazard | 1 |
Calcium_fluoride | InhalationHazard | 1 |
Calcium_fluoride | SkinHazard | 1 |
Calcium_nitride | Ca | 1 |
Carbon_disulfide | Hazards | 1 |
Chromium(III)_oxide | Carcinogenic | 1 |
Citral | Vapor Density | 1 |
Citral | Vapor Pressure | 1 |
Cobalt(III)_hexammine_chloride | Wavelength Absorbtion | 1 |
Cobalt(III)_oxide | MainHazards<link rel | 1 |
Copper(II)_oxide | LattConst_b | 1 |
Copper(II)_oxide | LattConst_beta | 1 |
Copper(II)_oxide | LattConst_c | 1 |
Coumaric_acid | Canonical SMILES: C1 | 1 |
Coumaric_acid | InChI: InChI | 1 |
Coumaric_acid | Isomeric SMILES: C1 | 1 |
Coumaric_acid | Molar mass | 1 |
Cyclohexanone | Othernames | 1 |
D-chiro-inositol | OpticalActivity | 1 |
DMPU | RefractiveIndex | 1 |
DOBU | C1( | 1 |
Dodecane | Diffusion coefficient | 1 |
Dodecane | Surface Tension | 1 |
Erythravine | Mass | 1 |
Ethylamine | Acidity (K<sub>a</sub>) | 1 |
Fumaric_acid | FDA | 1 |
Gallium(III)_phosphide | Ga | 1 |
Glycyrrhizin | ATC_Supplemental | 1 |
Hydroxynaphthol_blue | Water Solubility | 1 |
Hyperforin | Pubchem | 1 |
Iodic_acid | ImageSize:1 | 1 |
Iron(II)_chloride | SolubleOther2 | 1 |
Isoguanine | EINEC | 1 |
Lead(II)_nitrate | MassRound | 1 |
Lithium_cobalt_oxide | R-phrases | 1 |
Lithium_cobalt_oxide | S-phrases | 1 |
Lithium_nitride | Section 3 | 1 |
Mandelonitrile | AT<link rel | 1 |
Mandelonitrile | InC<link rel | 1 |
Mandelonitrile | R<link rel | 1 |
Methacryloyl_chloride | BoilingPtF | 1 |
Michler's_ketone | Melting point | 1 |
N-Formylmethionine | abbreviations | 1 |
Nickel_iodide | Ni | 1 |
Nitrogen_triiodide | Friction_sensitivity | 1 |
Nitrogen_triiodide | Shock_sensitivity | 1 |
Octodecaborane | crystal structure | 1 |
Pentaerythritol | Solubility_other | 1 |
Potassium_aluminium_fluoride | RPhrsaes | 1 |
Potassium_bitartrate | Solubility Constant | 1 |
Potassium_hexachloroplatinate | Pt | 1 |
Potassium_sulfate | Boilingpt | 1 |
Potassium_sulfate | Meltingpt | 1 |
Potassium_sulfate | jizz in my eyes Name | 1 |
Ruthenium(III)_chloride | align | 1 |
Silver_bromide | Dielectric constant | 1 |
Silver_chloride | Ag | 1 |
Silver_oxide | SolubilityProductAs | 1 |
Sodium_chloride | Index of Refraction | 1 |
Sodium_cobaltinitrite | Co | 1 |
Sodium_cyanide | Main Hazard | 1 |
Sodium_hexafluoroaluminate | EvaporationRate | 1 |
Sodium_hydrosulfide | IUPACNAme | 1 |
Sodium_hydrosulfide | Section 2 | 1 |
Sodium_nitroprusside | PregCat | 1 |
Sodium_oxide | Enthalpy Change of Formation | 1 |
Sodium_oxide | Lattice Energy | 1 |
Sodium_perchlorate | Seection7 | 1 |
Sodium_sulfide | density | 1 |
Sorbitan_tristearate | InChi | 1 |
Stearic_acid | Specific Heat | 1 |
Strontium_nitride | Sr | 1 |
Styrene_maleic_anhydride | Glass trans Pt | 1 |
Terbium(III)_oxide | MagneticSusceptibility | 1 |
Terbium(III)_oxide | Tb | 1 |
Terephthalic_acid | TriplePoint | 1 |
Terephthalic_acid | pKa1 | 1 |
Tetracyanoethylene | SublimationConditions | 1 |
Thebaine | Dependency_liability | 1 |
Thiophenol | SolubileOther | 1 |
Tributylphosphine | Boilink_notes | 1 |
Tributylphosphine | DispInChI | 1 |
Triethanolamine | VapourPressure | 1 |
Triethylborane | CASno | 1 |
Trimethylaluminium | Purity | 1 |
Trimethylaluminium | Supplier | 1 |
Trimethylaluminium | Website | 1 |
Tris(2-aminoethyl)amine | EINECS/ELINCS | 1 |
Uranium(III)_chloride | OrbitalHybridisation | 1 |
Yttrium_barium_copper_oxide | Critical Field | 1 |
Zirconium(IV)_chloride | Cp | 1 |
Zirconyl_chloride | MeltingPtC_notes | 1 |
Zirconyl_chloride | Other Names | 1 |
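A list like the one above can be derived from the same kind of data by keeping only the properties whose count is 1. The sketch below is an assumption about how that could be done, not the code actually used; `single_use_properties` is a hypothetical name and the input mapping (article title to property names) is made up.

```python
from collections import Counter


def single_use_properties(chemboxes):
    """Return sorted (article, property) pairs for properties used by one box.

    `chemboxes` is a hypothetical mapping: article title -> property names.
    """
    counts = Counter()
    for names in chemboxes.values():
        counts.update(set(names))  # one vote per chembox
    rows = [
        (title, name)
        for title, names in chemboxes.items()
        for name in set(names)
        if counts[name] == 1
    ]
    return sorted(rows)  # by article title, then by property name


# Made-up input for illustration only.
example = {
    "Potassium_sulfate": ["Formula", "Boilingpt", "Meltingpt"],
    "Sodium_sulfide": ["Formula", "density"],
    "Water": ["Formula"],
}
for title, name in single_use_properties(example):
    print(f"{title} | {name} | 1 |")
```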
Syntax errors
These are the articles that produced syntax errors. That can be either because the chembox is wrong or because my parser is wrong (but a failure rate of only 0.3% is not that bad!).
- 2-Phenylhexane
- 3-Thiophene_acetic_acid
- Chromium_carbonyl
- Diethyl_sulfoxide
- Dimethyl_methylphosphonate
- Holmium(III)_chloride
- Miotine
- Palladium(II)_chloride
- Polylysine
- Silicon_dioxide
- Sulfanilic_acid
- Tantalum(III)_aluminide
After updating the parser, the list is reduced to only Sulfanilic_acid and Holmium(III)_chloride. I haven't updated the tables above yet, but the tables below exclude only these two articles. Both of these articles have what I would call errors in the infoboxes.
- Sulfanilic_acid--my bad! Fixed. DMacks (talk) 02:34, 22 October 2008 (UTC)
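For reference, the 0.3% failure rate quoted in this section follows from the numbers at the top of the page. The short check below assumes the 13 excluded chemboxes are not part of the 4475 that were parsed; the rounded result is the same under either reading. `failure_rate` is just an illustrative helper.

```python
def failure_rate(excluded, parsed):
    """Percentage of chemboxes rejected, assuming excluded is not in parsed."""
    return 100 * excluded / (parsed + excluded)


parsed = 4475    # chemboxes parsed successfully (from the top of this page)
excluded = 13    # chemboxes rejected by the first version of the parser

print(f"{failure_rate(excluded, parsed):.1f}%")  # roughly 0.3%
```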
Count for main part of the box only
This excludes all the properties listed under "Section#=".
Property | Chemboxes |
ImageFile | 3684 |
IUPACName | 3374 |
OtherNames | 2492 |
Name | 1775 |
ImageSize | 1706 |
ImageName | 1189 |
ImageFile1 | 613 |
ImageFileL1 | 361 |
ImageFileR1 | 361 |
Reference | 315 |
ImageSizeL1 | 295 |
ImageSizeR1 | 295 |
ImageName1 | 270 |
ImageSize1 | 243 |
ImageFile2 | 232 |
ImageSize2 | 140 |
ImageNameL1 | 123 |
ImageNameR1 | 117 |
Formula | 93 |
CASNo | 44 |
ImageName2 | 42 |
ImageFileL2 | 35 |
ImageFileR2 | 35 |
MolarMass | 34 |
Abbreviations | 31 |
ImageSizeR2 | 30 |
ImageSizeL2 | 28 |
PubChem | 21 |
SMILES | 19 |
SystematicName | 18 |
MeltingPt | 15 |
BoilingPt | 13 |
Density | 13 |
ImageNameR2 | 9 |
ImageNameL2 | 8 |
Solubility | 8 |
Solvent | 8 |
ImageFile3 | 7 |
reference | 6 |
EINECS | 5 |
ChemSpiderID | 3 |
ImageSize3 | 3 |
InChI | 3 |
References | 3 |
Appearance | 2 |
IUPAC | 2 |
ImageNamL1e | 2 |
ImageName3 | 2 |
ImageSzieR1 | 2 |
Other names | 2 |
Acidity (pK<sub>a</sub>) | 1 |
CAS | 1 |
Canonical SMILES: C1 | 1 |
CommonName | 1 |
IUPACNAME | 1 |
IUPACNAme | 1 |
IUPACname | 1 |
ImageNamel1 | 1 |
InChI: InChI | 1 |
Isomeric SMILES: C1 | 1 |
Molar mass | 1 |
NFPA-F | 1 |
NFPA-H | 1 |
NFPA-O | 1 |
NFPA-R | 1 |
Other Names | 1 |
OtherCpds | 1 |
OtherFunctn | 1 |
Othernames | 1 |
RPhrases | 1 |
RefractIndex | 1 |
SPhrases | 1 |
Section 2 | 1 |
Section 3 | 1 |
Section 4 | 1 |
Seection7 | 1 |
jizz in my eyes Name | 1 |
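One way to restrict the count to the main part of the box is to look only at the parameters at the top level of the template and skip anything nested inside a "Section#=" sub-template. The sketch below is a guess at how that could work, not the parser used for this page. It assumes a chembox of the form `{{Chembox | Name = ... | Section1 = {{Chembox Identifiers | ... }} }}`; the function names and sample wikitext are hypothetical.

```python
import re


def top_level_params(wikitext):
    """Return {name: value} for parameters at depth 1 of the first template."""
    i = wikitext.find("{{")
    if i < 0:
        raise ValueError("no template found")
    depth = 0          # nesting level of {{ ... }}
    link = 0           # nesting level of [[ ... ]], whose pipes are not separators
    parts, current = [], []
    while i < len(wikitext):
        pair = wikitext[i:i + 2]
        if pair == "{{":
            depth += 1
            if depth > 1:
                current.append(pair)
            i += 2
        elif pair == "}}":
            depth -= 1
            if depth == 0:
                parts.append("".join(current))
                break
            current.append(pair)
            i += 2
        elif pair == "[[":
            link += 1
            current.append(pair)
            i += 2
        elif pair == "]]" and link > 0:
            link -= 1
            current.append(pair)
            i += 2
        else:
            if wikitext[i] == "|" and depth == 1 and link == 0:
                parts.append("".join(current))   # end of one parameter
                current = []
            else:
                current.append(wikitext[i])
            i += 1
    else:
        raise ValueError("unbalanced braces")    # roughly a 'syntax error'
    params = {}
    for part in parts[1:]:                       # parts[0] is the template name
        name, sep, value = part.partition("=")
        if sep:
            params[name.strip()] = value.strip()
    return params


def main_part_names(wikitext):
    """Top-level property names, leaving out the Section1, Section2, ... slots."""
    return [name for name in top_level_params(wikitext)
            if not re.fullmatch(r"Section\d+", name)]


# Hypothetical chembox, written for this example.
sample = """{{Chembox
| Name = Water
| ImageFile = [[File:Water.png|thumb]]
| Section1 = {{Chembox Identifiers
| CASNo = 7732-18-5
}}
| Section 2 = {{Chembox Properties
| Formula = H2O
}}
}}"""
print(main_part_names(sample))
```

With a strict `Section\d+` match, a misspelled slot such as "Section 2" or "Seection7" is treated as an ordinary main-part property, which would explain why those names show up in the table above.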