9-Module Chemistry Engine

Formula & Structure Resolver

Resolve any molecular formula, structural formula, or compound name to a complete chemical profile — DBE, functional groups, aromaticity, candidate structures, stability ranking, and 4596 known reactions.

3-in-1 Input Formats
9 Analysis Modules
4596 Known Reactions
1800 Isomers Indexed
DBE Calculator Built-in
📘

What is a Formula & Structure Resolver?

Understanding the three representations of every organic compound

Every organic compound exists simultaneously as three representations: a common name, a molecular formula, and a structural formula. Students frequently face confusion when these three representations appear to describe different things — but they are mathematically identical. The Formula & Structure Resolver runs all three through a 9-module analysis pipeline to prove they describe the same compound and extract the complete chemical profile.

★ CLASSIC EXAMPLE — CINNAMALDEHYDE
Common Name
Cinnamaldehyde
Molecular Formula
C₉H₈O
Structural Formula
C₆H₅-CH=CH-CHO
All three representations → same compound → same chemical properties → same reactions

Core Formulas Used by the Resolver

DBE / Degree of Unsaturation
DBE = (2C+2+N−H−X) ÷ 2
C=carbons, H=hydrogens, N=nitrogen, X=halogens (F,Cl,Br,I). Each ring or π bond contributes 1 DBE. Benzene = 4 DBE.
Hill Notation (Molecular Formula)
C first → H second → A–Z
IUPAC standard: carbon first, hydrogen second, all other elements alphabetically. C₉H₈O is the Hill notation for cinnamaldehyde.
Aromaticity Rule (Hückel)
DBE ≥ 4 AND C ≥ 6
If DBE equals 4 or more and carbon count is 6 or more, a benzene ring is probable. Benzene itself has exactly DBE=4, C=6.
Molecular Weight Estimation
MW = Σ(atomic mass × count)
C=12.011, H=1.008, O=15.999, N=14.007, S=32.06, Cl=35.45, Br=79.904. Cinnamaldehyde MW = 132.16 g/mol.
DBE Interpretation
DBE=0 → saturated (alkane)
DBE=1: one C=C or C=O. DBE=2: diene/alkyne. DBE=4: benzene. DBE=6: benzene + 2 π bonds (cinnamaldehyde-type).
Structural → Molecular Conversion
Strip bonds → count atoms
Remove -, =, ≡, #, brackets. Count all element symbols and subscripts. C₆H₅-CH=CH-CHO → C:9, H:8, O:1 → C₉H₈O.

Formula & Structure Resolver — Live Calculator

Enter compound name, molecular formula, or structural formula below

⬡ 9-Module Resolution Engine — Active
Mol. formula: C9H8O Structural: C6H5-CH=CH-CHO Common name: cinnamaldehyde IUPAC name: 3-phenylprop-2-enal Any format ✓
Formula Parser · DBE Calculator · Aromaticity Detector
Functional Group Predictor · Skeleton Selector · Candidate Generator
Stability Ranker · Reaction DB Lookup · Final Output
📋

How to Use the Resolver

Step-by-step guide with real examples for every input format

1

Enter a Molecular Formula

Type the Hill-notation molecular formula directly. Only element symbols and numbers — no spaces needed.

Input: C9H8O
→ Resolver counts: C=9, H=8, O=1
→ DBE = (2×9+2+0−8−0)÷2 = 6
→ DBE=6 with C=9, O=1 → aromatic + conjugated carbonyl → Cinnamaldehyde
2

Enter a Structural Formula

Type the structural formula with dashes, double bonds, or bracket groups. The parser strips non-atom characters automatically.

Input: C6H5-CH=CH-CHO
→ Parser strips -, = → atoms: C=9, H=8, O=1
→ Hill formula: C9H8O → same as above → Cinnamaldehyde
3

Enter a Compound Name (Common or IUPAC)

Type any common name, IUPAC name, or known alias. The name index covers 400+ aliases.

Input: cinnamaldehyde (or cinnamic aldehyde, 3-phenylprop-2-enal, trans-cinnamaldehyde)
→ All map to: Hill formula C9H8O → same result
4

Enter Ketone Structural Formulas (Acetone Example)

All four of these structural notations for acetone are correctly parsed to the same compound:

C3H6O → DBE=1, O=1 → Ketone → Acetone
CH3COCH3 → strips to C3H6O → same
CH3-CO-CH3 → strips to C3H6O → same
CH3-C(=O)-CH3 → expands (=O) → C3H6O → same
5

Read the 9-Module Output

Click each accordion module to see the detailed reasoning. The best candidate is shown first with a confidence score. Click the Reaction Database Lookup module to see known reactions for that compound.

Output sections: Molecular Formula · Structural Formula · DBE · MW · Functional Groups
Modules: M1 Parser → M2 DBE → M3 Aromaticity → M4 FGs → M5 Skeleton → M6+M7 Candidates → M8 Reactions
6

Use Subscript Numbers (Unicode Support)

The parser handles both normal digits and Unicode subscripts. Either format works:

C₉H₈O (Unicode subscripts) = same as C9H8O (normal digits)
🎯

Aspects of Formula & Structure Resolver

Key capabilities and what makes this tool unique

Three-Format Input

Accepts molecular formula (C₉H₈O), structural formula (C₆H₅-CH=CH-CHO), or compound name (cinnamaldehyde) and resolves all three to identical output. No other format conversion is needed.

🔢

DBE Computation

Automatically calculates Degree of Unsaturation using the formula DBE=(2C+2+N−H−X)÷2. Provides step-by-step working with actual atom counts so students verify the computation manually.

Aromaticity Detection

Applies Hückel's rule proxy: DBE≥4 and C≥6 confirms a benzene ring. Also detects fused bicyclics (naphthalene, DBE=7) and polycyclics (anthracene, DBE=10).

🧪

Functional Group Prediction

Reads atom composition systematically: O=1+DBE→aldehyde/ketone; O=2→acid/ester; N+O→amide/nitro; S→thiol/sulfide; halogens→halides; P→phosphate. Each prediction includes chemical rationale.

🏗️

Skeleton Template Matching

Matches the formula against 14 carbon skeleton templates including: benzene ring+conjugated chain, fused bicyclic aromatic, pyridine/pyrrole heterocycles, cycloalkane rings, and linear/branched chains.

🏆

Stability-Ranked Candidates

Generates multiple candidate structures and ranks them by chemical stability: aromatic resonance (+8 pts), conjugation (+5), database match (+12), diazonium penalty (−20), strained rings (−5).

📚

4596-Reaction Database

Module 8 searches a 4596-entry reaction database covering aromatic EAS, aliphatic substitution, cyclic compounds, heterocyclics, and all major organic reaction classes to return known reactions for the resolved compound.

🔬

Transparent Reasoning

Every module shows its working in collapsible accordion panels. Students can follow exactly how the formula was parsed, how DBE was computed, why a particular functional group was predicted, and why one candidate ranks higher than another.

🌐

Hill Notation Normalisation

Converts every input to IUPAC Hill notation before database lookup — C first, H second, remaining elements alphabetical. This means CH₄O and HOCH₃ and methanol all hash to the same key (CH4O).


Uses of the Formula & Structure Resolver

🎓

Student Exam Preparation

Students can verify whether their structural formula interpretation of a given molecular formula is correct. Particularly useful for JEE, NEET, A-Level, and university-level organic chemistry where formula-to-structure problems appear regularly.

👨‍🏫

Teaching Aid

Instructors can demonstrate how the same compound is represented differently across textbooks, databases, and research papers. The live DBE calculation makes abstract formula-structure relationships visually tangible.

🔭

Research Compound Identification

When only a molecular formula is available from mass spectrometry (MS) data or elemental analysis, the resolver generates ranked candidate structures based on DBE, functional group analysis, and known compound databases.

⚗️

Reaction Planning

Once a compound is resolved, the Reaction Database Lookup (Module 8) immediately shows what reactions the compound undergoes, with reagents and mechanisms — accelerating synthetic route planning.

📊

Cross-Database Verification

Different databases use different naming conventions. The resolver normalises all representations to Hill notation, allowing confident cross-referencing between SciFinder, Reaxys, PubChem, and SDBS entries.

🌿

Natural Product Analysis

Flavonoids, terpenes, alkaloids, and phenylpropanoids often appear only as molecular formulas in NMR papers. The resolver quickly narrows candidate structures, helping identify compound classes in natural product chemistry.

7 Fully Solved Examples

Step-by-step resolution of common chemistry problems

Example 1

Cinnamaldehyde — The Three-Format Problem

A student is given three different representations and must prove they are the same compound.

Input FormatInput GivenHill FormulaResult
Molecular formulaC9H8OC9H8OCinnamaldehyde ✓
Structural formulaC6H5-CH=CH-CHOC9H8OCinnamaldehyde ✓
Common namecinnamaldehydeC9H8OCinnamaldehyde ✓
IUPAC name3-phenylprop-2-enalC9H8OCinnamaldehyde ✓
  • 1Atom count: C=9, H=8, O=1 → Hill: C9H8O
  • 2DBE = (2×9+2+0−8−0)÷2 = 6
  • 3DBE=4 from benzene + DBE=1 from C=C + DBE=1 from C=O (aldehyde)
  • 4Skeleton: benzene ring + α,β-conjugated side chain
  • 5Functional groups: aromatic, aldehyde, alkene, conjugated
Example 2

Acetone — Four Different Structural Notations

Students write ketone structures differently — all are valid and resolve identically.

NotationInputParsed AtomsResult
MolecularC3H6OC=3, H=6, O=1Acetone ✓
CondensedCH3COCH3C=3, H=6, O=1Acetone ✓
Dash notationCH3-CO-CH3C=3, H=6, O=1Acetone ✓
Explicit bondCH3-C(=O)-CH3C=3, H=6, O=1Acetone ✓
  • 1All four notations → same Hill formula C3H6O after stripping structural characters
  • 2DBE = (2×3+2+0−6−0)÷2 = 1 → exactly one C=O bond
  • 3O=1, no halogen, no nitrogen → aldehyde or ketone
  • 4MW = 3×12.011 + 6×1.008 + 15.999 = 58.08 g/mol
  • 5Key reactions: NaBH4→isopropanol, I2/NaOH→iodoform (positive), NH2NH2/KOH→propane (Wolff-Kishner)
Example 3

Benzene vs Cyclohexene — Same Carbon Count, Different DBE

Why C6H6 is aromatic but C6H10 is not — the DBE difference explains everything.

FormulaDBEAromaticityCompound
C6H64✓ AromaticBenzene (6π electrons)
C6H102✗ Not aromaticCyclohexene (1 ring + 1 C=C)
C6H121✗ Not aromaticCyclohexane (1 ring only)
C6H140✗ Saturatedn-Hexane (no rings)
  • 1Benzene: DBE=(12+2−6)÷2=4 → benzene ring confirmed by Hückel's rule
  • 2Each additional H₂ relative to the benzene formula reduces DBE by 1
  • 3DBE=4 is the minimum threshold for a benzene ring in a 6-carbon compound
Example 4

Glucose — When Formula Suggests Multiple Structures

C6H12O6 has two well-known isomers. The resolver correctly identifies both candidates.

FormulaDBECandidate 1Candidate 2
C6H12O61Glucose (aldehyde polyol)Fructose (ketone polyol)
  • 1DBE=(12+2−12)÷2=1 → one C=O group (aldehyde or ketone)
  • 26 oxygens → polyol (multiple –OH groups) + one carbonyl
  • 3No aromatic ring (DBE=1<4) → open-chain or ring polyol
  • 4Resolver returns both glucose (aldohexose) and fructose (ketohexose) as ranked candidates
Example 5

Aspirin — Identifying from Molecular Formula

Given only C9H8O4 in an exam — can you identify the drug?

StepAnalysisConclusion
Atom countC=9, H=8, O=4Hill: C9H8O4
DBE(18+2−8)÷2DBE = 6
AromaticityDBE=6 ≥ 4, C=9 ≥ 6Benzene ring present
Functional groups4 oxygens, DBE accounts for 1 ring + 2 C=OEster + Carboxylic acid
Candidate2-acetoxybenzoic acidAspirin ✓
Example 6

Aniline vs Nitrobenzene — Same C:N Ratio, Different Functions

Nitrogen in organic compounds leads to very different functional groups depending on oxidation state.

FormulaDBEN+O comboFunctional GroupCompound
C6H7N4N onlyAmine (–NH₂)Aniline
C6H5NO25N+2ONitro (–NO₂)Nitrobenzene
  • 1C6H7N: DBE=(12+2+1−7)÷2=4 → benzene ring, no extra π bonds → –NH₂ group
  • 2C6H5NO2: DBE=(12+2+1−5)÷2=5 → benzene ring + one more π bond → –NO₂
  • 3N+O≥2 combination → nitro compound rule correctly identifies nitrobenzene
Example 7

Vanillin — Complex Multisubstituted Aromatic

Vanillin (vanilla flavour) has three different oxygen-containing groups on a benzene ring.

PropertyValueEvidence
FormulaC8H8O3Three oxygens
DBE5(16+2−8)÷2=5
Benzene ring?YesDBE=5≥4, C=8≥6
Extra DBE1 beyond ringOne C=O (aldehyde –CHO)
3 oxygens–CHO, –OH, –OCH₃Aldehyde + phenol + methoxy
ResultVanillin4-hydroxy-3-methoxybenzaldehyde
⚙️

How the Resolver Works — 9-Module Pipeline

Internal logic of every analysis step from raw input to final compound profile

MODULE 1
📥
Formula Parser
Detects input type (name / formula / structural), strips bonds, expands brackets, counts atoms, converts to Hill notation
MODULE 2
🔢
DBE Calculator
Applies (2C+2+N−H−X)÷2. Shows full working with actual atom values. Handles halogens as −1 each.
MODULE 3
Aromaticity Detector
DBE≥4 and C≥6 confirms benzene. DBE≥7 and C≥10 = fused bicyclic. DBE≥3 with heteroatoms = heterocycle.
MODULE 4
🧪
FG Predictor
Reads atom combinations: O=1→carbonyl, O=2→acid/ester, N=1→amine/amide, S→thiol, X→halide, P→phosphate.
MODULE 5
🏗️
Skeleton Selector
Matches DBE, aromaticity, and heteroatom pattern to 14 skeleton templates. Returns most specific match.
MODULE 6
🧬
Candidate Generator
Searches 200+ compound database by exact Hill formula. Falls back to Hill-normalised scan, then to synthetic prediction.
MODULE 7
🏆
Stability Ranker
Scores each candidate: +12 database match, +8 aromatic, +5 conjugated, +3 acid, −20 diazonium, −5 strained ring.
MODULE 8
📚
Reaction DB Lookup
Searches 4596 reactions by formula and compound name. Returns up to 4 known reactions with reagents and mechanisms.
MODULE 9
📊
Final Output
Renders compound name, both formula representations, MW, DBE, functional group tags, ranked candidates, confidence badge, and known reactions.

Input type detection logic: The parser first checks if the input matches a known compound name in its 400+ alias index. If not, it tests for structural formula markers (dash -, double bond =, triple bond , or brackets). If structural markers are absent and the string starts with a capital element symbol followed by digits, it treats the input as a molecular formula. As a last resort, a partial name scan checks if the input is a substring of any known compound name.

🔬

Applying Formula & Structure Resolver in Scientific Research

Real workflows where formula-to-structure resolution accelerates discovery

1. Mass Spectrometry (MS) Unknown Identification

High-resolution mass spectrometry provides the exact molecular formula of an unknown compound from its monoisotopic mass. A researcher detecting a peak at m/z=132.0575 can compute the molecular formula as C₉H₈O (exact mass 132.0575). Entering this into the resolver immediately narrows the candidates to cinnamaldehyde or 1-indanone, with stability scoring favouring the former based on natural product abundance.

MS data: m/z = 132.0575 (M⁺) → formula C9H8O → DBE=6 → aromatic+conjugated aldehyde → Cinnamaldehyde (primary) / 1-Indanone (secondary)

2. Natural Product Dereplication

When screening plant extracts by HPLC-MS, researchers encounter hundreds of molecular formulas. The resolver's functional group prediction quickly classifies compounds — phenylpropanoids (C₉H₈O-type), flavonoids (C₁₅H₁₀O₂-type), alkaloids (N-containing). This enables rapid dereplication against known natural product databases without requiring NMR.

Extract peak C15H12O → DBE=10 → 2×benzene rings → chalcone/flavanone type → search phenylpropanoid library

3. Retrosynthetic Analysis Planning

In total synthesis planning, chemists work backwards from the target compound. By entering the target molecular formula, the resolver identifies functional groups and skeleton patterns that immediately suggest retrosynthetic disconnections. A molecule with DBE=6 and an aldehyde suggests a Wittig or aldol disconnection at the conjugated C=C bond.

Target: C9H8O (DBE=6, conjugated aldehyde) → retro: C6H5-CHO + CH2=CH-CHO (Wittig), or C6H5-CH2CHO + RCHO (aldol)

4. Fragrance Chemistry Structure Verification

Fragrance molecules are frequently described only by their common names across suppliers. Vanillin (vanilla), cinnamaldehyde (cinnamon), benzaldehyde (almond), and furfural (caramel) all have distinct molecular formulas. The resolver cross-verifies that supplier CAS numbers, molecular formulas, and structural drawings are consistent before purchasing for synthesis.

Supplier lists "C9H8O, CAS 104-55-2, cinnamic aldehyde" → resolver confirms all three = cinnamaldehyde ✓

5. Pharmaceutical Intermediate Tracking

Multi-step synthesis routes produce intermediates that may be described differently in different laboratory notebooks. The resolver normalises all representations to Hill notation, enabling consistent tracking even when one researcher writes "C₈H₈O₃" and another writes "methyl salicylate" or "2-HO-C₆H₄-COOCH₃".

Lab notebook A: C8H8O3 · Lab notebook B: methyl salicylate · Lab notebook C: 2-HO-C6H4-COOCH3 → all → same Hill key C8H8O3

6. Metabolomics Database Annotation

Metabolomics studies generate thousands of molecular formulas from LC-MS/MS experiments. The resolver assists in annotating metabolites by predicting likely compound classes from formula and DBE values. C₆H₁₂O₆ compounds are flagged as sugars; C₆H₇N compounds as aromatic amines; C₉H₈O₂ compounds as cinnamic acid derivatives.

Metabolomics hit C6H12O6, DBE=1 → polyol + one C=O → sugar (glucose/fructose/galactose) → annotate as hexose

7. Polymer Chemistry Monomer Identification

Polymers are characterised by their repeat unit formula. Entering the repeat unit formula (e.g. C₈H₈ for polystyrene, C₂H₄ for polyethylene) into the resolver identifies the monomer and its functional group, which in turn suggests the polymerisation mechanism (radical, ionic, or condensation).

Repeat unit C8H8 → DBE=5 → benzene+vinyl → styrene monomer → radical/anionic polymerisation → polystyrene

Frequently Asked Questions

Detailed answers to the most common questions about formula and structure resolution

A molecular formula gives only the count of each element — e.g. C9H8O tells you there are 9 carbons, 8 hydrogens, and 1 oxygen, but nothing about how they are connected.

A structural formula shows the connectivity — e.g. C6H5-CH=CH-CHO reveals a benzene ring, a conjugated double bond, and a terminal aldehyde group. Both describe the same compound (cinnamaldehyde), but the structural formula conveys reactivity and chemical behaviour directly. The resolver proves their equivalence by converting the structural formula back to the molecular formula via atom counting.

DBE (Degree of Unsaturation, also called Index of Hydrogen Deficiency) counts every ring and every π bond in a molecule. The formula is: DBE = (2C + 2 + N − H − X) ÷ 2

Each C=C contributes 1, each C≡C contributes 2, each ring contributes 1, and benzene (1 ring + 3 double bonds, delocalised) contributes 4. DBE matters because it constrains possible structures enormously — a molecule with DBE=6 cannot be a simple alcohol or alkane, and must contain rings and/or multiple bonds.

Example: Cinnamaldehyde C₉H₈O → DBE=(18+2−8)÷2=6 → benzene(4) + C=C(1) + C=O(1) = 6 ✓

Yes, completely. The Module 1 parser performs iterative bracket expansion before counting atoms. For CH3-C(=O)-CH3:

Step 1: Expand (=O) → the = and parentheses are stripped, leaving the O atom counted

Step 2: Strip all -, =, , and # characters

Step 3: Count: C=3, H=6, O=1 → Hill: C3H6O → Acetone

More complex brackets like C6H4(OH)(COOH) are also handled — the multiplier syntax (group)n expands the group n times before counting.

Compounds with the same molecular formula but different structural arrangements are called isomers. The molecular formula alone is mathematically ambiguous. For example, C6H12O6 describes both glucose and fructose — same atom counts, different connectivity and different chemical properties.

The resolver lists all known isomers from its database ranked by chemical stability score. Aromatic compounds rank higher than non-aromatic; natural molecules known to be stable rank higher than reactive intermediates. The highest-ranked candidate is shown first with an ⭐ marker.

To narrow to a unique structure, you need additional information: NMR data, IR peaks, or knowledge of functional groups from a reaction.

Hill notation is the IUPAC-recommended standard for writing molecular formulas: carbon first, hydrogen second, then all other elements in alphabetical order. Examples: C9H8O not O1C9H8; CH4N2O for urea not N2H4CO.

The resolver converts every input to Hill notation as a canonical key before database lookup. This means that CH3COOH, CH3-COOH, acetic acid, and C2H4O2 all hash to the same Hill key C2H4O2, ensuring a single, unambiguous database lookup regardless of how the student originally wrote the formula.

No — oxygen has a valence of 2 (same as carbon in the context of the formula), so it contributes nothing to the DBE calculation. Sulfur behaves the same way. This is why the formula is (2C + 2 + N − H − X) ÷ 2 with no O or S term.

Nitrogen (valence 3) adds +1 to the numerator because each nitrogen can form one extra bond compared to carbon. Halogens (valence 1) subtract −1 each because they each effectively replace a hydrogen.

Common trap: C₆H₅NO₂ (nitrobenzene) — DBE=(12+2+1−5)÷2=5, not 4. The extra DBE(1) comes from the N=O bond in the nitro group, even though two oxygens are present.

From formula alone, the detector cannot perfectly distinguish benzene (C₆H₆, DBE=4) from an isomeric non-aromatic compound. However, the rule DBE≥4 AND C≥6 is the correct threshold for benzene ring detection.

The resolver then checks its compound database — if C6H6 is looked up, the database returns benzene specifically. If the formula doesn't match a known compound, the detector provides the aromaticity probability as a signal to the Candidate Generator, which weights aromatic structures more highly in the stability ranking.

For cyclohexadiene (C₆H₈, DBE=3): DBE=3<4 so the aromatic flag is not set. The detector correctly identifies this as a non-aromatic cyclic diene.

When no exact database match is found, the resolver switches to a prediction mode. It still runs all 9 modules using the formula's intrinsic properties — the compound receives a low confidence rating (LOW — PREDICTED) and the candidate card shows "Unknown compound (formula)".

The DBE, aromaticity, functional group, and skeleton outputs are still accurate because they are derived purely from the atom counts — they do not depend on the compound being in the database. Only the candidate name, structural formula, and molecular weight are approximate in prediction mode.

For example, entering C50H100 still correctly returns: alkane, DBE=1 (one ring possible), MW≈700, no functional groups detected — useful for classification even without a named match.

Module 7 applies a set of chemically justified score modifiers to each candidate's base stability score:

Bonuses: Database match +12 (known real compound), aromatic ring +8 (resonance stabilisation ~150 kJ/mol), conjugated system +5 (delocalisation), carboxylic acid +3 (hydrogen bonding), alcohol +2

Penalties: Diazonium salt −20 (extremely reactive, short-lived), cyclopropane ring −5 (ring strain ~115 kJ/mol)

Scores are capped at 100. A database hit of an aromatic molecule typically scores 95–100, making it the clear top candidate.

Yes. The aromaticity detector has a dedicated rule: DBE≥3 AND heteroatoms present AND C≤6 → aromatic heterocycle likely. This correctly flags pyridine (C₅H₅N, DBE=4), pyrrole (C₄H₅N, DBE=3), furan (C₄H₄O, DBE=3), and thiophene (C₄H₄S, DBE=3).

All four common 5-membered and 6-membered aromatic heterocycles are in the compound database, so entering C5H5N returns pyridine with HIGH CONFIDENCE. The functional group predictor also correctly identifies the nitrogen as heterocyclic amine rather than primary amine because it detects the aromatic pattern.

A condensed formula groups atoms without showing bonds: CH3CH2OH for ethanol. A structural formula shows individual bonds explicitly: CH3-CH2-OH or with explicit bonds CH3-CH2-O-H. A line-angle formula (Kekulé) shows each bond as a line segment.

The resolver handles all three. For condensed formulas, the atom counting still works because no bonds are shown — the parser reads only element symbols and digits. For dash-notation structural formulas, dashes are stripped before counting. For bracket notation like C(=O), the parser expands brackets and strips bond characters.

For compounds in the database, functional group accuracy is 100% because the known compound's actual groups are returned directly. For novel compounds not in the database, accuracy depends on how specific the formula is.

Most reliable: halide detection (halogens are unambiguous), aromatic detection (DBE threshold is highly reliable), and absence of functional groups (saturated hydrocarbons). Less reliable: distinguishing aldehyde from ketone (both C=O, O=1, DBE=1) or distinguishing ester from carboxylic acid (both O=2) from molecular formula alone — this requires structural formula input for disambiguation.

The resolver acknowledges this limitation by listing multiple possible functional groups when ambiguous: "aldehyde or ketone (C=O)" rather than committing to one.

The resolver is designed for organic compounds (carbon-containing). If you enter a formula with no carbon atom, Module 4 returns "Inorganic / no carbon" and processing stops after Module 3.

For organometallic compounds (carbon + metal), the resolver will parse the carbon-containing portion and calculate DBE and functional groups based on the organic ligands, but will not identify the metal complex specifically — it will fall into prediction mode.

For purely organic compounds including all heterocycles, amino acids, drugs, natural products, and polymers, the resolver performs accurately.

A half-integer DBE value (e.g. 0.5, 1.5) indicates the formula describes a radical — a species with an odd number of electrons. For normal, stable organic molecules, DBE is always a whole number. If you calculate DBE and get a non-integer, it means the formula has an odd total valence count, which chemically means an unpaired electron (radical).

This arises in formulas like CH₃• (methyl radical): DBE = (2×1+2−3)÷2 = 0.5. The resolver handles this by rounding to the nearest 0.5 and flagging it as a potential radical species. In practice, most formula inputs from students will give whole-number DBE values corresponding to stable closed-shell molecules.

Module 8 (Reaction Database Lookup) searches the 4596-entry database in two ways: first by matching the Hill formula exactly against the reactant formula field of each entry; second by matching the resolved compound name against the reactant name field.

Up to 4 matching reactions are returned, each showing the reactant, reagent, product, and mechanism type. This directly connects formula resolution to reaction prediction — once you know what compound you have, you can immediately see what reactions it undergoes and what products form.

The 4596-reaction database covers: 1000 aromatic/cyclic reactions + 3596 aliphatic, heterocyclic, and functional group reactions spanning all major organic chemistry topics.

Absolutely. JEE and NEET frequently test the relationship between molecular formula, DBE, and compound identification. Common question types include: "Find the degree of unsaturation of C₉H₈O", "Which compound with formula C₈H₈O is an aromatic ketone?", and "How many isomers are possible for C₄H₈?". The resolver's transparent module output shows exactly the reasoning chain examiners expect.

Particularly useful for: identifying aromatic compounds from formula alone, predicting functional groups for unseen molecules, ruling out compound classes based on DBE, and connecting formula to known reactions. The 7 solved examples above are structured like typical exam problems with step-by-step solutions.