Skip to main content

Cytosine

C4H5N3O organic

Properties

StateSolid (white crystalline powder)
ColorWhite
SolubilitySoluble in water (7.7 g/L at 25°C); soluble in dilute acids
Melting Point320-325°C (decomposes)
Boiling PointDecomposes before boiling

About Cytosine

Cytosine is a pyrimidine nucleobase with a quietly significant problem: it is the least chemically stable of the four DNA bases. The 4-amino group hydrolyzes spontaneously, deaminating cytosine to uracil at roughly 100-500 events per cell per day in mammalian genomes. That is why uracil DNA glycosylase exists. Without it, every C-G pair that drifted to U-G would lock in a C-to-T transition on the next replication. Pair that with a guanine through three hydrogen bonds and you get the strongest of the canonical Watson-Crick pairs (G triple-bonded to C), which is part of why GC-rich regions melt at higher temperatures and matter so much in PCR primer design. The most consequential modification of cytosine is methylation at the 5-position by DNMT1/3A/3B, producing 5-methylcytosine in CpG dinucleotides. CpG methylation silences gene expression, drives X-inactivation, and underlies genomic imprinting. The TET enzymes oxidize 5mC to 5-hydroxymethylcytosine, then 5-formyl- and 5-carboxylcytosine, giving the cell an active demethylation pathway. The catch: 5mC deaminates to thymine rather than uracil, and a T-G mismatch is harder for repair machinery to recognize. That is why CpG sites are mutational hotspots and why the human genome is CpG-depleted relative to expectation.

Where you'll encounter it

If you have ever designed a bisulfite-sequencing experiment, you have leaned hard on cytosine chemistry. Bisulfite treatment deaminates unmethylated cytosines to uracil while leaving 5-methylcytosine intact, so after PCR every unmethylated C reads as a T. The same chemistry shows up in cancer epigenetics work and in the increasingly common Oxford Nanopore and PacBio modified-base callers, which detect 5mC directly from raw signal. Cytosine also surfaces in clinical pharmacology: cytarabine (ara-C), the backbone AML induction drug, is just cytosine attached to arabinose instead of ribose.

Common Uses

  • Reference standard for HPLC and mass-spec assays of nucleic acid hydrolysates in research labs
  • Substrate for testing cytidine deaminase and DNMT enzyme kinetics in biochemistry studies
  • Teaching material for Watson-Crick pairing, tautomerism, and mutational mechanisms in molecular biology courses
  • Starting material for synthesis of cytarabine, gemcitabine, and other antineoplastic nucleoside analogs
  • Calibration analyte in bisulfite-sequencing and 5mC detection workflows for epigenetics research
  • Component of synthetic oligonucleotides ordered for CRISPR guides, PCR primers, and probe sets
  • Building block in prebiotic chemistry experiments studying RNA-world abiogenesis hypotheses
  • Internal standard in metabolomics panels measuring nucleobase turnover in tissue extracts

Safety Information

Cytosine itself is not classified as hazardous under GHS for ordinary research handling and has no established OSHA PEL. Treat it as a mildly irritating fine powder: wear nitrile gloves, safety glasses, and a dust mask when weighing out solids, and work in a fume hood if generating airborne dust. Avoid ingestion and prolonged skin contact. The deaminated product, uracil, is similarly low-toxicity. Therapeutic cytosine analogs (cytarabine, gemcitabine, 5-azacytidine) are an entirely different category — those are cytotoxic chemotherapy agents that require dedicated handling protocols (USP 800), closed-system transfer devices, and chemo-rated PPE. Do not confuse the parent base with its clinical derivatives.

This safety summary is for educational reference only and may not be complete. It is not a substitute for Safety Data Sheets (SDS), medical advice, or professional chemical safety guidance. Always consult appropriate SDS and qualified professionals before handling chemicals.

Constituent Elements

Frequently Asked Questions

What is the molar mass of cytosine?
111.102 g/mol — it works out from 4 carbons (4 × 12.011), 5 hydrogens (5 × 1.008), 3 nitrogens (3 × 14.007), and 1 oxygen (15.999). Useful when you need to back-calculate concentrations from an A260 reading on a single-base hydrolysate, since cytosine has an extinction coefficient of about 7,400 M⁻¹·cm⁻¹ at 271 nm in neutral solution.
What is DNA methylation and why does it matter?
DNMT enzymes add a methyl group to the 5-position of cytosine in CpG dinucleotides, producing 5-methylcytosine. Methylated CpG islands in promoter regions recruit MBD-family readers and HDAC complexes, compacting chromatin and silencing transcription. It is the mechanism behind X-chromosome inactivation, imprinting at loci like H19/IGF2, and a lot of cancer epigenetics — promoter hypermethylation of tumor suppressors like MLH1 and CDKN2A is a recurring theme.
Why does cytosine deamination cause mutations?
The 4-amino group hydrolyzes off and you get uracil. If uracil DNA glycosylase catches it before replication, no problem — it is excised and base-excision repair restores the C. If replication runs first, the U pairs with A and the daughter strand locks in a T, so the original C-G becomes T-A. With 5-methylcytosine the deamination product is thymine, which the repair machinery cannot easily distinguish as wrong, which is exactly why CpG sites are mutational hotspots.