Using the expression vector λgt11 and immunological detection, cDNA clones of an endopolygalacturonase gene of tomato (Lycopersicon esculentum Mill.) were isolated and sequenced. The 1.6 kb cDNA sequence predicts a single open reading frame encoding a polypeptide of 457 amino acids. The PG2A isoform of tomato fruit endopolygalacturonase was purified and 80% of the amino acid sequence determined. The amino acid sequence predicted by the cDNA sequence was identical to the amino acid sequence of the PG2A isoform. The position of the codon for the N-terminal amino acid of mature PG2A in the open reading frame indicates the presence of a 71 amino acid N-terminal signal peptide which is post-translationally processed. The C-terminus of purified PG2A occurred 13 amino acids before the stop codon in the cDNA suggesting that C-terminal processing of PG2A may also occur. The nucleotide and amino acid sequence data predict a mature protein of 373 amino acids and a polypeptide molecular weight of 40279. The sequence contains four potential glycosylation sites. Northern analysis detected endopolyga-lacturonase mRNA in stage 3 (turning) and stage 6 (red) ripening fruit, but not in leaves, roots, or green fruit of normal cultivars or in mature fruit of the rin mutant.