The first sequenced plant genome, from the small mustard plant Arabidopsis thaliana, was published at the end of 2000. The sequencing of the rice genome is well under way. The sizes of plant genomes vary by a factor of up to 1000, and many important crop plants have genomes that are several times larger than the human genome. To gain insight into the gene toolbox of plant species, numerous large-scale EST sequencing projects have been launched successfully, and analysis procedures are constantly being refined to add maximum value to the sequence data. In addition, an alternative approach to exclude repetitive noncoding DNA and to enrich sequence libraries for gene-containing genomic regions has been developed. This strategy has the potential to deliver information about both genes and regulatory regions outside the transcribed regions.