For the last 14 years, scientists have come to the Addgene website in search of plasmids. Now they are beginning to see Addgene as a large data set as well. The repository holds over 65,000 plasmids, each verified by sequencing, which makes it a convenient source of sequence data.
A group of scientists from MIT tapped into this data to learn about trends in synthetic biology and DNA synthesis. Their paper in Nature Communications presents the results and announces a new bioinformatics tool that can predict whether a gene is natural or synthetic just by looking at its sequence.