Peptide Vocabulary Analysis Reveals Ultra-Conservation and Homonymity in Protein Sequences

A new algorithm is presented for vocabulary analysis (word detection) in texts of human origin. It achieves 60%–70% overall accuracy, greater than 80% accuracy for longer words, and approximately 85% sensitivity on Alice in Wonderland, a considerable improvement on previous methods. When appl...

Bibliographic Details
Main Author: Derek Gatherer
Format: Article
Language: English
Published: SAGE Publishing 2007-01-01
Series: Bioinformatics and Biology Insights
Online Access: https://doi.org/10.4137/BBI.S415