<?xml version="1.0" encoding="UTF-8"?><rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:dcq="http://purl.org/dc/terms/"><records count="1" morepages="false" start="1" end="1"><record rownumber="1"><dc:product_type>Journal Article</dc:product_type><dc:title>Variant Effect Prediction in the Age of Machine Learning</dc:title><dc:creator>Bromberg, Yana; Prabakaran, R; Kabir, Anowarul; Shehu, Amarda</dc:creator><dc:corporate_author/><dc:editor/><dc:description>Over the years, many computational methods have been created for the analysis of the impact
of single amino acid substitutions resulting from single-nucleotide variants in genome coding
regions. Historically, all methods have been supervised and thus limited by the inadequate
sizes of experimentally curated data sets and by the lack of a standardized definition of variant
effect. The emergence of unsupervised, deep learning (DL)-based methods raised an important
question: Canmachines learn the language of life fromthe unannotated protein sequence
data well enough to identify significant errors in the protein “sentences”? Our analysis suggests
that some unsupervised methods perform as well or better than existing supervised
methods. Unsupervised methods are also faster and can, thus, be useful in large-scale
variant evaluations. For all other methods, however, their performance varies by both evaluation
metrics and by the type of variant effect being predicted.We also note that the evaluation
of method performance is still lacking on less-studied, nonhuman proteins where unsupervised
methods hold the most promise.</dc:description><dc:publisher>Cold Spring Harbor: Perspectives in Biology</dc:publisher><dc:date>2024-07-01</dc:date><dc:nsf_par_id>10523474</dc:nsf_par_id><dc:journal_name>Cold Spring Harbor Perspectives in Biology</dc:journal_name><dc:journal_volume>16</dc:journal_volume><dc:journal_issue>7</dc:journal_issue><dc:page_range_or_elocation>a041467</dc:page_range_or_elocation><dc:issn>1943-0264</dc:issn><dc:isbn/><dc:doi>https://doi.org/10.1101/cshperspect.a041467</dc:doi><dcq:identifierAwardId>2318829; 2310114; 2310113</dcq:identifierAwardId><dc:subject/><dc:version_number/><dc:location/><dc:rights/><dc:institution/><dc:sponsoring_org>National Science Foundation</dc:sponsoring_org></record></records></rdf:RDF>