Euclidean Distance Analysis Enables Nucleotide Skew Analysis in Viral Genomes

Title

Euclidean Distance Analysis Enables Nucleotide Skew Analysis in Viral Genomes

Author

Ben Berkhout, Formijn van Hemert, Maarten Jebbink, Andries van der Ark, Frits Scholer

Description

Nucleotide skew analysis is a versatile method to study the nucleotide composition of RNA/DNA molecules, in particular to reveal characteristic sequence signatures. For instance, skew analysis of several viral RNA genomes indicated that the nucleotide bias is enriched in the unpaired, single-stranded genome regions, thus creating an even more striking virus-specific signature. However, comparing skew graphs for many virus isolates or families is difficult, time-consuming, and nonquantitative. Here, we present a procedure for simpler identification of similarities and dissimilarities between nucleotide skew data of coronavirus, flavivirus, picornavirus, and HIV-1 RNA genomes. Window and step sizes were normalized to correct for differences in viral genome length. Cumulative skew data are converted into pairwise Euclidean distance matrices, which can be presented as neighbor-joining trees. We present skew value trees for the four virus families and show that closely related viruses are placed in small clusters. Importantly, the skew value trees are similar to the trees constructed by a “classical” model of evolutionary nucleotide substitution. Thus, we conclude that the simple calculation of Euclidean distances between nucleotide skew data allows an easy and quantitative comparison of characteristic sequence signatures of virus genomes. These results indicate that the Euclidean distance analysis of nucleotide skew data is a useful addition to the virology toolbox.
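
The procedure summarized in the description (length-normalized windows, cumulative skew, pairwise Euclidean distances) could be sketched roughly as follows. This is an illustrative sketch only, not the authors' implementation: the function names, the (a - b) / (a + b) skew formula, the use of non-overlapping windows, and the default of 100 windows are all assumptions, since the record does not specify them.

```python
import numpy as np

def cumulative_skew(seq, a="G", b="C", n_windows=100):
    """Cumulative (a - b) / (a + b) skew over n_windows slices of seq.

    Assumption: non-overlapping windows whose size scales with genome
    length, so every genome yields the same number of data points.
    """
    seq = seq.upper()
    edges = np.linspace(0, len(seq), n_windows + 1).astype(int)
    skews = []
    for start, end in zip(edges[:-1], edges[1:]):
        window = seq[start:end]
        na, nb = window.count(a), window.count(b)
        # Windows containing neither nucleotide contribute zero skew.
        skews.append((na - nb) / (na + nb) if na + nb else 0.0)
    return np.cumsum(skews)

def distance_matrix(profiles):
    """Pairwise Euclidean distances between equal-length skew profiles."""
    stacked = np.vstack(profiles)
    diff = stacked[:, None, :] - stacked[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=-1))

# Toy sequences (hypothetical, not real viral genomes).
genomes = ["GGGGCCCC", "GGGGCCCC", "CCCCGGGG"]
profiles = [cumulative_skew(g, n_windows=4) for g in genomes]
matrix = distance_matrix(profiles)
```

The resulting symmetric matrix is the kind of input a neighbor-joining implementation would take to build a skew value tree; identical sequences sit at distance zero, and sequences with opposite skew sit far apart.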

Date

2018

Identifier

DOI: 10.1155/2018/6490647

Source

Computational and Mathematical Methods in Medicine

Publisher

Hindawi Limited

Coverage

Computer applications to medicine. Medical informatics

Files

https://socictopen.socict.org/files/to_import/pdfs/3041910.pdf

Citation

Ben Berkhout, Formijn van Hemert, Maarten Jebbink, Andries van der Ark, Frits Scholer, “Euclidean Distance Analysis Enables Nucleotide Skew Analysis in Viral Genomes,” SOCICT Open, accessed April 16, 2026, https://www.socictopen.socict.org/items/show/2941.
