Selective Profiles among Single or Double Stranded DNA or RNA Viruses Detect their Double or Single Stranded Condition

Article Information

Carlos Y Valenzuela

Affiliation:

Programa de Genética Humana, ICBM, Facultad de Medicina, Universidad de Chile, Santiago, Chile

*Corresponding author: Carlos Y Valenzuela, Human Genetics Program, ICBM, Faculty of Medicine, University of Chile, Santiago, Chile.

Received: February 02, 2024; Accepted: February 09, 2024; Published: March 13, 2024

Citation: Carlos Y Valenzuela. Selective Profiles among Single or Double Stranded DNA or RNA Viruses Detect their Double or Single Stranded Condition. Archives of Microbiology and Immunology. 8 (2024): 84-95.

View / Download Pdf Share at Facebook

Abstract

I developed a test to study the probability distribution of bases in dinucleotides separated by 0, 1, 2 … K nucleotides sites. This analysis define selective profiles of dinucleotides by their sign and selection coefficients, their distance to neutrality measured by a chi-square value and the order of significance of their distance to randomness among the 16 possible values. In double stranded (ds) DNA or RNA an index (index = I, with 5’-3’ sense) dinucleotide participates in evolutionary processes together with three dinucleotides defined by base complementarity and 5’-3’ sense. The anti-sense dinucleotide in the same strand, the parallel (Par, with 3’-5’ sense) and anti-parallel (a-Par, with 5’- 3’ sense) dinucleotides in the complementary strand. Single stranded (ss) RNA or DNA do not have these set of dinucleotides and would not present similarities or differences, among those four dinucleotides. I studied viromes taken from GenBank, with ds and ss RNA and DNA by this dinucleotide analysis. The core matrix has rows with Separations (Sep) of bases and columns with their selective profiles of the Index, Par and a-Par dinucleotides. Profiles distances I-Par and I-a-Par where compared. The ds RNA or DNA showed larger selective profiles differences between I-Par than I-a-Par. Ss RNA or DNA viruses showed non-significant differences between these comparisons. Those ss virus that exist as ds virus in their hosts, as Hindbis and HIV virus behaved similarly to ds viruses. Coli phage PhiX174 an ss DNA virus showed smaller I-Par than I-a-Par selective distances.

Keywords

double or single stranded; DNA or RNA; selective profiles; viruses; replicative cycles; statistical determination; dinucleotides

Double or single stranded articles; DNA or RNA articles; Selective profiles articles; viruses articles; Replicative cycles articles; Statistical determination articles; Dinucleotides articles

Article Details

Introduction

Searching for a measure of the distance to neutrality of genomes, I developed a test, based on the distance to neutrality of dinucleotides whose bases are separated by 0, 1, 2, … K nucleotide sites [1-5]. I used as the measure of the distance to neutrality a chi-square (χ2) test of the random distribution of the first base according to the second base of the dinucleotides [1-5]. This study produces a matrix where rows are the number of nucleotide sites between both bases (Separations = Sep) and whose columns are the order of the distance to neutrality the 16 dinucleotides have. This order follows the value of the χ2 test assigned to a dinucleotide among the 16 values and it may or may not include the sign of the selection coefficient to make more or less discriminant the test. There is a total χ2 value given by the addition of the particular χ2 value for each dinucleotide. The particular χ2 value has one degree of freedom (χ21 ); the total χ2 value has nine degrees of freedom (χ29 ), because it correspond to the 4x4 matrix (4 rows x 4 columns, for the four bases) that loses one degreeof freedom due to fixed marginal (rows or columns) totals. Distances to neutrality resulted enormous. The significanceat the 0.05 level for χ2corresponds to 3.84; I found 377.3(see Table 1) in Sars-Cov-2 virus, for dinucleotide CG, with separation 0 (contiguous bases); the probability (P) is out of tables and programs; with Gaussian approximation (5), P < 10-50. The significance at the 0.05 level for χ2 correspondsto 16.9; I found in human chromosome 21 χ 9 = 1,885,266.8[5] for Sep 0 (P < 10-222,180). Unexpectedly I found periodicity with period 3 in the χ2 value of distance to neutrality in virus and prokaryotes and period 2 and 6 in human chromosomes. This periodicity of the distance to neutrality is incompatible with neutral and nearly neutral evolution [2-5].

Selective values allow the characterization of selective profiles of dinucleotides (hereafter, pair and dinucleotide will be synonymous unless I state other precision). The selective profile of a dinucleotide includes: 1) its selective value, 2) the sign of this selection value, 3) its χ2 value and 4) the order of significance of this value (1° to 16°). The χ2 value is calculated by (Obs – Exp )2/Exp , where Obs and Exp arethe observed and expected numbers of the ith dinucleotide, respectively. The total χ29 value is the sum of the 16 values of the 16 nucleotides. The selection coefficient and its sign is the quotient (Obsi – Expi)/Expi, where positive coefficients (selectively advantageous) apply to dinucleotides with more observed than expected pairs and negative coefficients (selectively disadvantageous) apply to dinucleotides with less observed than expected pairs. Any dinucleotide has its selective profile and this profile (a profile of an index dinucleotide or I profile) compared with the other 15 profiles produces 15 distances (to the I profile).

In double stranded (ds) RNA or DNA mutation or selection (and eventually genetic drift) of one or the two bases of one dinucleotide occur synchronously among the four dinucleotides that a pair implies due to base complementariness and 5’-3’ sense involved in duplication or transcription processes. This does not happen in a single stranded (ss) DNA or RNA. Thus, the test of comparison among selective profiles can discriminate whether the nucleic acid of the organism in study is ds or ss DNA or RNA. In ds DNA as in the human genome, the Index (I) 5’G…A3’ dinucleotide has a contra-sense (C-S) pair 3’A…G5’ (in the same DNA strand), a parallel (Par) pair 3’C…T5’, and an anti-Parallel (a-Par) pair 5’T…C3’ pair in the complementary strand. These four dinucleotides evolve together and synchronously. This coordinated evolution is not possible (physically but not evolutionarily during the viral life cycle) in ss viruses. Since there is only one published strand (the index strand from GenBank) of the nucleic acid, I study the selective profile of these four dinucleotides in the index strand (all pairs in the sense 5’…3’) and determine the similitude or difference among their selective profiles. Regardless (a priori) the reproductive processes of the organism, if there are evolutionary differences in replication or transcriptionbetween the senses of the copy, or differences due to thespecific pair, the test will detect these differences. In ss DNA or RNA, these differences should be impossible because there are not a complementary strand to generate those differences.

This study is on RNA or DNA viruses, however, thedifferences I study are not on the ss or ds physical constitution of viruses well stablished from the infective viral particle (virion). This study deals with on the evolutionary constitution and dynamic behavior of viruses. Either ds or ss viruses have different ds or ss processes along their life cycles. Mutation, selection, genetic drift, evolutionary contingencies occur at any stage of the reproductive or vital viral cycle with different intensities and in mono or bi-stranded constitutions. As for example, ss retroviruses having a retro-duplication of its nucleic acid and existence as a temperate or inserted virus in the DNA of the host may have rather ds (evolutionary) nucleic acid selective profile. A ds virus whose stage ofmutation is mainly at the ss replication process may have ss selective profile (this is very improbable).

Genomes and Methods

Complete genomes taken from GenBank (search in PubMed, Nucleotide, FASTA).

Double stranded DNA: Human adenovirus C strain KF268127.1 Human USA CL 42 1988 (35,931 bp); Human

cytomegalovirus strain AD169 X17403 (229,354 bp); Escherichia phage 2 vB EcoM PhAPEC2 KF562341 (167,318 bp); His2 virus NC_007918.1 (16,067 bp).

Single stranded DNA: Penaeus monodon hepandensovirus 4 NC_011545.2 (6,310 b); Escherichia phage phi-X174 NC_001422 (5,386 b); Phage M13 Enterobacteria NC_003287.2 (6,407 b).

Double stranded RNA: Marmot picobirnavirus strain HT1 KY855428.1 (4,579 bp); Gentian Kobu virus NC_020252.1 (22,711 bp).

Single stranded RNA: SARS-Cov-2 LR757998.1 Wuhan (29,866 b); Sindbis virus NC_001547.1 (11.703 b); HIV MN691959.1 isolate ACH2-NFLMDA13 B1 (USA) (9,943

b). E coli genome strain AVS0967 NZ_CP124398 (5,097,505 bp). The number of nucleotides used in the analyses may be less than the number in GenBank because the base identification is not complete. I did not study positive and negative condition of the viral strand to restrict the analysis.

Method

  • From this file the sets of dinucleotides whose bases are separated by 0, 1, 2, … K sites are constructed.

  • Each of these sets yield a matrix where rows are the four bases A, T, G, and C for the first base (downstream) and columns are the four bases A, T, G, and C for the second base (upstream). This is a matrix of 4x4 for the 16 possible For each of the 16 dinucleotides with bases separated by K sites a χ2 test describes the distance to neutrality. The expected number of each dinucleotides (Exp) is calculated by (fB )x(fB )xN, where f is the The nucleotide sequence of a genome, chromosome or nucleic acid segment constitute a primary file obtained from GenBank. From this file the sets of dinucleotides whose bases are separated by 0, 1, 2, … K sites are constructed.

  • Each of these sets yield a matrix where rows are the four bases A, T, G, and C for the first base (downstream) and columns are the four bases A, T, G, and C for the second base (upstream). This is a matrix of 4x4 for the 16 possible For each of the 16 dinucleotides with bases separated by K sites a χ2 test describes the distance to neutrality. The expected number of each dinucleotides (Exp) is calculated by (fB )x(fB )xN, where f is the observed frequency of B, a generic base (A, T, G or C), u and d denote upstream and downstream, respectively and N is the total number of observed (Obs) dinucleotides for this K. The χ2 (one degree of freedom = df) values for each dinucleotide is [(Obs -Exp )2/ Exp ), i going from 1 to 16. The total χ2 , with 9 df is the sum of the 16 values plus a minimal value corresponding to the difference of the Obs number of each dinucleotide with the total. This specific difference is small for each dinucleotide and neglected in present calculations.

  • The selection coefficient with its algebraic sign proceeds from ((Obsi-Expi)/ Expi)
  • Data are presented in a matrix where the rows are K (Sep = separations) and columns are selective values ordered

    according to their distance to neutrality (χ2 ) or order of significance (Sig), sign of the selection coefficient and the value of the selection coefficient (co-sel). I stopped K at 33 separations, including 0 Sep. These selective values constitutes the selective profile of this pair. I used only four Seps to short the analysis.

  • The estimates of the values of the profile is directly the figures just From these estimates the difference between the order of significance, selection coefficient and the sign of selection between the parallel (Par) or the anti-Parallel (a-Par) and the Index dinucleotide follows by direct subtraction. I did not include the (χ2 ) value in the estimation of selective differences because genomes differ in the number of dinucleotides, and this value depends greatly of this number, nor I included the profile of the contra-sense pair.
  • In the present study separations go from 0 (contiguous bases) to 3 sites of separations.
  • Study these traits in Table 1.
  • Some ad hoc statistical considerations and tools (modified from (5))

    I analyzed all the genome dinucleotides of an organism, so parameters are known, statistical tests are not necessary. I applied statistical tests to show the robustness of the analyses and sense of differences. The distance of a selective profile between two types of dinucleotides has mean (M), variance(V) and standard deviation (SD). For example, the variable distance of Par or a-Par dinucleotides to the Index (ID) dinucleotide. These parameters consider all the possible distances occurring with equal probability. In Significance (Sig) comparisons, only absolute (differences without + - signs) distances are used. An algebraic full demonstration of estimates of M, V and SD is out of the scope of this article (OSA). I offered an intuitive but complete calculation (C.Y. Valenzuela, submitted for publication). The expected meandifference in Sigs between the Index profile and Par or a-Par profiles is M = 5.667 and its SD is 3.6362. With these parameters, I tested the observed figures with one tailed t tests (with the used sample sizes it does not differ from a z test) and equal variances. The initial difference between selective coefficients (co-sel) is calculated with direct figures; this difference have a mean equal to 0 (the set of I dinucleotides is the same of the set of Par and a-Par dinucleotides). The SD must approximate that of a uniform distribution theoretically between -1.0 to +1.0 (limits of co-sels); but since selection is strongly dependent on evolutionary contingencies, I used the observed variance to construct the test because variances proceed from the same source. The definitive comparison test must be on absolute differences to avoid the difference between means that is 0, what implies distance equal to 0. The χ2 value is always positive and hides the direction of selection. I ordered those values according to a cline from the most positively to the most negatively selected or vice versa. The previous analysis (C. Y. Valenzuela, submitted for publication) considered only the magnitude of the deviations to neutrality the present one includes the sense of selection; that is why the Table in reference (C. Y. Valenzuela, submitted for publication) is different with the present one. It is irrelevant for the order of dinucleotides whether they begin from the most negative or the most positive pair. The matrix ordered according the sign of selection makes unnecessary the comparison of signs of dinucleotides, because this is implicit.

    The χ2 values of these studies are often enormous and out of tables and programs. I approximated the probability values by the normal or Gaussian equivalent; this is not valid for small degrees of freedom (df), however 9 df may be approximated with a moderate error. The mean and variance of a χ2 distribution are the degrees of freedom (df) and twice df, respectively. I made equivalent one point decimal for each two SD from the expected mean (df). With one df the application is no as valid as with nine df, but with enormous figures it yields an approximation to a qualitative intuitive evaluation.

    Results

    Table 1 shows the matrix: Selective profiles (columns)vs Separations (Sep) for this strain of the SARS-CoV-2 virus.

    Only 4 Seps are in this Table modified from unpublishedrusult (C. Y. Valenzuela, submitted for publication). Here,the order of significance include its sign of selection. Themost significant pair is CG (-) at Sep 0 with χ2= 377.3, whose probability value is P < 10-133. There are dinucleotides where the Index has the same Par and a-Par dinucleotide: AA, TT, GG and CC whose Par and a-Par are TT, AA, CC and GG, respectively. Other dinucleotides have different Par and equal a-Par as AT, TA, GC and CG whose Par pairs are TA, AT, CG and GC, and whose a-Par pairs are AT, TA, GC, and CG respectively. I analyzed both subsets of pairs only in the comparison of SARS-CoV-2 with E. coli to show a sharp difference in selective profiles with a known big ds DNA genome (see below).

    Dinucleotides with different Par and a-Par pairs are AG, AC, TG, TC, GA, GT, CA and CT pairs. I studied this last group of pairs and the first four separations (0, 1, 2 and 3). To explain friendly the method, the result with SARS-CoV-2 is used. Remember selective profiles include four traits: 1) the order of significance (Sig, 1 to 16) given by 2) the χ2value, 3) the selection coefficient (co-sel), with its 4) sign of selection (+ or -). I compared three of these four traits, butnot χ2values between both species due to their differencein genome sizes; the comparison of selective signs is direct and implicit. The difference in selective traits between Par or a-Par and the Index dinucleotides is direct from Table Let us choose TG as the index pair, with 0 Sep, it has Sig = 16° (place), co-sel = +0.377 and Sign = +. Its Par pair is AC with Sig = 14°, co- sel = +0.231 and Sign = +; its a-Par pair is CA with Sig = 15° (place), co-sel = +0.269 and sign = +. The absolute difference in these three traits with the Index is for I-Par pair: 2° Sig, 0.146 co-sel and (+ +) Sign, respectively. The absolute difference in these three traits with the Index is for I-a-Par pair: 1° Sig, 0.108 co-sel and (+ +). Table 2 presents the selective profile comparisons of the Index, Par, and a-Par for the 8 Index dinucleotides having different Par and a-Par pairs, for the four Separations. The Significance distance I-Par was smaller than that of the I-a-Par, but the difference was not significant (minimal significance is P = 0.05). As I mentioned the comparison of signs of selection in the following tables is not included. The differences in the

    Table 1: Significance vs Separations of dinucleotides from SARS-CoV-2. Four Separations

    Table icon

    Sep = n° sites of separation; dinuc[s] = dinucleotide with its sign of selection; χ2 = chi-square value for the dinucleotide at the left hand; co-sel = selection coefficient selection coefficient followed the same pattern as Significance, with a higher probability of significance (a less significant result). It is remarkable that this result is different from a previous study (C.Y. Valenzuela, submitted for publication) with distances of the chi-squared value without specifying the sign of selection (OSA).

    Before presenting the study of twelve viruses, I present the dinucleotide analysis of the E. coli genome to establish clear differences with a rather big (5 million pb) ds DNA. This is necessary because there is not another researcher working in this field and the reader must know what happens with big ds genomes. Table 3 presents the dinucleotide spectrum of the genome analysis of E coli with 4 Seps.

    Table 2: SARS-CoV-2. Index dinucleotides with different Par and a-Par pairs. Four Separations

    Table icon

    Par = parallel; a-Par = anti-parallel; dinuc[s] = dinucleotide; co-sel = selection coefficient; D Ind-Par = distance Index-Par; D Ind-a-Par = distance Index a-Par; D Si = difference in Significance; D Se = difference in co-sel; AB Si = absolute difference in D Si; AB Se = absolute difference in D Se; SD =standard deviation; P t test = probability under a one tailed t test. with equal variance for the comparisons of D Si and D Se. respectively

    Table 3: Significance vs Separations of dinucleotides from E. coli. Four Separations

    Table icon

    Nomenclature as in Table 1 The difference between SARS-CoV-2 and E. coli is evident at the first inspection and comparison between Table 1 and Table 3. Significances in E. coli are 50 or more times those of SARS-CoV-2. The χ2 value of the first dinucleotide GC+ is 24,073.2, this implies a probability (Gaussian approximation; no as valid as in χ2 test) less than 10-4,250. Dinucleotides with equal Par and a-Par complementary dinucleotides shows sharp differences. In SARS-CoV-2 AA and TT, and GG and CC separate at the places of significances (Sigs). For example at Sep 0, AA is at 12° Sig while TT is at 9°, the difference in Sig is 3°, at Sep 1 the difference is 7° (3° and 10°, respectively), at Sep 2 is 3° (5° and 2°, respectively), and at Sep 3 it is 3° (12° and 15°, respectively). In E. coli AA and TT are contiguous (distance 1°) in the four Seps. GG and CC behave equally, while in SARS-CoV-2 the distances between both are 2°, 4°, 3° and 4° for Seps 0, 1, 2 and 3, respectively, in E. coli they are 1°, 2°, 1° and 1°, respectively. It is evident that in a ds genome (E. coli) complementary dinucleotides behave different as in SARS- CoV-2, a ss genome, where complementary nucleic acid do not exist. (However, it is remarkable that the difference is not as big as expected; I shall discuss that). This occurs equally with the human genomes (C. Y. Valenzuela, submitted for publication). I remember that the expected mean difference in Sig profile is 5.667 and the standard deviation is 3.636. The following analysis in E. coli is similar to that presented in Table 2 for SARS-CoV-2 on the distance of selective profiles from Par and a-Par dinucleotides to their Index; these Index pairs have different Par and a-Par dinucleotides. Table 4 presents it. In this Table the mean Sig distance I-a-Par is 1.06 (contiguous dinucleotides) while in I-Par is 6.06 (t test P < 10- 9) and in Sel distance the mean is 0.0679 for I-Par and for I-a- Par is 0.0013 (P < 10-13). Selective profile of the 5’-3’ a-Par dinucleotide is practically the same as the selective profile of the Index. The 5’-3’ parallel dinucleotide differs from the Index. Remark that it is not possible to analyze the true 3´-5´ parallel dinucleotide, because we have only one strand from GenBank. The correction of this possible error is, at present OSA. Statistical tests where done as compliance of statistical norms, because values of differences in Sigs and co-sel are significant by the simple inspection. Differences in co-sel in the I-Par comparisons are at the order of tenths or hundredths, and the differences in I-a-Par are at the order of thousandths or ten thousandths. In Table 4 only absolute distances I-Par and I-a-Par are considered.

    Table 5 for DNA and Table 6 for RNA viruses present the comparisons for 12 viruses with the same format of SARS- CoV-2 and E. coli. Only mean, SD and t test probability appear. The single or double stranded DNA or RNA condition

    Table 4: E. coli. Index dinucleotides with different Par and a-Par pairs, four Separations

    Table icon

    Nomenclature as in Table 2. D Ind-Par = absolute distance Index-Par; D Ind –a-Par = absolute distance Index-a-Par is in the first row of each set of viruses. Names are those used in current language and in GenBank and not necessarily scientific species nomenclature (excepting SARS-CoV-2 and E.coli, see accession numbers of genomes).

    These results were expected; ds DNA viruses showed differences between I-Par larger than differences between I-a- Par. Cytomegalovirus had the largest difference very probably because it is the largest virus. On the contrary, ss DNA viruses disagreed with this tendency. Penaeus monodon did not present significant difference between I-Par and I-a-Par

    Table 5: Means, standard deviations and t test probability for distances Index-Par and Index-a-Par. DNA viruses

    Table icon

    Nomenclature as in Table 2

    Table 6: Means. standard deviations and t test probability for distances Index-Par and Index-a-Par. RNA viruses

    Table icon

    Nomenclature as in Table 2

    differences. However and unexpectedly Coliphage phi-X174 presented an inverse difference of the differences: I-Par difference is significant smaller than the I-a-Par difference.

    Table 6 shows RNA viruses. In ds RNA viruses Marmot picobirnaviridae virus presented the expected larger difference in I-Par pair, but no significantly, while Gentian Kobu virus did. In this case, the small size of the viruses may explain the non-significant result. As expected the ss RNA virus SARS-CoV-2 did not presented difference between I-Par and I-a-Par, while Sindbis virus presented a small but significant difference and HIV virus a large and significant difference.

    I mentioned these distances to neutral evolution were periodical. It seems important to examine whether these viromes present periodicity because this is the first time the method applies to a set of viruses. SARS-CoV-2 is periodical (C. Y. Valenzuela. submitted for publication) the 12 viromes and E coli are in Table 7. The pivot distance (the largest) of the period is in italics it follows and precedes smaller values. Asterisks show the element of the series that do not accomplish partially or completely this criterion. The first three values and sometimes the six ones are influenced by the first and second values that have regulatory dinucleotides. Those values less than 17 are not significant and do not alter necessarily the periodicity in this set of values. This is because the variance of the test is 2xdf = 18.0 and the SD is 4.243; random fluctuation near those values blurs the periodicity. Besides the χ2 value is less significant in ss DNA or RNA and smaller viruses. Regardless these restrictions the periodicity is impressive. Moreover, we see that single or double stranded viruses that may incorporate to the host dsDNA present a clear periodicity similar as the host one without exceptions (HIV presents only one exception at Sep 23).

    Table 7: Periodicities of distances to evolutionary neutrality in viromes and E. coli genome

    Table icon

    In italics the largest distance among the triplet of the period measured by the total χ29. For names of viruses see Genomes. *figure that does not fit partially or completely the condition of the largest distance

    Discussion

    This is the first time this statistical tool is used to search for the single or double stranded nature of RNA and DNA viruses and their implications on evolution. Results are according to the theoretical expected behavior of ds or ss viruses with some important exceptions. Escherichia phage phi X174, a ss DNA virus presented a more similar selective profile between Indices and Par pairs than between Indices and a-Par pairs. A plausible explanation is the fact that this virus replicates by using a negative DNA strand after constructing a dsDNA [6, 7]. The Marmot Picobirnaviridae a dsRNA virus [8] shows a non-significant larger selective profile difference between I-Par than I-a-Par, this is perhaps it is a very small virus (4.579 bp). However, Gentian Kobu sho-associated virus also a dsRNA virus [9] according to the Oxford database shows a big significant difference as expected; however, other authors present it as an ssRNA virus [10]. According to my analysis, this virus behaves as an evolutionary or replicative dsRNA virus. Arguments in favor or against are present in these analyses and Tables but their study is OSA. In the group of ssRNA, viruses SARS-CoV-2 did not presented differences between I-Par and I-a-Par pairs selective profiles, while Sindbis virus and HIV showed a significant difference where I-Par differences were larger than I-a-Par differences. This was more significant in HIV than in Sindbis virus perhaps for HIV is a retrovirus that exists as a double stranded DNA segment incorporated in the host DNA; however, Sindbis virus needs ds RNA phase in its replication cycle [11]. These data, methods and analyses were presented in international and national congresses [12, 13].

    The present analyses indicate that the classification in single or double stranded DNA or RNA virus based on the virion physical constitution is useful but it does not include evolutionarily functional double or single stranded constitution mostly produced at the replicative or transcription stage of the viral vital cycle. This new conceptualization seems useful and offers a different understanding of the viral behavior and evolution, which is also useful in epidemiological studies. However, to analyze more precisely these hypotheses need a deep research (OSA).

    As for the impressive periodicities, two hypotheses may be advanced. The structural hypothesis proposes these periodicities of the distance to neutrality originated in a basic property of nucleic acids that has different structural stability in triplets of nucleotides; triplets for the genetic code are a result of adaptation to this primary periodicity. The functional hypothesis proposes the periodicity has evolved as the genetic code installed. During and after this installation the genome segments (coding or non-coding for proteins) optimize an organization that dances at the rhythm of trinucleotides. Arguments in favor or against of these hypotheses are OSA.

    Conclusion

    The comparisons of differences in selective profiles between an index (I) dinucleotide with its complementary parallel (Par) and anti-parallel (a-Par) dinucleotide informs on the double or single stranded (ds or ss) nature of the nucleic acid of any organism. The foundation of this discrimination happens because ds DNA or ds RNA have parallel and antiparallel strands that evolve together, but ss DNA or ss RNA do not. The single and double stranded evolutionary functional constitution of viruses appears as a useful tool to understand the life cycle of viruses and their epidemiological properties. The article demonstrates that this methods works without exceptions in the 13 examined genomes.

    Ethics approval and consent to participate

    Not applicable.

    Consent for publication:

    Not Applicable.

    Availability of data and Materials:

    The genomes’ information is available in GenBank as cited. Any geneticist, student or statistician who knows the current informatics languages can construct these computer programs and perform these analyses.

    Competing Interests:

    I do not have any competing interest.

    Funding:

    This work and the article did not have extraordinary funds.

    Author’s Contributions:

    CYV carried out all the work described in this article.

    Acknowledgements:

    To Francesco M Scudo and Ching Chung Li my first teachers of population genetics. My students Franccesca Soriano, Francisco Rosas and Matías Rozas helped me in processing data.

    References

    1. Valenzuela Non-random pre-transcriptional evolution in HIV-1. A refutation of the foundational conditions for neutral evolution. Genet Mol Biol 32 (2009): 159-169.
    2. Valenzuela Internucleotide correlation and nucleotide periodicity in Drosophila mtDNA: New evidence for panselective evolution. Biol Res 43 (2010): 481-486.
    3. Valenzuela CY. Heterogeneous periodicity of drosophila mtDNA: new refutations of neutral and nearly neutral Biol Res 44 (2011): 283-293.
    4. Valenzuela CY. The structure of selective dinucleotide interactions and periodicities in D melanogaster Biol Res 47 (2014):1-12.
    5. Valenzuela Selective intra-dinucleotide interactions and periodicities of bases separated by K sites: a new vision and tool for phylogeny analyses. Biol Res 50 (2017): 3-16.
    6. Venclovas Genome Replication of Bacterial and Archaeal Viruses. In Bamford DH. Zuckerman M (ed), Encyclopedia of Virology, 4th ed, Academic Press Cambridge, Massachusetts (2021) :429-438.
    7. Cann AJ. Viral Replication Cycle, p. 382-387 In Mahy BWJ, Van RegenmortelMHV (ed), Encyclopedia of Virology, 3rd ed, Academic Press, Cambridge, Massachusetts (2008).
    8. Kashnikov AY, Epifanova NV, and Novikova Picobirnaviruses: prevalence, genetic diversity, detection methods. Vavilovskii Zhurnal Genet Selektsii 24 (2020): 661-672.
    9. Schoch CL, et al. NCBI Taxonomy: a comprehensive update on curation, resources and Tools, Database (Oxford) (2020): 1150353.
    10. Debat H, Bejerman N. Two novel favi-like viruses shed light on the plant-infecting koshoviruses. Arch Virol 168 (2023): 184.
    11. He L, Piper A, Meilleur F, Myles DAA, Hernandez R, Brown DT, et The Structure of Sindbis Virus Produced from Vertebrate and Invertebrate Hosts as Determined by Small-Angle Neutron Scattering. J Virol 84 (2010): 5270–5276.
    12. Valenzuela CY. Selective dinucleotide profiles as tests for mono or bi stranded DNA or RNA. First Latin American Congress on Evolution, 30/11 - 03/12 (CLEVOL, online) Chile (2021).
    13. Valenzuela CY. Selective profiles comparisons in DNA or RNA viruses to determine their single or double stranded condition. LVI Annual Meeting of the Chilean Society of Genetics and XV Annual Meeting of the Chilean Society of Evolution, 16-20/10, Talca, Chile (2023).

© 2016-2024, Copyrights Fortune Journals. All Rights Reserved