A rapid and simple method for assessing and representing genome sequence relatedness
M Briand, M Bouzid, G Hunault, M Legeay, M Fischer-Le Saux, M Barret
2020
<p>Coherent genomic groups are frequently used as a proxy for bacterial species delineation through computation of overall genome relatedness indices (OGRI). Average nucleotide identity (ANI) is a widely employed method for estimating relatedness between genomic sequences. However, pairwise comparisons of genome sequences based on ANI are relatively computationally intensive and therefore preclude analyses of large datasets composed of thousands of genome sequences. In this work, we propose a workflow to compute and visualize relationships between genomic sequences. A dataset containing more than 3,500 *Pseudomonas* genome sequences was successfully classified with an alternative OGRI based on *k*-mer counts in a few hours with the same precision as ANI. A new visualization method based on zoomable circle packing was employed for assessing relationships among the 350 groups generated. Amendment of databases with these *Pseudomonas* groups greatly improved the classification of metagenomic read sets with a *k*-mer-based classifier. The developed workflow was integrated into the user-friendly KI-S tool, which is available at the following address: https://iris.angers.inra.fr/galaxypub-cfbp.</p>
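To make the idea of a *k*-mer-based relatedness index concrete, the following is a minimal sketch only. The abstract does not state the exact formula used by KI-S, so this example swaps in a plain Jaccard similarity on *k*-mer sets, expressed as a percentage. The function names, the default value of `k`, and the choice to ignore reverse complements are all assumptions made for illustration.

```python
def kmer_set(sequence, k):
    """Return the set of distinct k-mers in a sequence (forward strand only)."""
    sequence = sequence.upper()
    return {sequence[i:i + k] for i in range(len(sequence) - k + 1)}


def kmer_similarity(seq_a, seq_b, k=15):
    """Percentage of shared k-mers between two sequences (Jaccard index x 100).

    Illustrative stand-in for a k-mer-based OGRI; not the KI-S formula.
    """
    kmers_a = kmer_set(seq_a, k)
    kmers_b = kmer_set(seq_b, k)
    union = kmers_a | kmers_b
    if not union:
        # Both sequences are shorter than k, so no k-mers can be compared.
        return 0.0
    return 100.0 * len(kmers_a & kmers_b) / len(union)


if __name__ == "__main__":
    # Toy sequences with a small k so the overlap is easy to check by hand.
    print(kmer_similarity("ACGTACGT", "ACGTTTTT", k=4))
```

Set operations of this kind scale with the number of distinct *k*-mers rather than with alignment length, which is the general reason *k*-mer approaches tend to be cheaper than alignment-based ANI on large collections.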
https://www.ncbi.nlm.nih.gov/genome/
https://sourcesup.renater.fr/wiki/ki-s/
ANI, k-mers, genome sequence relatedness, similarity matrix representation, circle packing, Pseudomonas, metagenome
None
Bioinformatics, Metagenomics
2019-11-07 16:37:56
B. Jesse Shapiro
Gavin Douglas