Recommendation

Improving the sequencing of single-stranded DNA viruses: Another brick for building Earth's complete virome encyclopedia

Sebastien Massart based on reviews by Philippe Roumagnac and 3 anonymous reviewers

A recommendation of:

T7 DNA polymerase treatment improves quantitative sequencing of both double-stranded and single-stranded DNA viruses

Maud Billaud, Ilias Theodorou, Quentin Lamy-Besnier, Shiraz Shah, François Lecointe, Luisa De Sordi, Marianne De Paepe, Marie-Agnès Petit (2024), bioRxiv, ver.4, peer-reviewed and recommended by PCI Genomics https://doi.org/10.1101/2022.12.12.520144

Read preprint in preprint server Now published in Peer Community Journal

Data used for results

Abstract

EN

AR

ES

FR

HI

JA

PT

RU

ZH-CN

T7 DNA polymerase treatment improves quantitative sequencing of both double-stranded and single-stranded DNA viruses

Background: Bulk microbiome, as well as virome-enriched shotgun sequencing only reveals the double-stranded DNA (dsDNA) content of a given sample, unless specific treatments are applied. However, genomes of viruses often consist of a circular single-stranded DNA (ssDNA) molecule. Pre-treatment and amplification of DNA using the multiple displacement amplification (MDA) method enables conversion of ssDNA to dsDNA, but this process can lead to over-representation of these circular ssDNA genomes. A more recent alternative permits to bypass the amplification step, as library adapters are ligated to sheared and denatured DNA, after an end-modification step (xGen kit). However, the sonication step might shear ssDNA more efficiently than dsDNA, therefore introducing another bias in virome sequencing. These limitations prompted us to explore an alternative method of DNA preparation for sequencing mixed ssDNA and dsDNA viromes.

Results: Using a synthetic mix of viral particles, we made use of the T7 DNA polymerase (T7pol) to convert viral circular ssDNA molecules to dsDNA, while preventing over-replication of such molecules, as is the case with the Phi29 DNA polymerase. Our findings indicate that using T7pol and a mix of degenerated primers to convert ssDNA to dsDNA prior library preparation is a good alternative to the currently used methods. It better represents the original synthetic mixtures compared to MDA or direct application of the xGen kit. Furthermore, when applied to two complex virome samples, the T7pol treatment improved both the richness and abundance in the Microviridae fraction.

Conclusion: We conclude that T7pol pretreatment is preferable to MDA for the shotgun sequencing of viromes, which is easy to implement and inexpensive.

T7 DNA polymerase, shotgun sequencing, single-stranded DNA

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

يعمل علاج بوليميريز الحمض النووي T7 على تحسين التسلسل الكمي لكل من فيروسات الحمض النووي المزدوجة والمفردة الذين تقطعت بهم السبل

يكشف تسلسل بندقية فيروم فقط عن محتوى الحمض النووي المزدوج (dsDNA) لعينة معينة، ما لم يتم تطبيق علاجات محددة. ومع ذلك، فإن جينومات الفيروسات غالبًا ما تتكون من جزيء DNA أحادي السلسلة (ssDNA). المعالجة المسبقة وتضخيم الحمض النووي باستخدام طريقة تضخيم الإزاحة المتعددة (MDA) تمكن من تحويل ssDNA إلى dsDNA، ولكن هذه العملية يمكن أن تؤدي إلى الإفراط في تمثيل جينومات ssDNA الدائرية. يسمح البديل الأحدث الذي يستخدم مجموعة xGen بربط المحولات مباشرة بالحمض النووي المقطوع والمشوه. ومع ذلك، فإن خطوة الصوتنة قد تقطع ssDNA بشكل أكثر كفاءة من dsDNA، وبالتالي إدخال تحيز آخر في تسلسل الفيروس. دفعتنا هذه القيود إلى استكشاف طريقة بديلة لإعداد الحمض النووي لتسلسل فيروسات ssDNA وdsDNA المختلطة. نقدم هنا طريقة جديدة لتسلسل كل من ssDNA وdsDNA، باستخدام بوليميريز الحمض النووي T7 (T7pol) لتحويل ssDNA إلى dsDNA. قارنا هذه الطريقة مع أسلوبين آخرين: sMDA مع حضانة لمدة 30 دقيقة وقص الحمض النووي المباشر دون تحويل ssDNA إلى dsDNA، باستخدام خليط من خمس عاثيات، بما في ذلك اثنان مع ssDNA. ولضمان العدالة، تم إعداد جميع العينات لاحقًا باستخدام مجموعة xGen. تشير النتائج التي توصلنا إليها إلى أن طريقة T7pol تمثل بشكل أفضل مخاليط الملتهمة الأصلية مقارنة بالطرق الأخرى. علاوة على ذلك، عند تطبيقه على عينتين من الفيروسات المعقدة، أدى علاج T7pol إلى تحسين ثراء ووفرة جزء الفيروسات الدقيقة. نستنتج أن المعالجة المسبقة بـ T7pol أفضل من MDA في تحديد التسلسل الفيروسي للفيروسات، وهو أمر سهل التنفيذ وغير مكلف.

بوليميريز الحمض النووي T7، تسلسل البندقية، الحمض النووي المفرد الذي تقطعت به السبل

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

El tratamiento con ADN polimerasa T7 mejora la secuenciación cuantitativa de virus de ADN monocatenarios y bicatenarios

La secuenciación Virome Shotgun solo revela el contenido de ADN bicatenario (ADNbc) de una muestra determinada, a menos que se apliquen tratamientos específicos. Sin embargo, los genomas de los virus suelen consistir en una molécula circular de ADN monocatenario (ADNss). El pretratamiento y la amplificación del ADN mediante el método de amplificación por desplazamiento múltiple (MDA) permiten la conversión de ssDNA en dsDNA, pero este proceso puede conducir a una sobrerrepresentación de estos genomas circulares de ssDNA. Una alternativa más reciente que emplea el kit xGen permite la ligación de adaptadores directamente al ADN cortado y desnaturalizado. Sin embargo, el paso de sonicación podría cortar el ssDNA de manera más eficiente que el dsDNA, introduciendo así otro sesgo en la secuenciación del viroma. Estas limitaciones nos llevaron a explorar un método alternativo de preparación de ADN para secuenciar viromas mixtos de ADNss y ADNds. Presentamos aquí un nuevo método para secuenciar tanto ssDNA como dsDNA, utilizando la ADN polimerasa T7 (T7pol) para convertir ssDNA en dsDNA. Comparamos este método con otros dos: sMDA con una incubación de 30 minutos y corte directo de ADN sin conversión de ssDNA a dsDNA, utilizando una mezcla de cinco bacteriófagos, incluidos dos con ssDNA. Para garantizar la equidad, todas las muestras se prepararon posteriormente con el kit xGen. Nuestros hallazgos indican que el método T7pol representa mejor las mezclas de fagos originales en comparación con los otros métodos. Además, cuando se aplicó a dos muestras de viroma complejas, el tratamiento con T7pol mejoró tanto la riqueza como la abundancia en la fracción de Microviridae. Concluimos que el pretratamiento con T7pol es preferible al MDA para la secuenciación rápida de viromas, que es fácil de implementar y económico.

ADN polimerasa T7, secuenciación escopeta, ADN monocatenario

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

Le traitement par l'ADN polymérase T7 améliore le séquençage quantitatif des virus à ADN double brin et simple brin

Le séquençage Virome shotgun ne révèle que la teneur en ADN double brin (ADNdb) d'un échantillon donné, à moins que des traitements spécifiques ne soient appliqués. Cependant, les génomes des virus sont souvent constitués d’une molécule circulaire d’ADN simple brin (ADNsb). Le prétraitement et l'amplification de l'ADN à l'aide de la méthode d'amplification à déplacement multiple (MDA) permettent la conversion de l'ADN double brin en ADN double brin, mais ce processus peut conduire à une surreprésentation de ces génomes circulaires d'ADN double brin. Une alternative plus récente utilisant le kit xGen permet la ligature des adaptateurs directement sur l'ADN cisaillé et dénaturé. Cependant, l’étape de sonication pourrait cisailler l’ADN double brin plus efficacement que l’ADN double brin, introduisant ainsi un autre biais dans le séquençage du virome. Ces limitations nous ont incités à explorer une méthode alternative de préparation de l’ADN pour le séquençage de viromes mixtes d’ADN double brin et d’ADN double brin. Nous présentons ici une nouvelle méthode de séquençage de l’ADN double brin et de l’ADN double brin, en utilisant l’ADN polymérase T7 (T7pol) pour convertir l’ADN double brin en ADN double brin. Nous avons comparé cette méthode à deux autres : le sMDA avec une incubation de 30 minutes et un cisaillement direct de l'ADN sans conversion d'ADNsb en ADNdb, en utilisant un mélange de cinq bactériophages, dont deux avec ADNsb. Pour garantir l'équité, tous les échantillons ont ensuite été préparés avec le kit xGen. Nos résultats indiquent que la méthode T7pol représente mieux les mélanges de phages originaux par rapport aux autres méthodes. De plus, lorsqu’il est appliqué à deux échantillons de viromes complexes, le traitement T7pol a amélioré à la fois la richesse et l’abondance de la fraction Microviridae. Nous concluons que le prétraitement T7pol est préférable au MDA pour le séquençage shotgun des viromes, qui est facile à mettre en œuvre et peu coûteux.

ADN polymérase T7, séquençage shotgun, ADN simple brin

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

T7 डीएनए पोलीमरेज़ उपचार डबल-स्ट्रैंडेड और सिंगल-स्ट्रैंडेड डीएनए वायरस दोनों की मात्रात्मक अनुक्रमण में सुधार करता है

वाइरोम शॉटगन अनुक्रमण से किसी दिए गए नमूने की केवल डबल-स्ट्रैंडेड डीएनए (डीएसडीएनए) सामग्री का पता चलता है, जब तक कि विशिष्ट उपचार लागू नहीं किया जाता है। हालाँकि, वायरस के जीनोम में अक्सर एक गोलाकार एकल-फंसे डीएनए (एसएसडीएनए) अणु होते हैं। एकाधिक विस्थापन प्रवर्धन (एमडीए) विधि का उपयोग करके डीएनए का पूर्व-उपचार और प्रवर्धन एसएसडीएनए को डीएसडीएनए में परिवर्तित करने में सक्षम बनाता है, लेकिन इस प्रक्रिया से इन परिपत्र एसएसडीएनए जीनोम का अति-प्रतिनिधित्व हो सकता है। एक्सजेन किट का उपयोग करने वाला एक और हालिया विकल्प एडेप्टर को सीधे कतरनी और विकृत डीएनए से जोड़ने की अनुमति देता है। हालाँकि, सोनिकेशन चरण dsDNA की तुलना में ssDNA को अधिक कुशलता से कतर सकता है, इसलिए वाइरोम अनुक्रमण में एक और पूर्वाग्रह का परिचय देता है। इन सीमाओं ने हमें मिश्रित एसएसडीएनए और डीएसडीएनए वाइरोम्स के अनुक्रमण के लिए डीएनए तैयार करने की एक वैकल्पिक विधि का पता लगाने के लिए प्रेरित किया। हम यहां ssDNA को dsDNA में परिवर्तित करने के लिए T7 डीएनए पोलीमरेज़ (T7pol) का उपयोग करके ssDNA और dsDNA दोनों को अनुक्रमित करने के लिए एक नई विधि प्रस्तुत करते हैं। हमने इस विधि की तुलना दो अन्य से की: 30 मिनट के ऊष्मायन के साथ एसएमडीए और एसएसडीएनए के बिना डीएसडीएनए रूपांतरण के लिए प्रत्यक्ष डीएनए कतरनी, पांच बैक्टीरियोफेज के मिश्रण का उपयोग करते हुए, जिसमें एसएसडीएनए के साथ दो शामिल हैं। निष्पक्षता सुनिश्चित करने के लिए, सभी नमूने बाद में xGen किट से तैयार किए गए। हमारे निष्कर्ष बताते हैं कि T7pol विधि अन्य विधियों की तुलना में मूल फ़ेज़ मिश्रण का बेहतर प्रतिनिधित्व करती है। इसके अलावा, जब दो जटिल वाइरोम नमूनों पर लागू किया गया, तो T7pol उपचार ने माइक्रोविरिडे अंश में समृद्धि और प्रचुरता दोनों में सुधार किया। हम यह निष्कर्ष निकालते हैं कि वायरोम्स की शॉटगन अनुक्रमण के लिए एमडीए के लिए टी7पोल प्रीट्रीटमेंट बेहतर है, जिसे लागू करना आसान और सस्ता है।

टी7 डीएनए पोलीमरेज़, शॉटगन अनुक्रमण, एकल-स्ट्रैंडेड डीएनए

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

T7 DNA ポリメラーゼ処理により、二本鎖と一本鎖の両方の DNA ウイルスの定量的シーケンスが改善されます。

Virome ショットガンシーケンスでは、特別な処理が適用されない限り、特定のサンプルの二本鎖 DNA (dsDNA) の内容のみが明らかになります。しかし、ウイルスのゲノムは多くの場合、環状一本鎖 DNA (ssDNA) 分子で構成されています。多重置換増幅 (MDA) 法を使用した DNA の前処理と増幅により、ssDNA から dsDNA への変換が可能になりますが、このプロセスにより環状 ssDNA ゲノムが過剰に表現される可能性があります。 xGen キットを使用した最近の代替方法では、切断および変性した DNA にアダプターを直接ライゲーションできます。ただし、超音波処理ステップは dsDNA よりも ssDNA を効率的に剪断する可能性があるため、バイローム配列決定に別のバイアスが導入されます。これらの制限により、混合 ssDNA および dsDNA ウイルスの配列を決定するための DNA 調製の代替方法を探索するようになりました。ここでは、T7 DNA ポリメラーゼ (T7pol) を使用して ssDNA を dsDNA に変換し、ssDNA と dsDNA の両方を配列決定する新しい方法を紹介します。この方法を他の 2 つの方法と比較しました。30 分間インキュベートする sMDA と、ssDNA を持つ 2 つを含む 5 つのバクテリオファージの混合物を使用した、ssDNA から dsDNA への変換を行わない直接 DNA 切断です。公平性を確保するために、すべてのサンプルはその後 xGen キットを使用して調製されました。我々の発見は、T7pol 法が他の方法と比較して元のファージ混合物をより良く表現していることを示しています。さらに、T7pol 処理を 2 つの複雑なバイロームサンプルに適用すると、マイクロウイルス科画分の豊富さと存在量の両方が改善されました。ウイルスのショットガンシークエンシングには、実装が簡単で安価な T7pol 前処理が MDA よりも好ましいと結論付けています。

T7 DNA ポリメラーゼ、ショットガンシーケンシング、一本鎖 DNA

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

O tratamento com DNA polimerase T7 melhora o sequenciamento quantitativo de vírus de DNA de fita dupla e de fita simples

O sequenciamento shotgun do Virome revela apenas o conteúdo de DNA de fita dupla (dsDNA) de uma determinada amostra, a menos que tratamentos específicos sejam aplicados. No entanto, os genomas dos vírus geralmente consistem em uma molécula circular de DNA de fita simples (ssDNA). O pré-tratamento e a amplificação do DNA usando o método de amplificação de deslocamento múltiplo (MDA) permitem a conversão de ssDNA em dsDNA, mas esse processo pode levar à super-representação desses genomas circulares de ssDNA. Uma alternativa mais recente que emprega o kit xGen permite a ligação de adaptadores diretamente ao DNA cortado e desnaturado. No entanto, o passo de sonicação pode cortar o ssDNA de forma mais eficiente do que o dsDNA, introduzindo assim outro viés na sequenciação do viroma. Essas limitações nos levaram a explorar um método alternativo de preparação de DNA para sequenciar viromas mistos de ssDNA e dsDNA. Apresentamos aqui um novo método para sequenciar tanto o ssDNA quanto o dsDNA, usando a DNA polimerase T7 (T7pol) para converter o ssDNA em dsDNA. Comparamos esse método com outros dois: sMDA com incubação de 30 minutos e cisalhamento direto de DNA sem conversão de ssDNA em dsDNA, usando uma mistura de cinco bacteriófagos, incluindo dois com ssDNA. Para garantir a imparcialidade, todas as amostras foram posteriormente preparadas com o kit xGen. Nossas descobertas indicam que o método T7pol representa melhor as misturas originais de fagos em comparação com os outros métodos. Além disso, quando aplicado a duas amostras complexas de viroma, o tratamento com T7pol melhorou tanto a riqueza como a abundância na fração Microviridae. Concluímos que o pré-tratamento com T7pol é preferível ao MDA para o sequenciamento shotgun de viromas, que é fácil de implementar e barato.

DNA polimerase T7, sequenciamento shotgun, DNA de fita simples

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

Секвенирование с помощью дробовика Virome выявляет только содержание двухцепочечной ДНК (дцДНК) в данном образце, если не применяются специальные методы лечения. Однако геномы вирусов часто состоят из кольцевой одноцепочечной молекулы ДНК (оцДНК). Предварительная обработка и амплификация ДНК с использованием метода множественной амплификации смещения (MDA) позволяет преобразовать оцДНК в дцДНК, но этот процесс может привести к чрезмерному представительству этих кольцевых геномов оцДНК. Более поздняя альтернатива, использующая набор xGen, позволяет лигировать адаптеры непосредственно к разрезанной и денатурированной ДНК. Однако этап обработки ультразвуком может расщеплять оцДНК более эффективно, чем дцДНК, что вносит еще одну ошибку в секвенирование вирома. Эти ограничения побудили нас изучить альтернативный метод подготовки ДНК для секвенирования смешанных виромов оцДНК и дцДНК. Мы представляем здесь новый метод секвенирования оцДНК и дцДНК с использованием ДНК-полимеразы Т7 (T7pol) для преобразования оцДНК в дцДНК. Мы сравнили этот метод с двумя другими: sMDA с 30-минутной инкубацией и прямым сдвигом ДНК без преобразования оцДНК в дцДНК, используя смесь пяти бактериофагов, в том числе двух с оцДНК. Для обеспечения справедливости все образцы впоследствии были подготовлены с использованием набора xGen. Наши результаты показывают, что метод T7pol лучше представляет исходные смеси фагов по сравнению с другими методами. Кроме того, при применении к двум сложным образцам вирома обработка T7pol улучшила как богатство, так и численность фракции Microviridae. Мы пришли к выводу, что предварительная обработка T7pol предпочтительнее MDA для дробового секвенирования виромов, которое легко реализовать и недорого.

bf386e73638a45788c92131d28февраля313 Обработка ДНК-полимеразой Т7 улучшает количественное секвенирование как двухцепочечных, так и одноцепочечных ДНК-вирусов. b4b32dc06ab34674ac225a3283dce821 ДНК-полимераза Т7, дробовое секвенирование, одноцепочечная ДНК

ДНК-полимераза Т7, дробовое секвенирование, одноцепочечная ДНК

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

T7 DNA 聚合酶处理改善了双链和单链 DNA 病毒的定量测序

病毒组鸟枪法测序只能揭示给定样本的双链 DNA (dsDNA) 含量，除非采用特殊处理。然而，病毒的基因组通常由环状单链 DNA (ssDNA) 分子组成。使用多重置换扩增 (MDA) 方法对 DNA 进行预处理和扩增，可以将 ssDNA 转化为 dsDNA，但此过程可能会导致这些环状 ssDNA 基因组的过度表达。最近的替代方案采用 xGen 试剂盒，允许将接头直接连接到剪切和变性的 DNA。然而，超声处理步骤可能比 dsDNA 更有效地剪切 ssDNA，因此在病毒组测序中引入了另一个偏差。这些限制促使我们探索另一种 DNA 制备方法，用于对混合 ssDNA 和 dsDNA 病毒组进行测序。我们在此提出一种对 ssDNA 和 dsDNA 进行测序的新方法，使用 T7 DNA 聚合酶 (T7pol) 将 ssDNA 转化为 dsDNA。我们将此方法与其他两种方法进行了比较：sMDA 孵育 30 分钟，直接 DNA 剪切，无需将 ssDNA 转换为 dsDNA，使用五种噬菌体的混合物，其中两种噬菌体带有 ssDNA。为了确保公平性，所有样品随后均使用 xGen 试剂盒制备。我们的研究结果表明，与其他方法相比，T7pol 方法更好地代表了原始噬菌体混合物。此外，当应用于两个复杂的病毒组样本时，T7pol 处理提高了微病毒科部分的丰富度和丰度。我们的结论是，在病毒组鸟枪法测序中，T7pol 预处理优于 MDA，易于实施且成本低廉。

T7 DNA 聚合酶、鸟枪法测序、单链 DNA

Submission: posted 20 December 2023, validated 20 December 2023
Recommendation: posted 18 June 2024, validated 03 July 2024

Cite this recommendation as:
Massart, S. (2024) Improving the sequencing of single-stranded DNA viruses: Another brick for building Earth's complete virome encyclopedia. Peer Community in Genomics, 100335. https://doi.org/10.24072/pci.genomics.100335

Recommendation

The wide adoption of high-throughput sequencing technologies has uncovered an astonishing diversity of viruses in most biosphere habitats. Among them, single-stranded DNA viruses are prevalent, infecting diverse hosts from all three domains of life (Malathi et al. 2014) with some species being highly pathogenic to animals or plants.

Sequencing of single-stranded DNA viruses requires a specific approach that usually leads to their over-representation compared to double-stranded DNA. The article from Billaud et al. (2024) addresses this challenge. It presents a novel and efficient method for converting single-stranded DNA to double-stranded DNA using T7 DNA polymerase before high-throughput virome sequencing. It compares this new method with the Phi29 polymerase method, demonstrating its advantages in the representation and accuracy of viral DNA content in well-defined synthetic phage mixtures and complex human virome samples from the stool. This T7 DNA polymerase treatment significantly improved the richness and abundance of the Microviridae fraction in their samples, suggesting a more comprehensive representation of viral diversity.

The article presents a compelling case for testing and adopting the T7 DNA polymerase methodology in preparing virome samples for shotgun sequencing. This novel approach, supported by comparative analysis with existing methodologies, represents a valuable contribution to metagenomics for characterizing virome diversity.

References

Billaud M, Theodorou I, Lamy-Besnier Q, Shah SA, Lecointe F, Sordi LD, Paepe MD, Petit M-A (2024) T7 DNA polymerase treatment improves quantitative sequencing of both double-stranded and single-stranded DNA viruses. bioRxiv, ver. 4 peer-reviewed and recommended by Peer Community in Genomics. https://doi.org/10.1101/2022.12.12.520144

Malathi VG, Renuka Devi P. (2019) ssDNA viruses: key players in global virome. Virus disease. 30: 3–12. https://doi.org/10.1007/s13337-019-00519-4

PDF recommendation

Conflict of interest:
The recommender in charge of the evaluation of the article and the reviewers declared that they have no conflict of interest (as defined in the code of conduct of PCI) with the authors or with the content of the article. The authors declared that they comply with the PCI rule of having no financial conflicts of interest in relation to the content of the article.

Funding:
This work was funded by INRAE and Sorbonne Université, as well as the ANR project PRIMAVERA

Reviews

Evaluation round #2

DOI or URL of the preprint: https://doi.org/10.1101/2022.12.12.520144

Version of the preprint: 3

Author's Reply, 17 Jun 2024

Download author's reply Download tracked changes file https://doi.org/10.24072/pci.genomics.100335.ar2

Decision by Sebastien Massart, posted 11 Jun 2024, validated 11 Jun 2024

Dear authors,

Thank you very much for resubmitting the new version of your research work.

Two anonymous reviewers and I have evaluated this resubmission. The adaptation improved the quality of the document. There are still some minor clarifications that are welcome before the final recommended version. You will find my few comments included in the attached document.

Another point of utmost importance is the availability of your raw data. I double checked them and it appeared the raw data only include the first experiment with phages while the virome data are not available. The open access of raw data is not only a requirement for high-quality publication, but it also plays a crucial role in advancing scientific knowledge. I kindly ask you to make these data available.

Thanks again for considering PCI in Genomics for your publication.
Kind regards,

Sébastien Massart

Download recommender's annotations

https://doi.org/10.24072/pci.genomics.100335.d2

Reviewed by Philippe Roumagnac, 28 May 2024

Dear Recommender,

My mistake, I do agree with the authors that the ssDNA viruses won’t be sequenced in theory, if nothing is done to ensure the sequencing of the ssDNA fraction. Therefore, the authors have fixed all my minor concerns. From my side, this revised version of the manuscript can be accepted.

Best regards,

Philippe

https://doi.org/10.24072/pci.genomics.100335.rev21

Reviewed by anonymous reviewer 3, 09 Jun 2024

The authors fulfilled the issues raised by the reviewers including me.

https://doi.org/10.24072/pci.genomics.100335.rev22

Evaluation round #1

DOI or URL of the preprint: https://doi.org/10.1101/2022.12.12.520144

Version of the preprint: 2

Author's Reply, 19 Apr 2024

Download author's reply Download tracked changes file https://doi.org/10.24072/pci.genomics.100335.ar1

Decision by Sebastien Massart, posted 20 Feb 2024, validated 21 Feb 2024

Dear Authors,

On behalf of the board of Peer Community in Genomics, I would like to thank you for considering it for sending your publication.

Two anonymous reviewers and I have read your document and found it interesting while deserving significant improvements. Indeed, there are currently several comments and suggestions that could improve the manuscript.

You can find the comments and suggestions of both reviewers and myself in this response and we thank you in advance for answering these point by point while adapting the text when necessary.

Kind regards,

Sébastien Massart

Comments from the recommender:

Summary

- In line 30 there is a reference to xgen kit which is a commercial kit. Could you instead explain the biochemical mechanism on which the kit is relying ?

- In line 35: the S before MDA should be explained (it is only defined in line 67)

- In Line 36: is the direct DNA shearing linked to Xgen kit or to the original shotgun sequencing ? It should be clearer (it is understood further in the document but the summary is self standing).

- Line 37: “including two with ssDNA” adapted to “among which 3 with dsDNA and 2 with ssDNA genome” ?

- Results: it is mainly material and methods while the results are very (too ?) synthetic. Please adapt the text.

Introduction

- Line 59: low amount of DNA or of viral DNA ? Please clarify

- Line 69-71: same point as abstract: explaining biochemical/technological principle and referring to commercial kit further on (inverting order with following lines explaining the protocol)

- Lines 76-79: is there any scientific reference for it ?

- Lines 82-86: these are already results. Could you instead summarize the methodology (testing pure virome + virome with cells) ? + comment for lines 92-93

Material and methods

- Lines 92-93: objective statement would fit well at the end of the introduction instead of methods.

- Lines 92-93: The objective isn’t it for comparing protocols of ssDNA enrichment (the mixes are the tools to reach the objective)?

- Lines 97-98: can the third method be considered as a control?

- Line 101: abbreviate NEB at first occurrence.

- Line 109: is there a reference (publication, official repository) for strain C?

- Lines 107-110: the two phages should be presented in the first sentence (that only describes PhiX174) and developed in two independent sentences further on

- Lines 111-116: same as for ssDNA: citing the 3 phages in first sentence and describing each of them next.

- Line 119: citing the provider of PES membranes.

- Line 120: reference to LB medium (provider).

- Line 121: is there a temperature during centrifugation? or room temperature?

- Line 132: what is SM buffer?

- Line 133: CaCl2 not correct form for 2

- Lines 136-137: centrifugation steps not described for PCA extraction.

- Phage stock preparation and DNA Extraction:

o overall, it is not clear for me what has been carried out on which phage. Indeed, there is the description of a DNAse treatment in lines122-125 and another one in lines 131-133 (including a RNAse).

o Phage DNA preparation only related to T4, SPP1 and lambda (Line 128): what about PhiX174 and M13-ypf ?

- Line 139: estimating a concentration using nanodrop (for ssDNA) is not so reliable. Could you provide evidence that nanodrop allowed accurate concentration evaluation? Why not using another system such as Qubit ssDNA Assay Kit? (in link with comments of a reviewer)

- Lines 141-142: is the protocol identical to the one described just above or not?

- Lines 143-144: why using different volumes for each treatment ? It could introduce bias potentially. Can the authors discuss it (explaining also the rationale of the choice - potentially linked to manufacturer instruction?)

- Line 148 and elsewhere: stating supplementary table 1, 2.. in full letters

- Line 155: provider of the kit (overall comment: please double check that the providers are mentioned when using kit/enzyme/reagent)

- Line 192: the final temperature was always 46°C after 10 minutes at room temperature?

- Line 203: consistency when citing the provider of phages and DNA (adding reference number as before)

- Line 215: a mix of 6 PCR fragment is mentioned but its origin is not clear as there is no PCR step mentioned in this paragraph (if produced elsewhere, please indicate how or a reference or their size …)

- Line 228-232: there is no indication of mix or reference to publication for the details on the methodology and reagents used for xGen kit (beyond temperatures and time)

- Line 234-235: this depth also concerns the fecal samples? Better to move this depth in results, integrating fecal samples.

- Line 251: is there any specific buffer for this treatment? If so, please mention it

- Line 264-264: please clarify the two approaches because the software approach also uses databases (Vibrant uses KEGG, pfam and VOG)

- Line 267: a bracket is missing after 21

- Line 273: “of both approaches”

- Lines 273-275: it is another approach used (5th one), is it done downstream of another approach or it also starts from contigs. ?

- Bioinformatics analyses of fecal samples :

o a flowchart indicating the analyses done could help having a global picture (as further mapping of normalized reads was carried out). It is not clear how to reproduce the succession of analyses and it needs clarification.

o There is a set of viral contigs for mapping but how is it selected as several methods are used to identify the viral contigs: any viral contig from any method OR only viral species identified by all methods OR …. Please clarify how the set of contigs has been built up.

- Lines 276: all samples -> “The four samples”.

Results

- Line 304: nanodrop is not really reliable for accurate quantification (same comment as before and as a reviewer)

- Line 325: I do not find the mix’ explanations (1:1 in volume? in quantity?) in Methods

- Figure 2: is the standard from a commercial provider or made internally ?

- Setting up a ssDNA-to-dsDNA conversion protocol using T7 DNA polymerase: I do not see any replication of the test nor any test with other organisms (or other proportion between both viruses). There is therefore no information on the reproducibility of the observations. Next chapter shows it worked but can the authors explain why there is no replication carried out (for ensuring the selection of 25 µM for example).

- Line 334: “way of treating DNA” -> protocols

- Line 338: see Suppl. Table 3 enough between brackets

- Figure 3: the legend can be completed referring to Rel value. A distinction in wording between the theorical initial proportion (see comment on quantification protocol) and observed proportion after high throughput sequencing is welcome.

- Figure 3: “Various” sample is vague.

- Suppl. Table 3: distinguish ssDNA from dsDNA phages to facilitate reading of the table.

- Lines 349-358: this is mainly a repetition of the methods. It is very clear so it can be fused with duplicated information in methods.

- Line 376: Aitchison distance calculation is not described in Methods while used in results.

- Line 390: “treated or not with…”

- Line 393: “assembled in…”

- Viromes analyses: globally, numbers would be interesting (number of reads, of contigs, of viral contigs assigned by each algorithm or in total).

- Line 394: it is not clear how the contigs were assigned to microviridae (among the various pipelines used).

- Line 396: microviridae abundance of 24 to 74%: of the total number of reads or of the viral reads ? Stating it has been obtained after mapping is relevant

- Lines 397-399: harmonize the wording: 6 microviridae contigs & 11 microviridae.

- Overall: why the analysis focused only on microviridae and not dsDNA phages ? Indeed, the results on phage mixes show that the reads from dsDNA can drop with T7 polymerase (Lambda in Panel A-2 for example). This could give a global overview of the performance of T7 polymerase treatment

Discussion

- Line 432-433: This is completely true. Nevertheless, there was no replication for some steps of the optimization (for example selecting the primer concentration at 25 µM).

- Line 435: should limit the bias as they still exist based on the presented results

- Line 445: could the 2-fold observation be caused by a saturation in ssDNA in the mix (98%), limiting therefore the further amplification? Does it worth discussing this value?

- Lines 463-465: this comparison is very interesting and raise the question on the actual potential results of the T7 treatment when using Truseq or Nextera. It should be noted that there is no data supporting it. Could the results be different with these kits or guaranteed similar?

- Lines 474-476: this is a very interesting preliminary comparison and linked to the the number of species detected in both samples. Are they comparable to literature or not for microviridae in fecal samples (somehow, 8 species might be very low ?)

- Line 480: “can improve the sequencing of ssDNA viruses”

https://doi.org/10.24072/pci.genomics.100335.d1

Reviewed by anonymous reviewer 1, 06 Feb 2024

This study aimed at providing an alternative method of environmental VLP-derived DNA preparation before NGS-based virome sequencing. The key point of current suggestion is conversion of dsDNA genome into ssDNA genome using T7 polymerase in DNA preparation step. The authors compared the new method with the previous two methods, MDA and xGen kit, and found that the new method minimized the deviations at least from over-estimation of ssDNA viral genomes by MDA and from under-estimation of ssDNA viral genomes by DNA shearing of xGen kit. The T7 pol method is quite convincing, and is expected that more accurate abundance of dsDNA and ssDNA viruses could be estimated in metagenome studies of environmental virome. The following questions make the manuscript strengthen scientifically more, I believe.

1. A synthetic viral mixture was prepared, and three methods were applied to the mixture to be compared (only xGen kit vs. xGen kit + MDD vs. xGen kit + T7 DNA pol). It is thought that T7 DNA pol with no xGen kit would be considered a standard control in this study, as described “it can be applied to samples planned for any kind of downstream library preparation kit for low DNA amounts, such as the Nextera or TruSeq DNA nano kits from Illumina.”. However, the authors set xGen kit as a standard control. I do not think that xGen kit is necessary for T7 polymerase-used virome sequencing, and thus, two groups (no xGen kit, no xGen kit + T7 DNA pol) need to be compared additionally with three methods.

2. The Qubit and Nanodrop devices were used for estimating absolute concentration of dsDNA and ssDNA phage stocks. It is quite not sure that measuring DNA concentration using Nanodrop is accurate. In addition, the number of M13-yfp phage stock was estimated using plaque assay. Three different methods make some predicting exact number of phage genomes, and the deviation from three different methods may have an impact on assessing the abundance of ssDNA viral genomes from three DNA preparation methods.

3. According to the previous study (Appl Environ Microbiol, 2010, 76(15):5039-5045), it is thought that conversion of dsDNA into ssDNA works with E. coli DNA polymerase I in the presence of random hexamers. In this study, T7 polymerase was chosen for conversion of dsDNA into ssDNA, instead of E. coli DNA polymerase I that is commonly used in molecular biology techniques. Please, explain kindly why T7 polymerase was chosen for the DNA conversion.

https://doi.org/10.24072/pci.genomics.100335.rev11

Reviewed by anonymous reviewer 2, 14 Feb 2024

The manuscript entitled: “Method for preparing virome DNA that allows sequencing of both double-stranded and single-stranded DNA viruses” is a well-written manuscript that further explores an alternative method of DNA preparation for sequencing both ssDNA and dsDNA viruses.

This manuscript is remarkably precise and rigorous, and offers a clear protocol that will be of great benefit to the scientific community working on DNA phages.

I have a major concern concerning the concordance between the title of the pdf file that I reviewed (Method for preparing virome DNA that allows sequencing of both double-stranded and single-stranded DNA viruses) and the title shown on BioRxiv website (T7 DNA polymerase treatment improves quantitative sequencing of both double-stranded and single-stranded DNA viruses). Same problem for the lists of authors that are different between the pdf file and the BioRxiv website. This is an issue that should be fixed.

Apart from this issue, I only have two minor concerns:

Summary/Background: the authors state that “Virome shotgun sequencing only reveals the double-stranded DNA (dsDNA) content of a given sample, unless specific treatments are applied”. They may have said that this statement is true when viral genomes are analyzed from “bulk” metagenomes which include both virus particles and microbial cells. Alternatively, semi-purification protocols of virus particle have proved to be efficient (albeit cumbersome) for better detecting ssDNA viruses. The authors may have mentioned in the introduction that these two alternatives (direct shotgun and virus particle enrichment) metagenomics approaches exist and briefly give insights about ssDNA virus yields using both types of approaches.

In the “T7pol treatment of two viromes” paragraph, you noted that 6 of the Microviridae contigs detected in the untreated samples were not present in the T7pol sample, while conversely, 11 Microviridae were found only in the T7pol sample”. This result is illustrated by the Suppl. Table 4. I here have missed a number of elements. I would have liked finding in this Table: the length of the contigs, their taxonomic assignment, the %identity shared between these contig and the Microviridae species stored in the International databases for which they matched, etc. Finally, it would have been welcome to find plylogenetic trees (one using the major capsid protein sequences, and another one using the replication gene sequences). This would have helped figure out whether the “missing” contigs clustered in the phylogenetic tree or were scattered around it.

P3L76: ss- and dsDNA

P4L107: I would have written “is a member of the Microviridae family” rather than “a Microviridae”. This correction would need to be done throughout the ms.

P4L110: Inoviridae in italics

P7L248-256: Indicate that the “sample numbers” of the two healthy donors are “S4” and “S18”.

https://doi.org/10.24072/pci.genomics.100335.rev12

User comments

No user comments yet

or Register
Submit a preprint