PAQman: reference-free ensemble evaluation of long-read genome assemblies.
O'Donnell S, Li N, Steenwyk JL, Geiser DM, Martin FN
Summary
PubMedPAQman is a tool that helps researchers evaluate whether genome sequences are accurate and complete, without needing an existing reference sequence to compare against. It combines seven quality checks into one streamlined framework.
chevron_right Technical Details
Key Findings
Tool measures 7 reference-free quality features: Contiguity, Gene content, Completeness, Accuracy, Correctness, Coverage, and Telomerality
Integrates multiple commonly used assessment programs alongside custom scripts in a unified framework
Requires only query genome assembly and underlying long-read sequencing data as inputs
Original Abstract
Advances in long-read sequencing have made it easier and more cost effective to generate high-quality genome assemblies. However, assessing assembly quality remains a challenge, as existing tools often focus on a few metrics and/or require a reference assembly for comparison. Furthermore, the number of available metrics and associated tools for genome evaluation have expanded in recent years, making it more difficult for researchers to easily use and develop comprehensive pipelines. To address this, we developed the Post-Assembly Quality manager (PAQman), a tool that lowers the barrier to entry for assembly quality assessment by measuring seven reference-free features of genome quality within a single framework: Contiguity, Gene content, Completeness, Accuracy, Correctness, Coverage, and Telomerality. PAQman integrates multiple commonly used tools alongside custom scripts, requiring users to provide only a query genome assembly and its underlying long-read data, while providing a streamlined and consistent framework for quality assessment across datasets.
This connects to 4 other discoveries — 0 species, 4 topics, 0 related articles