omicverse.alignment.build_amplicon_anndata#
- omicverse.alignment.build_amplicon_anndata(otutab_tsv, asv_fasta, sintax_tsv=None, sample_metadata=None, sample_order=None)[source]#
Compose
anndata.AnnDatafrom vsearch stepwise outputs.- Parameters:
otutab_tsv (
str) – Path to the--otutaboutTSV fromvsearch.usearch_global(). First column = ASV id, remaining columns = sample ids.asv_fasta (
str) – Path to the ASV centroid FASTA (output ofvsearch.unoise3()or the non-chimera FASTA fromvsearch.uchime3_denovo()).sintax_tsv (
Optional[str] (default:None)) – Optional path to the vsearch--sintax --tabbedoutTSV. When supplied, 7-rank taxonomy columns (domain/phylum/class/order/family/genus/species), the;-joinedtaxonomystring andsintax_confidenceare written intovar.sample_metadata (
Optional[DataFrame] (default:None)) – Optional DataFrame indexed by sample id; merged intoobs.sample_order (
Optional[Sequence[str]] (default:None)) – Optional list of sample ids to enforce row order inobs/X. Defaults to the column order of the otutab TSV.
- Returns:
Xis ascipy.sparse.csr_matrixof int32 counts (samples × ASVs). Same schema asamplicon_16s_pipeline().- Return type: