Skip to content

Linked sequences#190

Open
colinvwood wants to merge 3 commits intoqiime2:devfrom
colinvwood:linked-sequences
Open

Linked sequences#190
colinvwood wants to merge 3 commits intoqiime2:devfrom
colinvwood:linked-sequences

Conversation

@colinvwood
Copy link
Copy Markdown
Contributor

Allow denoise-paired to rescue and return unmerged sequences

@github-project-automation github-project-automation Bot moved this to Backlog in 2026.4 🌱 Mar 30, 2026
@colinvwood colinvwood moved this from Backlog to In Development in 2026.4 🌱 Mar 30, 2026
@colinvwood
Copy link
Copy Markdown
Contributor Author

needs qiime2/q2-types#388

@colinvwood colinvwood added the stat:blocked This cannot be resolved until something else has changed. label Mar 30, 2026
@colinvwood
Copy link
Copy Markdown
Contributor Author

@ebolyen

@colinvwood
Copy link
Copy Markdown
Contributor Author

disclaimer: the run_dada.R diff is 99% codex and the test_denoise.py diff is about 50/50 codex/de moi

@colinvwood colinvwood self-assigned this Mar 31, 2026
@colinvwood
Copy link
Copy Markdown
Contributor Author

One outstanding decision is how we want to restructure the stats table to accommodate these changes.

@Oddant1 Oddant1 moved this from In Development to In Review in 2026.4 🌱 Apr 3, 2026
@Oddant1 Oddant1 assigned Oddant1 and unassigned colinvwood Apr 3, 2026
@Oddant1 Oddant1 self-requested a review April 3, 2026 17:33
}

# combine the merged and unmerged/concatenated feature tables
if(inp.dirR !='NULL' && retain.unmerged){
Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

do we need inp.dirR guard here

### Remove chimeras
cat("5) Remove chimeras (method = ", chimeraMethod, ")\n", sep="")
if(chimeraMethod %in% c("pooled", "consensus")) {
if(chimeraMethod %in% c("pooled", "consensus") && ncol(seqtab) > 0) {
Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

is ncol(seqtab) > 0 a necessary guard here?

),
abundance=unmerged.j$abundance,
stringsAsFactors=FALSE
)
Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

think about whether we can condense at the sample level instead of below separately. Make sure we still have enough information to create e.g. a number unmerged column in the stats table

}

# combine the merged and unmerged/concatenated feature tables
if(inp.dirR !='NULL' && retain.unmerged){
Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

this entire block will go away if we do sample level table merging

# single space-delimited representation
if(length(unmerged.ids) > 0){
unmerged.keep <- intersect(colnames(seqtab.nochim), unmerged.ids)
if(length(unmerged.keep) > 0){
Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We need to make sure we ONLY replace the 'N' * 10 that we introduced and not any 'N' * 10s that may have already been present in either the forward or reverse (credit to @Oddant1 for noticing this)

Copy link
Copy Markdown
Contributor Author

@colinvwood colinvwood Apr 3, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

at the very least we error, but we know what the forward and reverse sequences are, so we can deduce what the artificially introduced 'N' * 10 is and remove only it

Copy link
Copy Markdown
Member

@Oddant1 Oddant1 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@colinvwood's comments came from the review. Needed to look it over with him to make sure I knew what the R was doing.

@Oddant1 Oddant1 assigned colinvwood and unassigned Oddant1 Apr 6, 2026
@Oddant1 Oddant1 moved this from In Review to In Development in 2026.4 🌱 Apr 6, 2026
@lizgehret lizgehret removed this from 2026.4 🌱 Apr 23, 2026
@github-project-automation github-project-automation Bot moved this to Backlog in 2026.7 🐐 Apr 23, 2026
@lizgehret lizgehret moved this from Backlog to In Development in 2026.7 🐐 Apr 23, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

stat:blocked This cannot be resolved until something else has changed.

Projects

Status: In Development

Development

Successfully merging this pull request may close these issues.

4 participants