Helper function to select which experimental model(s) will be used to
generate the qSVs. Degradation-associated transcripts are derived in four
different models (transcripts). The cell_component
parameter
controls whether the models with cell-type proportions are included. This
function extract the top top_n
transcripts found to be significant in
each considered model, then returns the union of transcripts across all
considered models. Up to 10,000 transcripts are available to select from
each model prior to taking the union.
select_transcripts(top_n = 1000, cell_component = FALSE)
An integer(1)
specifying how many significant transcripts to
extract from each model prior to taking a union across models.
A logical(1)
. If FALSE
, only include transcripts
from the main and interaction models (see main_model
and int_model
here: transcripts). If TRUE
, additionally include main and
interaction models that include cell-type proportions (a total of 4 models).
A character()
with the transcript IDs.
## Default set of transcripts associated with degradation
sig_transcripts <- select_transcripts()
length(sig_transcripts)
#> [1] 1866
head(sig_transcripts)
#> [1] "ENST00000263388.6" "ENST00000264930.9" "ENST00000378536.4"
#> [4] "ENST00000636484.1" "ENST00000424611.5" "ENST00000447643.1"
## Select more transcripts if desired
length(select_transcripts(top_n = 5000))
#> [1] 8586