Map gene names to Ensembl
Arguments
- genes
Gene names (e.g.
"TUT4"
).- organism
character(1)
. Full Latin organism name (e.g."Homo sapiens"
).- genomeBuild
character(1)
. Ensembl genome build assembly name (e.g."GRCh38"
). If setNULL
, defaults to the most recent build available. Note: don't pass in UCSC build IDs (e.g."hg38"
).- release
integer(1)
. Ensembl release version (e.g.100
). We recommend setting this value if possible, for improved reproducibility. When left unset, the latest release available via AnnotationHub/ensembldb is used. Note that the latest version available can vary, depending on the versions of AnnotationHub and ensembldb in use.
Examples
## Homo sapiens.
x <- mapGeneNamesToEnsembl(
genes = c("TUT4", "ZCCHC11", "TENT3A"),
organism = "Homo sapiens",
genomeBuild = "GRCh38",
release = 109L
)
#> → Importing HGNC complete set.
#> → Importing /Users/mike/.cache/R/AcidGenomes/BiocFileCache/101476b2a17e6_hgnc_complete_set.txt using base::`read.table()`.
print(x)
#> [1] "ENSG00000134744" "ENSG00000134744" "ENSG00000134744"
## Mus musculus
x <- mapGeneNamesToEnsembl(
genes = c("Nfe2l2", "Nrf2"),
organism = "Mus musculus",
genomeBuild = "GRCm39",
release = 109L
)
#> → Importing /Users/mike/.cache/R/AcidGenomes/BiocFileCache/10707adf835c_Mus_musculus.GRCm39.109.entrez.tsv.gz using base::`read.table()`.
#> → Downloading Mus musculus gene info from NCBI at <https://ftp.ncbi.nih.gov/gene/DATA/GENE_INFO/Mammalia/Mus_musculus.gene_info.gz>.
#> → Importing /Users/mike/.cache/R/AcidGenomes/BiocFileCache/107077083a4b8_Mus_musculus.gene_info.gz using base::`read.table()`.
print(x)
#> [1] "ENSMUSG00000015839" "ENSMUSG00000015839"