Map gene names to Ensembl
Arguments
- genes
Gene names (e.g.
"TUT4").- organism
character(1). Full Latin organism name (e.g."Homo sapiens").- genomeBuild
character(1). Ensembl genome build assembly name (e.g."GRCh38"). If setNULL, defaults to the most recent build available. Note: don't pass in UCSC build IDs (e.g."hg38").- release
integer(1). Ensembl release version (e.g.100). We recommend setting this value if possible, for improved reproducibility. When left unset, the latest release available via AnnotationHub/ensembldb is used. Note that the latest version available can vary, depending on the versions of AnnotationHub and ensembldb in use.
Examples
## Homo sapiens.
x <- mapGeneNamesToEnsembl(
genes = c("TUT4", "ZCCHC11", "TENT3A"),
organism = "Homo sapiens",
genomeBuild = "GRCh38",
release = 109L
)
#> → Importing HGNC complete set.
#> → Importing /Users/mike/.cache/R/AcidGenomes/BiocFileCache/101476b2a17e6_hgnc_complete_set.txt using base::`read.table()`.
print(x)
#> [1] "ENSG00000134744" "ENSG00000134744" "ENSG00000134744"
## Mus musculus
x <- mapGeneNamesToEnsembl(
genes = c("Nfe2l2", "Nrf2"),
organism = "Mus musculus",
genomeBuild = "GRCm39",
release = 109L
)
#> → Importing /Users/mike/.cache/R/AcidGenomes/BiocFileCache/10707adf835c_Mus_musculus.GRCm39.109.entrez.tsv.gz using base::`read.table()`.
#> → Downloading Mus musculus gene info from NCBI at <https://ftp.ncbi.nih.gov/gene/DATA/GENE_INFO/Mammalia/Mus_musculus.gene_info.gz>.
#> → Importing /Users/mike/.cache/R/AcidGenomes/BiocFileCache/107077083a4b8_Mus_musculus.gene_info.gz using base::`read.table()`.
print(x)
#> [1] "ENSMUSG00000015839" "ENSMUSG00000015839"