GenomeHubs
  • Introduction
  • Introduction
    • GenomeHubs
    • Files and containers
    • Demo
  • Quick Start
    • 1. Prepare
    • 2. Setup MySQL database server
    • (optional) Test Ensembl browser
    • 3. Import assembly and gene models
    • 4. Export files
    • 5. Run analyses
    • 6. Import analysis results
    • 7. Update meta
    • 8. Start download site
    • 9. Start BLAST server
    • 10. Start search container
    • 11. Edit Ensembl plugin
    • 12. Start Ensembl browser
  • Next Steps
    • Setup with multiple hosts
    • Understand the GFF parser
    • Import additional assemblies
    • Run comparative analyses
    • Import variation data
    • Add track hubs
    • Connect using Perl API
    • Set up REST API
Powered by GitBook
On this page

Was this helpful?

  1. Introduction

GenomeHubs

PreviousIntroductionNextFiles and containers

Last updated 4 years ago

Was this helpful?

View this documentation at

[1] provide a straightforward way to create a collection of web services to make annotated genome assemblies accessible to a wide community of users. GenomeHubs use containers to package each of the component tools and their dependencies, simplifying the process of setting up and importing data from FASTA and GFF files into:

  • a custom genome browser

  • a BLAST server

  • an powered downloads server

[1] Challis RJ, Kumar S, Stevens L & Blaxter M (2017) GenomeHubs: simple containerized setup of a custom Ensembl database and web server for any species. Database, 2017:bax039 .

GenomeHubs Docker containers are shown in rounded boxes with a double outline and the hosted sites are shown in plain boxes.

GenomeHubs containers are linked by use of common file formats or through a MySQL container via the Ensembl API (arrows show the flow of information) so it is relatively straightforward to expand the feature set by creating new containers to host additional services.

The full set of GenomeHubs Docker containers also includes tools to export data from Ensembl databases into standard file formats, and containers to run analyses on these files that can be imported back into Ensembl database format for display alongside the sequences and gene model data:

  • Blastp against Swissprot to add functional annotations

  • InterProScan to annotate protein domains

  • RepeatMasker to identify repetitive elements

  • Cegma and Busco genome completeness assessments

genomehubs.gitbooks.io
GenomeHubs
Docker
Ensembl
SequenceServer
h5ai
doi:10.1093/database/bax039