Skip to content

tkosciol/qiita

 
 

Repository files navigation

Qiita (canonically pronounced cheetah)

Build Status Coverage Status Gitter

Advances in sequencing, proteomics, transcriptomics and metabolomics are giving us new insights into the microbial world and dramatically improving our ability to understand their community composition and function at high resolution. These new technologies are generating vast amounts of data, even from a single study or sample, leading to challenges in storage, representation, analysis, and integration of the disparate data types. Qiita was designed to allow users address these new challenges by keeping track of multiple studies with multiple ‘omics data. Additionally, Qiita is capable of supporting multiple analytical pipelines through a 3rd-party plugin system, allowing the user to have a single entry point for all their analyses. Qiita’s main site provides database and compute resources to the global community, alleviating the technical burdens, such as familiarity with the command line or access to compute power, that are typically limiting for researchers studying microbial ecology.

Qiita is currently in alpha status. We are very open to community contributions and feedback. If you're interested in contributing to Qiita, see CONTRIBUTING.md. If you'd like to report bugs or request features, you can do that in the Qiita issue tracker.

To install and configure your own Qiita server, see INSTALL.md.

For more specific details about qiita visit the Qiita main site tutorial.

Current features

  • Full study management: Create, delete, update samples in the sample and multiple preparation information files.
  • Upload files via direct drag & drop from the web interface or via scp from any server that allows these connections.
  • Study privacy management: Sandboxed -> Private -> Public.
  • Easy long-term sequence data deposition to the European Nucleotide Archive (ENA), part of the European Bioinformatics Institute (EBI) for private and public studies.
  • Raw data processing for:
    • Target gene data: we support deblur against GreenGenes (13_8) and close reference picking against GreenGenes (13_8) and Silva.
    • Metagenoic/Shotgun data: we support Shogun processing. Note that this data is suitable for download and further down analyses but we don't recommend analysis within Qiita.
    • biom files can be added as new preparation templates for downstream analyses; however, this cannot be made public.
  • Basic downstream analyses using Qiime2.
  • Basic study search in the study listing page.
  • Complex metadata search via redbiom.

For more detailed information visit the Qiita tutorial and the Qiita help.

Accepted raw files

  • Multiplexed SFF
  • Multiplexed FASTQ: forward, reverse (optional), and barcodes
  • Per sample FASTQ: forward
  • Multiplexed FASTA/qual files

Known issues

  • The metadata search is broken
  • Analysis will fail for non 16S datasets and/or multiple processed data from the same study
  • https is down for *.microbio.me

Roadmap

The following is a non-exhaustive list of features that we plan to add in the future.

  • Allow to update any kind of biom for data analysis and visualization
  • Integration of other pipelines via artifacts. Processing of raw data in external sources. For example, metabolomics processing in GNPS and data visualization in Qiita.
  • Creation of a REST API to query and access the data hosted by Qiita.
  • Improved analysis pipeline for target gene datasets.
  • Crowd-sourced metadata curation of existing studies: improve the metadata of existing studies by submitting “fix proposals” to the authors of the study.

About

Qiita - A multi-omics databasing effort

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Python 68.6%
  • HTML 25.5%
  • PLpgSQL 2.6%
  • JavaScript 2.5%
  • SQLPL 0.4%
  • Makefile 0.2%
  • Other 0.2%