Skip to content

GitLab

  • Projects
  • Groups
  • Snippets
  • Help
    • Loading...
  • Help
    • Help
    • Support
    • Community forum
    • Submit feedback
    • Contribute to GitLab
  • Sign in
P
phylodoc
  • Project overview
    • Project overview
    • Details
    • Activity
  • Issues 19
    • Issues 19
    • List
    • Boards
    • Labels
    • Service Desk
    • Milestones
  • Merge Requests 0
    • Merge Requests 0
  • CI / CD
    • CI / CD
    • Pipelines
    • Jobs
    • Schedules
  • Operations
    • Operations
    • Incidents
    • Environments
  • Analytics
    • Analytics
    • CI / CD
    • Value Stream
  • Wiki
    • Wiki
  • Members
    • Members
  • Collapse sidebar
  • Activity
  • Create a new issue
  • Jobs
  • Issue Boards
  • phyloalps
  • phylodoc
  • Wiki
  • data

Last edited by Anthony Hombiat Nov 07, 2017
Page history
This is an old version of this page. You can view the most recent version or browse the history.

data

The PhyloAlps Project

Home | Data | Model | Technologies | User Profiles | Meetings

Data types

  • Linnaean taxonomy
    • Identifier
    • Basionym i.e. binomial name (genus + species)
    • Synonyms
      • Nomenclature synonym
      • Replaced synonym
  • Herbarium (alcoothèque/silicathèque)
    • Photos (photo server ? existing herbarium numeric model ?)
    • Location (area/GPS coordinates)
    • Collection datetime
  • Occurrences
    • Photos (photo server ? existing herbarium numeric model ?)
    • Location (area/GPS coordinates)
    • Observation datetime
    • Phenotypic traits
    • Noteworthy criteria
  • DNA sequences
    • Genome skimming
    • RAD sequencing
    • Exons capture
  • Population distribution within an area
  • Taxon coverage

Data providers

  • SAJF PhyloAlps Herbarium (work in progress), QR code (species and DNA sequence identification)
  • CBNA Herbarium: Conservatoire Botanique National Alpin à Briançon
  • LECA Androsace: Phenotypic traits for Alpine plants
  • GBIF: Global Biodiversity Information Facility (occurrences, observations, /!\ coverage)
  • IFB: Institut Français pour la Bioinformatique
  • FRB: Fondation pour la Recherche en Biodiversité
  • BOLD: metabarcoding (not complete genome, taxonomic markers, 2 genes: matK et rbcL)
  • INSD: International Nucleotide Sequence Database collaboration :
    • NCBI GenBank: National Center for Biotechnology Information
    • EMBL: European Molecular Biology Laboratory
    • DDBJ: DNA DB of Japan European Molecular Biology Lab (down for the moment)
  • Phylota: plants browser
  • FloraAlpina (paper version only)
  • Tela Botanica: association de botanique de référence pour la botanique numérique
  • Plant list (cf. Kew project: exons sequencing)
  • Biocatalogue: The Life Science Web Services Registry
  • EOL: Encyclopedia of Life
  • USDA: United States Department of Agriculture
  • Mobot: MissOuri BOTanical garden

Data sources

Name Data type Availability Access Comment
PhyloAlps Herbarium Nomenclature, location, photos SAJF data QR codes Work in progress
CBNA Herbarium Nomenclature, location, photos CC BY via GBIF
GBIF Occurrences (nomenclature, location, photos) CC0 -> CC BY-NC REST API Crowdsourcing quality
Androsace Phenotypic traits LECA data HMI
FloraAlpina Phenotypic traits © Book Paper version only
BOLD DNA barcoding © -> CC BY-NC-ND REST API
NCBI GenBank E-utilities Taxonomy, DNA data bank ODbL REST API
EMBL-EBI Ensembl Taxonomy, DNA data bank Third Party REST API, SPARQL endpoint Clients available for Perl/Python/Java
DDBJ WABI Taxonomy, DNA data bank © REST API Some docs in Japanese
iPlant TNRS Nomenclature, synonyms Available REST API Fuzzy name matching
Plant list  Nomenclature, synonyms CC BY-NC-ND HMI > 10 data sources, 3-levels trust model
PlantMiner Nomenclature, synonyms Available REST API
Tela Botanica eFlore Nomenclature, synonyms CC BY-NC-SA REST API Beta version
FCBN SIFlore Distribution map Unavailable HMI
EOL Nomenclature Third party REST API
USDA Plants Taxonomy, location, photos © -> public domain REST API
Mobot APG Taxonomy HTML Web pages

Data sharing

  • Institutions
    • OBO Foundry: Open Biomedical Ontology Foundry
    • HCLSIG: W3C Health Care and Life Sciences Interest Group
    • TDWG: Biodiversity Information Standards group
    • BGCI: Botanic Gardens Conservation International
    • NCEI: National Center for Environmental Information
    • ISB: International Society for Biocuration
    • Species 2000
    • BioSharing
  • Standards & frameworks
    • LSID: Life Science ID
    • SPICE: Species 2000 Interoperability Co-ordination Environment
    • SDD: Structure of Descriptive Data
    • ABCD: Access to Biological Collections Data schema
    • DwC: Darwin Core
    • ISA: Investigation, Study and Assay framework
    • BioDBcore: generic description of the core attributes of biological databases
    • TCS: Taxonomic Concept (Transfer) Schema
  • Ontologies
    • GO: Gene Ontology
    • PO: Plant Ontology
    • GONG: Gene Ontology Next Generation
    • MGED: Micro Array Gene Expression Data
  • Gene-oriented Markup languages
    • MAGE-ML: MicroArray Gene Expression Markup Language
    • KGML: KEGG Kyoto Encyclopedia of Genes and Genomes Markup Language
  • Tools
    • TSE: Taxonomic Search Engine
    • BioCASE: Biological Collection Access SErvice
    • iPhylo: Taxonomic id reconciliation with Google Refine

Data quality

  • Multiple stages data validation process
    • syntactic & semantic check (automatic)
    • outlier check (semi-automatic)
  • Multi-criteria data trust model
  • Issue tracking system
  • Bridge the gap between multiple taxonomic referentials taxonomiques:
    • NCBI
    • Plant List
    • Tax Ref
    • Mobot APG
  • Version control
Clone repository
  • archi
  • biblio
  • data
  • Home
  • meetings
  • meetings
    • md
      • 2017.02.20
      • 2017.03.14
      • 2017.03.27
      • 2017.04.11
      • 2017.04.26
      • 2017.05.12
      • 2017.06.29
      • 2017.07.21
      • 2017.08.02
      • 2017.09.20
View All Pages