two graphs, three responses

34
Two graphs, three responses @rdmpage http://iphylo.blogspot.com

Upload: roderic-page

Post on 20-Mar-2017

3.435 views

Category:

Science


0 download

TRANSCRIPT

Page 1: Two graphs, three responses

Two graphs, three responses

@rdmpage

http://iphylo.blogspot.com

Page 2: Two graphs, three responses

Numbers of new animal names

1923

WWI WWII

Page 3: Two graphs, three responses

Implications

• Taxonomists are working at capacity

• Maybe we are running out of species to discover…(!)

• Most taxonomic work is in the past (“legacy”)

Page 4: Two graphs, three responses
Page 5: Two graphs, three responses

Two graphs

Page 6: Two graphs, three responses

Two graphs

taxonomy sequen

cing

Page 7: Two graphs, three responses

Dark taxa(taxa with sequences but no proper

scientific name)

http://iphylo.blogspot.ca/2011/04/dark-taxa-genbank-in-post-taxonomic.html

Page 8: Two graphs, three responses

Mammals

Proper Linnaean names

Aus sp.

Page 9: Two graphs, three responses

“Invertebrates”

BOLD

Page 10: Two graphs, three responses

Dark taxa

• Disconnect between taxonomy and genomics

• Are “dark taxa” species we already know about, or do they represent new diversity?

Page 11: Two graphs, three responses

Two graphs

taxonomy sequen

cing

Page 12: Two graphs, three responses

Three responses

Make taxonomy digital

Sequences eat the world

Integrate taxonomy & sequences

Page 13: Two graphs, three responses

https://www.flickr.com/photos/nhm_beetle_id/15930177695

Page 14: Two graphs, three responses

GBIF.org 500 million records

Page 15: Two graphs, three responses

Digitise the literature

http://biodiversitylibrary.orghttp://biostor.org

Page 16: Two graphs, three responses

BioNameshttp://bionames.org

• Linking taxonomic names to published literature

• DOIs, BHL, BioStor, PDFs, Gallica, etc.

• 4 million names

• 400K publications

Page 17: Two graphs, three responses

Response 1: Modernise taxonomy

• Barcoding is digitising life (organism to string of letters)

• Mass digitisation of specimens and literature, etc.

• Cross-links between data sources

Page 18: Two graphs, three responses

Response 2: Barcode-only world

• Imagine barcodes were all we had (i.e., we’re microbiologists)

• How would we visualise biodiversity?

Page 19: Two graphs, three responses

Visualise all barcodes

http://iphylo.org/~rpage/bold-map

Page 20: Two graphs, three responses

Page R. Visualising Geophylogenies in Web Maps Using GeoJSON. PLOS Currents Tree of Life. 2015 Jun 23 . Edition 1. doi: 10.1371/currents.tol.8f3c6526c49b136b98ec28e00b570a1e

Page 21: Two graphs, three responses

These are (too) simple

Page 22: Two graphs, three responses

Challenges

• Can we build a complete tree of barcodes?

• Can we (quickly) compute sequence diversity for an area?

• Can we visualise ecological communities?

Page 23: Two graphs, three responses

DNA walksStart at (0,0) and first position in sequence

Move in x,y plane according to whether next base is different

At end of sequence draw point

Plot shows DNA mini-barcodes(127 bp)

Page 24: Two graphs, three responses

“Symbiome”

http://iphylo.blogspot.ca/2011/03/visualising-symbiome-hosts-parasites.html

Insects Crustacea

Page 25: Two graphs, three responses

Response 3: Link taxonomy and barcodes

Page 26: Two graphs, three responses

Integrate using taxonomic names?

Page 27: Two graphs, three responses

Names need not equal taxa

Morethia obscura in GBIF

Page 28: Two graphs, three responses

BINs for Morethia obscura in BOLD

Page 29: Two graphs, three responses

Integrate using specimens?

• Barcodes have vouchers

• GBIF has specimens

• Why isn’t BOLD in GBIF?

• Actually, parts of it already are…

Page 30: Two graphs, three responses

http://doi.org/10.15468/tfpnkp

Page 31: Two graphs, three responses

BOLD samples in GBIF (twice)(but provided by ZSM and EMBL, not BOLD)

BC ZSM Lep 10234GBIF 883514761Casbia rectaria Walker, 1866

GU655831GBIF 1080492017Lepidoptera

7bcace52-85a0-4c5c-991e-555d4479e42c

GWORH520-09BC ZSM Lep 10234BOLD:AAA4623Casbia rectaria

GU655831BC ZSM Lep 10234Lepidoptera sp. BOLD:AAA4623

GBIF

GBIF EMBL

BOLD

Page 32: Two graphs, three responses

The role of type specimens

Type specimen

Type specimen

time of publication

Page 33: Two graphs, three responses

11,664 bird holotype specimens:

For 6,648 GBIF doesn’t understand the name

Page 34: Two graphs, three responses

Three responses

Make taxonomy digital

Sequences eat the world

Integrate taxonomy & sequences