speciesbank dreams and realities rainer froese ifm-geomar march 2005

29
SpeciesBank Dreams and Realities Rainer Froese IfM-GEOMAR [email protected] March 2005

Upload: ezra-webb

Post on 06-Jan-2018

223 views

Category:

Documents


0 download

DESCRIPTION

Reality Check: Background Two-third of all major software projects fail (IHT )

TRANSCRIPT

Page 1: SpeciesBank Dreams and Realities Rainer Froese IfM-GEOMAR March 2005

SpeciesBank Dreams and Realities

Rainer FroeseIfM-GEOMAR

[email protected] 2005

Page 2: SpeciesBank Dreams and Realities Rainer Froese IfM-GEOMAR March 2005

The SpeciesBank Dream

„... a computer interface to the Internet able to find, combine and present data in a way that would be meaningful and

useful to the person who issued a query about a species.“

Page 3: SpeciesBank Dreams and Realities Rainer Froese IfM-GEOMAR March 2005

Reality Check: Background

Two-third of all major software projects fail (IHT 25.1.05)

Page 4: SpeciesBank Dreams and Realities Rainer Froese IfM-GEOMAR March 2005

SpeciesBanks, what are they not?

• Regional or global checklists• Purely distributed systems• Google: mix of good and garbage• Three year projects• Amateur products• Specialist products• Committee products• Community products

Page 5: SpeciesBank Dreams and Realities Rainer Froese IfM-GEOMAR March 2005

Reality Check: Users

• Taxonomists?• ‚Decision makers‘?• Stakeholders?• Nobody?• Depends on usefulness:• Mostly interested public & students• Few specialists

Page 6: SpeciesBank Dreams and Realities Rainer Froese IfM-GEOMAR March 2005

FishBase Web Users

Individuals

Private sector

Universities

Governments

NGOs

Museums

Int. Research Centres

Based on 2122 entries in the FishBase Guestbook, June 2004

Page 7: SpeciesBank Dreams and Realities Rainer Froese IfM-GEOMAR March 2005

FishBase Users by Continent

North America

Europe

Asia

South America

Africa

Australia

Oceania

Based on 2122 entries in the FishBase Guestbook , June 2004

Page 8: SpeciesBank Dreams and Realities Rainer Froese IfM-GEOMAR March 2005
Page 9: SpeciesBank Dreams and Realities Rainer Froese IfM-GEOMAR March 2005

Reality Check: User Needs

• Politically very important• Boring at best (motherhood statements)• Typically misleading

– Most users don‘t know what they need• Scientific approach: analyze actual usage

of what is available

Page 10: SpeciesBank Dreams and Realities Rainer Froese IfM-GEOMAR March 2005

FishBase Usage Compared with Internet Usage by Country, in 2001

0.0001

0.001

0.01

0.1

1

10

100

0.0001 0.001 0.01 0.1 1 10 100

Internet Users (% of all Users)

Fish

Bas

e S

essi

ons

(% o

f all

Ses

sion

s)

USA

Qatar

Fr Polynesia

Brunei

Japan

China

South Korea

Russia

Taiwan

India

Lebanon

Venezuela

Page 11: SpeciesBank Dreams and Realities Rainer Froese IfM-GEOMAR March 2005
Page 12: SpeciesBank Dreams and Realities Rainer Froese IfM-GEOMAR March 2005

0.0001

0.001

0.01

0.1

1

10

100

0.0001 0.001 0.01 0.1 1 10 100Internet usage by country (%)

Fish

Bas

e us

age

by c

ount

ry (%

)(J

uly

2004

)USA

China

Greece

Luxembourg

Saudi Arabia

Peru

Pakistan

Ghana

Belarus

Cuba

South Korea

Russia

BrazilAustralia

JapanGermany

PNG

India

Kenya

Page 13: SpeciesBank Dreams and Realities Rainer Froese IfM-GEOMAR March 2005

FishBase Web Usage by Topic May-July 2003

Common Names

Scientific Names

Direct Links

Species Summaries

Photos

Fish Quiz

Specialist Topics

Based on hits by directory on CGNET server

Page 14: SpeciesBank Dreams and Realities Rainer Froese IfM-GEOMAR March 2005

Common names

Scientific names

Direct links

Species summaries

Photos

Fish Quiz

Specialist topics

Based on AW Hits by directory on Kiel server

FishBase Usage by Topic July 2004

Page 15: SpeciesBank Dreams and Realities Rainer Froese IfM-GEOMAR March 2005

Frequency of FishBase Usage by Topic

0 10 20 30 40 50 60

Genetics

Diseases

FishQuiz

Reproduction

Maps

FB book

Physiology

Population dynamics

Trophic ecology

Identification

LarvalBase

Scientific Names

Museum collections

Eschmeyer

Glossary

References

Country information

Photos

Common Names

Species Summaries

Percent of total page views (July 2004)

About 10,000 visitors per month

Page 16: SpeciesBank Dreams and Realities Rainer Froese IfM-GEOMAR March 2005

What Determines Usage

• Quality and accuracy?• Recognition of scientists behind database?• MoUs? • Beautiful interface, fancy tools?• Content: common names, photos, summaries• Simplicity of interface (e.g. Google)• Number of clicks needed; response time

Page 17: SpeciesBank Dreams and Realities Rainer Froese IfM-GEOMAR March 2005

FishBase Usage over Time

0

2

4

6

8

10

12

14

16

A 98 A 99 A 00 A 01 A 02 A 03 A 04 A 05Years (August)

Hits

(mill

ions

)

0

100

200

300

400

500

600

700

800

900

Use

r se

ssio

ns (t

hous

ands

)

Page 18: SpeciesBank Dreams and Realities Rainer Froese IfM-GEOMAR March 2005

What Determines Usefulness?

• Actual use• What is not used is useless• How about yourself (the custodian)?

Page 19: SpeciesBank Dreams and Realities Rainer Froese IfM-GEOMAR March 2005

1

10

100

1 10 100 1000 10000 100000

Myx

Ceph

Holo

Elasmo

Sarco

Actino

Species per Class (n)

Str

ateg

ies

per

Cla

ss (

n)

Number of strategies used by phylogenetic Classes plotted over number of recent species in the Class, with linear regression line forced through the origin; slope = 0.37; r2 = 0.9754 .

Page 20: SpeciesBank Dreams and Realities Rainer Froese IfM-GEOMAR March 2005

1

10

100

1 10 100 1000 10000 100000

Actino

Elasmo

Sarco

Species per Class (n)

Ord

ers

per

Cla

ss (

n)

Orders per Class plotted over Species per Class. Sarcopterygii, Elasmobranchi and Actinopterygii fall nearly on a hypothetical straight line through the origin; slope = 0.37.

Page 21: SpeciesBank Dreams and Realities Rainer Froese IfM-GEOMAR March 2005

1

10

100

1 10 100 1000 10000 100000 1000000

Species per Class (n)

Ord

ers

per C

lass

(n)

Animalia Plantae Fungi Protozoa Max Orders

Actinopterygii

Insecta

Orders per Class plotted over species per Class for four Kingdoms and 415,000 species; the dotted line indicates the maximum number of Orders per species in a Class; slope = 0.37.

Page 22: SpeciesBank Dreams and Realities Rainer Froese IfM-GEOMAR March 2005

What is the Best Quality Assurance ?

• Scientific degree of encoders?• Double-encoding?• Hierarchy of checking?• Usage by custodians!• Usage by others!

Page 23: SpeciesBank Dreams and Realities Rainer Froese IfM-GEOMAR March 2005

Speed of Data Flow

What determines speed of data flow?• Bandwidth?• Trust!

Page 24: SpeciesBank Dreams and Realities Rainer Froese IfM-GEOMAR March 2005

How to Prioritize Data Entry

What approach is best when prioritizing data entry?

• User need analysis?• Importance and quality of data?• Opportunism! Enter what is ready for

entry.

Page 25: SpeciesBank Dreams and Realities Rainer Froese IfM-GEOMAR March 2005

Enemies

Who are your most dangerous enemies?• Critiques?• Jealous colleagues?• Unconvinced donors?• Institutions!

Page 26: SpeciesBank Dreams and Realities Rainer Froese IfM-GEOMAR March 2005

Data Encoders

Who are the best data encoders?• Students?• Long-time staff?• Experts?• Women!

Page 27: SpeciesBank Dreams and Realities Rainer Froese IfM-GEOMAR March 2005

Members of the FishBase Team in 1998

Page 28: SpeciesBank Dreams and Realities Rainer Froese IfM-GEOMAR March 2005

Back to Dreaming

Building the AllFish Species Portal1. Form Consortium of respective SpeciesBank

Custodians and Institutions2. Agree on Concept, Standards and Protocols3. Use FishBase Interface and Servers4. Have small AllFish Encoder and Programmer

team 5. Find modest funding from different donors6. Have AllFish up-and-running within one year

Page 29: SpeciesBank Dreams and Realities Rainer Froese IfM-GEOMAR March 2005

Don’t Dream It

Be It