rOpenSci - Biodiversity Data in R
Scott Chamberlain
UC Berkeley / rOpenSci
the research workflow
Data acquisition
data manipulation/analysis/viz
writing
publish
Many researchers use rOpenSci R packages for BIS data
rgbif citations: at least 28
taxize citations: at least 41
rfishbase citations: at least 13
rOpenSci R packages for Biodiversity data:
GBIF, iDigBio, EU BON, UK NBN, EOL, iNaturalist, VertNet, USGS BISON, Ecoengine, eBird, AntWeb, OBIS, ALA, Pangaea, FishBase, Sea Around Us, BHL
rOpenSci R packages for Taxonomic data:
ITIS, COL, WoRMS, The Plant List, Global Names, Tropicos, IPNI, NCBI, Index Fungorum, ION, TOL, NatureServe
we're developing taxonomic classes for R
Linking disparate data
R is one place where this happens
Occurrences + Environmental
Occurrences + Genetic
Occurrences + Geographic
Occurrences + Literature
rOpenSci serves as a conduit
Between researchers/other users and data providers
Problems with web services
Data quality problems
We can speak languages of both groups
rOpenSci also serves data
e.g., the FishBase API
We're interested in doing more of this to help small data providers and those that can't run web services on their own
Funding rOpenSci
Funded by private US-based foundations
Exploring:
Gov't grants
Member dues (universities)
Consulting
Industry support
Funding rOpenSci
rOpenSci partnering with BIS on grants makes a lot of sense
delivering data to researchers in their reproducible workflow