rOpenSci & R packages for Biodiversity Analysis
Scott Chamberlain (@sckottie/@ropensci)
UC Berkeley / rOpenSci
Open science as a lego set
Open science as a lego set
open science may be hard to do
but - you can work on different components
and - individual components are worth learning
Open Data
make your data open
funders/journals often requiring this anyway
future self will thank you
Versioning: code/data/text
Versioning: code/data/text
failure proofs your work
experiment freely!
Do all work programatically
Key to reproduciblity
Most important person that wants to reproduce your work is you!
Do all work programatically
you and yourself
- one week from now
- two months from now
- & so on
important scientific programming languages
R language
used widely in biology, psychology, medicine, etc.
rapidly growing user base, companies surrounding it
includes all tools for open science workflow
though work to be done ...
Open science ecosytsem
What data do you use in your research?
the research workflow
Data acquisition
data manipulation/analysis/viz
writing
publish
the research workflow
Data acquisition
data manipulation/analysis/viz
writing
publish
the research workflow
Data acquisition
data manipulation/analysis/viz
writing
publish
the research workflow
Data acquisition
data manipulation/analysis/viz
writing
publish
the research workflow
Data acquisition
data manipulation/analysis/viz
writing
publish
We make data driven stories easier to tell
here are some stories ...
use case 1
McGee, M. D., Borstein, S. R., Neches, R. Y., Buescher, H. H., Seehausen, O., & Wainwright, P. C. (2015). A pharyngeal jaw evolutionary innovation facilitated extinction in Lake Victoria cichlids. Science, 350(6264), 1077–1079
rOpenSci Biodiversity Tools
Taxonomic IDs
always try to move from:
taxonomic name -- to
taxonomic ID -- to
whatever other data
how do you clean your data?
Geospatial: conversion between data/spatial data formats - geojsonio
geojson_list - convert to GeoJSON as R list
geojson_json - convert to GeoJSON as JSON
geojson_read/geojson_write - read/write GeoJSON
from most R object types + many spatial data formats
talk to us
what would you like to see?
what open data is too hard to get?