For as long as I have been working with water data, I wanted to construct a line graph superimposed on a box and whisker plot where the boxes show the distribution of values and the line shows some current condition. One of my favorite things about ggplot2 is that it allows users to construct complex combinations of graphs in ways that make sense.
This plot shows daily mean flow values in the Schuylkill River (the blue line) against box plots of the rolling 20-year set of daily mean values. In this instance, flows in the lower part of the river are higher than the 75th percentile flow value, while flows in the upper part of the river are within the interquartile range. Each point (and box plot) represents data from a USGS gage retrieved using the dataRetrieval library. The vertical step increases in the blue line are inputs from tributary gages. This particular image was developed as part of an automated dashboard hosted by the Delaware River Basin Commission. R scripts, run in batch mode, are called by Windows Task Scheduler. The scripts run overnight every night to retrieve data, develop new graphs, and upload the graphs to the dashboard web page providing a near real-time visualization of river conditions.
4
Can you share the R code?
ReplyDeleteThis graph is great! I would really like to see the code for this. Our team has been doing similar types of analyses on tributaries to drinking water reservoirs in Illinois
ReplyDeleteThis is nice, it would be great if you could share code openly as repository or snippet. The Delaware River Basin site doesn't provide code either.
ReplyDeleteSend me an e-mail at JYagecic@gmail.com My preference is to share code via some sort of collaborative project if possible
ReplyDelete