published by adam on Wed, 11/14/2012 - 16:02
It's beginner's night in Chicago. Alongside Chase Carpenter, Paul Teetor, and Jeffrey Ryan, I'm teaching Data Munging 101 in R at Jak's tap. The meetup information is available here.
Here are my slides, as promised:
Feel free to download, use this, share it---or ask me questions.
published by adam on Thu, 10/04/2012 - 13:31
published by adam on Mon, 06/04/2012 - 20:31
In Part 2, we showed how to add recession shading to a plot of American Beards over time, and did some diagnostics to check whether 19th Century Americans grew recession beards. (Spoiler alert: it appears they did not.) In Part 1, we showed how to plot the series in the first place. Today, we're going to look at the beardly trend over the period. We all know about the gilded age popularity of mutton chops and sideburns, but were full beards on the rise or on the decline between 1866 and 1911? And more importantly, what can this period tell us about beards of the future (in the past)?!
What you'll learn
-
How to apply linear smoothing to a time-series plot in ggplot2
-
How to interpret that and how not to interpret that. (The difference between interpolation and extrapolation.)
-
Whether beards were getting "trendier" or trending less during the tail end of the 19th Century.
-
What date we're all going to have beards.
published by adam on Tue, 05/29/2012 - 13:18
In Part I, we showed how to plot a time series of the change in American beards over time, using a dataset from Robert Hyndman's time series data library. Today, we're going to look at whether the dramatic changes in American male beardfulness seem related to the economy. Did Americans grow recession beards in response to the Panic of 1873? Out of work, did they forgo their frequent trips to the barber (since Gillette didn't invent the personal safety razor until 1904 so they could do it themselves)? Did they go on their job interviews with a face full of mutton chops and just never get called back (by telegram)?
What you'll learn
-
How to apply recession shading, according to the NBER's recession dates, to a time series---easily.
-
How to change the color of the recession shading
-
Whether 19th century gentlemen grew recession beards after losing their jobs in algo trading, you know, just taking some time off to study graphic design for a little bit.
published by adam on Mon, 05/28/2012 - 17:45
If you use R for time series analysis, chances are you've used Robert J. Hyndman's excellent forecast tools. I recently stumbled on his time series data library where I found just the data set I've been looking for to show some R time series plotting tricks:
http://robjhyndman.com/tsdldata/roberts/beards.dat
It's the percentage of American men with full beards reported annually. Nothing serious here, but absurdly perfect for a set of posts to share a couple things that took me a while to learn when plotting in R.
What you'll learn
-
How to grab data from a plaintext source on the web, stripping header information
-
How to convert a list of data with a known start time and end time into an xts time series object
-
How to convert xts to a data.frame for plotting in ggplot2
-
Aesthetics for red data points with dotted line interpolation
-
How much American beardfulness there was in the late 19th century and early 20th.
published by adam on Sun, 05/13/2012 - 20:25
Just a minor note for the observant---I'm reprinting and reposting the ARIMA sector reports since March 12. I was using ggplot2 as my graphics package (in R), which changed the way it formatted dates in a recent upgrade. Since March, this meant the ARIMA forecast plots had the date format "2012-03-12" rather than simply listing full months by name, such as "March." Past data was rerun, so everything else is identical, but the format of the date axis has been updated to the original, monthly format for purely aesthetic reasons.
published by adam on Tue, 05/08/2012 - 13:44
Gather 'round, R-users, forecasters, algo traders, and financial analysts. The R in Finance conference is this Friday and Saturday, May 11 & 12, in Chicago. All your favorite buzzwords, from algo trading, to data scientist, to big data, will be there---with talks given by the top experts in the R landscape. Jeff Ryan's pre-conference session on processing full-market data looks particularly solid.
published by adam on Wed, 01/18/2012 - 10:15
This is a repost from the R-bloggers mailing list, with a quick script showing credit rating on a global map. It displays sovereign credit ratings by S&P, Fitch, Moody's, and Chinese rating firm, Dagong, and demonstrates how easy it's become to create beautiful data visualizations.