Building Infrastructure with R

Visit our Meetup page for more details and to RSVP.

Date: Thursday, August 9, 2018

Time: 6:30pm

Speakers: Soumya Kalra and Emily Dodwell

Host: The New York Times

For this event we will explore how to build tools and infrastructure with R.


6:15-6:30pm Introductions and Social

6:30- 6:45 pm NYT announcements (Data, Tech, HR)

6:45-6:55pm R-Ladies New York Announcements

6:55-7:25pm Object Oriented Programming in R

7:25-7:55pm Big Data in R with Small Prototypes: Scaling

Research Workflows from [/Local] to [Cloud/Cluster]

7:55-8:00pm Community Announcements and News/Move to the 5th Floor for food/refreshments

8:00-8:15pm Networking.

Object Oriented Programming in R

In this talk, Soumya will provide an introduction to using these programming techniques in workflows and project development. She will primarily focus on building functions, S3 classes, as well as the most recent R6 classes. She will also provide the resources she used in her learning journey.

Soumya is a quantitative analyst at the New York Federal Reserve where she focusing on developing models and tools as related to stress testing work. She is an organizer on the R-Ladies New York Board and on the committee for the R in Finance.

Big Data in R with Small Prototypes: Scaling Research Workflows from [/Local] to [Cloud/Cluster]

In this talk motivated by a recent project, Emily will discuss the tools her team at AT&T Labs explored when their typical workflow to process a data set and build a machine learning model in R would not scale due to the size of the data. She will provide some considerations for the data scientist faced with such a challenge, as well as a brief introduction to sparklyr. Created by the RStudio team, this package provides a dplyr interface to Spark from R, and thereby enables straightforward access to Spark’s distributed machine learning algorithms in MLlib.

Emily is a Senior Inventive Scientist in the Statistics Research Department at AT&T Labs, where she currently focuses on predictive modeling for advertising applications and the creation of interactive tools for data analysis and visualization. She is a member of R Forwards, the R Foundation taskforce on women and other under-represented groups. Prior to joining AT&T Labs in 2015, Emily taught high school math for three years at Choate Rosemary Hall. She received her M.A. in statistics from Yale University and B.A. in mathematics from Smith College.