class: bottom, left, title-slide # Data Processing with R
Session 1
### Garrick Aden-Buie ### July 18, 2018 --- class: inverse center middle # Who are we? ??? - Name, what do you do here?, cat/dog/none, first computer --- class: center middle <img src="../images/garrick.png" width="20%" /> ## Garrick Aden-Buie ### Cancer Epidemiology<br>Dr. Travis Gerke #### Garrick.Aden-Buie@moffitt.org @grrrck<br> garrickadenbuie.com --- class: inverse center middle # What's so cool about R? --- background-image: url("../images/r-logo.svg") background-position: 90% 60% background-size: 350px ## What is R? > R is a free software environment for statistical computing and graphics. - Both interactive environment and programming language - Designed by Statisticians - Quirky - <http://r-project.org> ??? Ross and Robert --- background-image: url("../images/tidyverse-hex-logo.png") background-position: 90% 80% background-size: 300px ## tidyverse > The tidyverse is an opinionated collection of R packages designed for data science. - Consistent data structure and grammar - A common design philosophy - Everything fits together nicely - <https://tidyverse.org> --- ## Falling into the Pit of Despair .center[ ![](../images/typing-hurts.gif) ] --- ## Falling into the Pit of ~~Despair~~ Success .center[ ![](../images/typing-awesome.gif) ] -- .center[ The biggest bottleneck is cognitive: what you want to do, and<br>how you tell the computer to do that ] --- ## Two options for data processing .pull-left.center[ ![](../images/excel.png) **GUI** ] .pull-right.center[ ![](../images/command-line.png) **Command Line** ] --- ## Two options for data processing .pull-left.center[ ![](../images/excel-sort-warning.png) .big[🤔] ] .pull-right.center[ ![](../images/command-line.png) **Command Line** ] --- layout: true ## Why learn how to program? --- .footer[https://speakerdeck.com/hadley/should-all-statistics-students-be-programmers] .third.left-column[ ### Code is text ] -- .two-third.right-column[ .large[<kbd>Ctrl</kbd>+<kbd>C</kbd> & <kbd>Ctrl</kbd>+<kbd>V</kbd>] ] --- .third.left-column[ ### Code is text ### Code is read-able ] .two-third.right-column[ ```r bookings %>% mutate( total_price = price_per_night * room_nights ) %>% summarise( n = n(), n_stayed = sum(status == "stayed"), mean_price = mean(total_price) ) ``` ] --- .third.left-column[ ### Code is text ### Code is read-able ### Code is shareable ] .two-third.right-column.center[ ![](../images/github-gerkelab-repos.png) ] --- .third.left-column[ ### Code is text ### Code is read-able ### Code is shareable ] .two-third.right-column.center[ ![](../images/github-gerkelab-r-code.png) ] --- .third.left-column[ ### Code is text ### Code is read-able ### Code is shareable ### Code is open ] .two-third.right-column.center[ ![](../images/hexwall-user2018.jpg) ] .footer[<https://blog.mitchelloharawild.com/blog/user-2018-feature-wall/>] --- layout: false class: inverse center middle # Joining the R Community --- # Goals for this course .large[ 1. Hands-on, applicable 2. Empowering 3. Collaborative ] --- ## Learning .center[ <img src="../images/tweet_kierisi_learning_r.jpg" width="32%" class="bordered top" /> <img src="../images/tweet_google_gather.jpg" width="32%" class="bordered top" /> <img src="../images/tweet_phd_google.jpg" width="32%" class="bordered top" /> ] .footer.muted[ https://twitter.com/kierisi/status/898534740051062785 https://twitter.com/kaelen_medeiros/status/1018636793221189632 https://twitter.com/_roryturnbull/status/1018180596965621760 ] --- ## Community Leaders .card[ .card-img[![](../images/r-people-hadleywickham.jpg)] ## Hadley Wickham ### [hadley.nz](http://hadley.nz/) Chief Scientist at RStudio, I build tools (computational and cognitive) that make data science easier, faster, and more fun. ] .card[ .card-img[![](../images/r-people-jennybryan.jpg)] ## Jenny<br>Bryan ### [github.com/jennybc](https://github.com/jennybc) Software engineer @rstudio, humane #rstats, adjunct prof @UBC where I created @STAT545, part of @ropensci ] .card[ .card-img[![](../images/r-people-juliasilge.jpg)] ## Julia<br>Silge ### [juliasilge.com](https://juliasilge.com) Data science and visualization at @StackOverflow, #rstats, author of Tidy Text Mining with R ] .card[ .card-img[![](../images/r-people-drob.jpg)] ## David Robinson ### [varianceexplained.org](https://varianceexplained.org) Chief Data Scientist at @DataCamp, #rstats fan/evangelist ] --- ## Online Community .card[ ![](../images/r4dscommunity.jpg) ### [@R4DScommunity](https://twitter.com/R4DScommunity) Online community working through [R for Data Science](http://r4ds.had.co.nz/) ] .right-column.center[ ![](../images/tweet_tidytuesday.jpg) ] --- class: inverse center middle # Course Overview --- background-image: url("../images/tidyverse-workflow-01.png") background-size: contain background-color: #FAF8F2 .footer[[R4DS](http://r4ds.had.co.nz/)] --- background-image: url("../images/tidyverse-workflow-02.png") background-size: contain background-color: #FAF8F2 .footer[[R4DS](http://r4ds.had.co.nz/)] --- background-image: url("../images/tidyverse-workflow-03.png") background-size: contain background-color: #FAF8F2 .footer[[R4DS](http://r4ds.had.co.nz/)] --- # Before we get started... .big[ - 🙋🏻‍♀ Installed **R** - 🙋🏽‍♂️ Installed **RStudio** - 👏🏽 Installed .pkg[tidyverse] ]