class: center, middle, inverse, title-slide # Visualization with
ggplot
## Distributions ### SuffolkEcon --- # Painting with data You can make a `ggplot` in four steps: -- _Step 1._ Choose your **data** -- _Step 2._ Make a **blank canvas** with the `ggplot()` and `aes()` functions -- _Step 3._ Choose your **paint** to visualization the data with a `geom_()` function (for "geometry") -- _Step 4._ **Customize** the axes, colors, and so on. --- class: inverse, center, middle # 1. Histogram `geom_histogram()` --- count: false ### 1. Histogram .panel1-hist1-auto[ ```r # 1. Data *gapminder ``` ] .panel2-hist1-auto[ ``` # A tibble: 1,704 x 6 country continent year lifeExp pop gdpPercap <fct> <fct> <int> <dbl> <int> <dbl> 1 Afghanistan Asia 1952 28.8 8425333 779. 2 Afghanistan Asia 1957 30.3 9240934 821. 3 Afghanistan Asia 1962 32.0 10267083 853. 4 Afghanistan Asia 1967 34.0 11537966 836. 5 Afghanistan Asia 1972 36.1 13079460 740. 6 Afghanistan Asia 1977 38.4 14880372 786. 7 Afghanistan Asia 1982 39.9 12881816 978. 8 Afghanistan Asia 1987 40.8 13867957 852. 9 Afghanistan Asia 1992 41.7 16317921 649. 10 Afghanistan Asia 1997 41.8 22227415 635. # … with 1,694 more rows ``` ] --- count: false ### 1. Histogram .panel1-hist1-auto[ ```r # 1. Data gapminder %>% # 2. blank canvas * ggplot(aes(x = lifeExp)) ``` ] .panel2-hist1-auto[ data:image/s3,"s3://crabby-images/c22ca/c22caa557a7eadac11fbc00624c5b652141d6251" alt=""<!-- --> ] --- count: false ### 1. Histogram .panel1-hist1-auto[ ```r # 1. Data gapminder %>% # 2. blank canvas ggplot(aes(x = lifeExp)) + # 3. paint * geom_histogram() ``` ] .panel2-hist1-auto[ data:image/s3,"s3://crabby-images/76f32/76f321d28534945c12bc4854012573f0f1e90d9a" alt=""<!-- --> ] --- count: false ### 1. Histogram .panel1-hist1-auto[ ```r # 1. Data gapminder %>% # 2. blank canvas ggplot(aes(x = lifeExp)) + # 3. paint geom_histogram() + # 4. customize: titles * labs(x = "Average Life Expectancy", * y = "Frequency", * title = "How long will you live?", * subtitle = "1952 to 2007") ``` ] .panel2-hist1-auto[ data:image/s3,"s3://crabby-images/839d0/839d03fcfd2277b787726c5aef565baddb78e86d" alt=""<!-- --> ] --- count: false ### 1. Histogram .panel1-hist1-auto[ ```r # 1. Data gapminder %>% # 2. blank canvas ggplot(aes(x = lifeExp)) + # 3. paint geom_histogram() + # 4. customize: titles labs(x = "Average Life Expectancy", y = "Frequency", title = "How long will you live?", subtitle = "1952 to 2007") + # 4. customize: one panel per continent * facet_wrap(~continent) ``` ] .panel2-hist1-auto[ data:image/s3,"s3://crabby-images/1399e/1399e78f1283d51cf31c580e474cfbed847b7525" alt=""<!-- --> ] <style> .panel1-hist1-auto { color: black; width: 38.6060606060606%; hight: 32%; float: left; padding-left: 1%; font-size: 80% } .panel2-hist1-auto { color: black; width: 59.3939393939394%; hight: 32%; float: left; padding-left: 1%; font-size: 80% } .panel3-hist1-auto { color: black; width: NA%; hight: 33%; float: left; padding-left: 1%; font-size: 80% } </style> --- class: inverse, center, middle # 2. Kernel Density `geom_density()` --- count: false ### 2. Kernel Density .panel1-hist2-auto[ ```r # 1. Data *gapminder ``` ] .panel2-hist2-auto[ ``` # A tibble: 1,704 x 6 country continent year lifeExp pop gdpPercap <fct> <fct> <int> <dbl> <int> <dbl> 1 Afghanistan Asia 1952 28.8 8425333 779. 2 Afghanistan Asia 1957 30.3 9240934 821. 3 Afghanistan Asia 1962 32.0 10267083 853. 4 Afghanistan Asia 1967 34.0 11537966 836. 5 Afghanistan Asia 1972 36.1 13079460 740. 6 Afghanistan Asia 1977 38.4 14880372 786. 7 Afghanistan Asia 1982 39.9 12881816 978. 8 Afghanistan Asia 1987 40.8 13867957 852. 9 Afghanistan Asia 1992 41.7 16317921 649. 10 Afghanistan Asia 1997 41.8 22227415 635. # … with 1,694 more rows ``` ] --- count: false ### 2. Kernel Density .panel1-hist2-auto[ ```r # 1. Data gapminder %>% # 2. blank canvas * ggplot(aes(x = lifeExp)) ``` ] .panel2-hist2-auto[ data:image/s3,"s3://crabby-images/2e4d6/2e4d6d773701d5b0c0363c8c18b5681c4fc4ee56" alt=""<!-- --> ] --- count: false ### 2. Kernel Density .panel1-hist2-auto[ ```r # 1. Data gapminder %>% # 2. blank canvas ggplot(aes(x = lifeExp)) + # 3. paint * geom_density() ``` ] .panel2-hist2-auto[ data:image/s3,"s3://crabby-images/3a483/3a483f8197585f1284ba0460d62ae79ef6afe0ec" alt=""<!-- --> ] --- count: false ### 2. Kernel Density .panel1-hist2-auto[ ```r # 1. Data gapminder %>% # 2. blank canvas ggplot(aes(x = lifeExp)) + # 3. paint geom_density() + # 4. customize: titles * labs(x = "Average Life Expectancy", * y = "Density", * title = "How long will you live?", * subtitle = "1952 to 2007") ``` ] .panel2-hist2-auto[ data:image/s3,"s3://crabby-images/75349/75349bceb653d3fb8eb52274da90c04ed2efccee" alt=""<!-- --> ] --- count: false ### 2. Kernel Density .panel1-hist2-auto[ ```r # 1. Data gapminder %>% # 2. blank canvas ggplot(aes(x = lifeExp)) + # 3. paint geom_density() + # 4. customize: titles labs(x = "Average Life Expectancy", y = "Density", title = "How long will you live?", subtitle = "1952 to 2007") + # 4. customize: one panel per continent * facet_wrap(~continent) ``` ] .panel2-hist2-auto[ data:image/s3,"s3://crabby-images/30730/307300749819365e8c5eea3b78521d1d3b4ac1a9" alt=""<!-- --> ] <style> .panel1-hist2-auto { color: black; width: 38.6060606060606%; hight: 32%; float: left; padding-left: 1%; font-size: 80% } .panel2-hist2-auto { color: black; width: 59.3939393939394%; hight: 32%; float: left; padding-left: 1%; font-size: 80% } .panel3-hist2-auto { color: black; width: NA%; hight: 33%; float: left; padding-left: 1%; font-size: 80% } </style> --- class: inverse, center, middle # 3. Cumulative Distributions `stat_ecdf()` --- count: false ### 3. Cumulative Distribution .panel1-hist3-auto[ ```r # 1. Data *gapminder ``` ] .panel2-hist3-auto[ ``` # A tibble: 1,704 x 6 country continent year lifeExp pop gdpPercap <fct> <fct> <int> <dbl> <int> <dbl> 1 Afghanistan Asia 1952 28.8 8425333 779. 2 Afghanistan Asia 1957 30.3 9240934 821. 3 Afghanistan Asia 1962 32.0 10267083 853. 4 Afghanistan Asia 1967 34.0 11537966 836. 5 Afghanistan Asia 1972 36.1 13079460 740. 6 Afghanistan Asia 1977 38.4 14880372 786. 7 Afghanistan Asia 1982 39.9 12881816 978. 8 Afghanistan Asia 1987 40.8 13867957 852. 9 Afghanistan Asia 1992 41.7 16317921 649. 10 Afghanistan Asia 1997 41.8 22227415 635. # … with 1,694 more rows ``` ] --- count: false ### 3. Cumulative Distribution .panel1-hist3-auto[ ```r # 1. Data gapminder %>% # 2. blank canvas * ggplot(aes(x = lifeExp)) ``` ] .panel2-hist3-auto[ data:image/s3,"s3://crabby-images/4b781/4b7815095d99ba1ed7a4952b1978fa4c63d620ba" alt=""<!-- --> ] --- count: false ### 3. Cumulative Distribution .panel1-hist3-auto[ ```r # 1. Data gapminder %>% # 2. blank canvas ggplot(aes(x = lifeExp)) + # 3. paint * stat_ecdf() ``` ] .panel2-hist3-auto[ data:image/s3,"s3://crabby-images/bb1bf/bb1bf3007d822d7e9d1454c4535dbcbd10146892" alt=""<!-- --> ] --- count: false ### 3. Cumulative Distribution .panel1-hist3-auto[ ```r # 1. Data gapminder %>% # 2. blank canvas ggplot(aes(x = lifeExp)) + # 3. paint stat_ecdf() + # 4. customize: titles * labs(x = "Average Life Expectancy", * y = "Proportion", * title = "How long will you live?", * subtitle = "1952 to 2007") ``` ] .panel2-hist3-auto[ data:image/s3,"s3://crabby-images/af858/af85823473f2fc09ce81aee2a8ab6f327e90831b" alt=""<!-- --> ] --- count: false ### 3. Cumulative Distribution .panel1-hist3-auto[ ```r # 1. Data gapminder %>% # 2. blank canvas ggplot(aes(x = lifeExp)) + # 3. paint stat_ecdf() + # 4. customize: titles labs(x = "Average Life Expectancy", y = "Proportion", title = "How long will you live?", subtitle = "1952 to 2007") + # 4. customize: one panel per continent * facet_wrap(~continent) ``` ] .panel2-hist3-auto[ data:image/s3,"s3://crabby-images/4290a/4290a876f3edd19d3e0ff0f87c50704ffd62afea" alt=""<!-- --> ] <style> .panel1-hist3-auto { color: black; width: 38.6060606060606%; hight: 32%; float: left; padding-left: 1%; font-size: 80% } .panel2-hist3-auto { color: black; width: 59.3939393939394%; hight: 32%; float: left; padding-left: 1%; font-size: 80% } .panel3-hist3-auto { color: black; width: NA%; hight: 33%; float: left; padding-left: 1%; font-size: 80% } </style> --- class: inverse, center, middle # 4. Boxplot `geom_boxplot()` --- count: false ### 4. Boxplot .panel1-hist4-auto[ ```r # 1. Data *gapminder ``` ] .panel2-hist4-auto[ ``` # A tibble: 1,704 x 6 country continent year lifeExp pop gdpPercap <fct> <fct> <int> <dbl> <int> <dbl> 1 Afghanistan Asia 1952 28.8 8425333 779. 2 Afghanistan Asia 1957 30.3 9240934 821. 3 Afghanistan Asia 1962 32.0 10267083 853. 4 Afghanistan Asia 1967 34.0 11537966 836. 5 Afghanistan Asia 1972 36.1 13079460 740. 6 Afghanistan Asia 1977 38.4 14880372 786. 7 Afghanistan Asia 1982 39.9 12881816 978. 8 Afghanistan Asia 1987 40.8 13867957 852. 9 Afghanistan Asia 1992 41.7 16317921 649. 10 Afghanistan Asia 1997 41.8 22227415 635. # … with 1,694 more rows ``` ] --- count: false ### 4. Boxplot .panel1-hist4-auto[ ```r # 1. Data gapminder %>% # 2. blank canvas * ggplot(aes(x = continent, y = lifeExp)) ``` ] .panel2-hist4-auto[ data:image/s3,"s3://crabby-images/c6937/c6937332b704cee80cd68312dad875d96ef57b0e" alt=""<!-- --> ] --- count: false ### 4. Boxplot .panel1-hist4-auto[ ```r # 1. Data gapminder %>% # 2. blank canvas ggplot(aes(x = continent, y = lifeExp)) + # 3. paint * geom_boxplot() ``` ] .panel2-hist4-auto[ data:image/s3,"s3://crabby-images/66e6b/66e6bf106e9fb66c3628873747f64031c71f0ef7" alt=""<!-- --> ] --- count: false ### 4. Boxplot .panel1-hist4-auto[ ```r # 1. Data gapminder %>% # 2. blank canvas ggplot(aes(x = continent, y = lifeExp)) + # 3. paint geom_boxplot() + # 4. customize: titles * labs(x = "Continent", * y = "Life Expectancy", * title = "How long will you live?", * subtitle = "1952 to 2007") ``` ] .panel2-hist4-auto[ data:image/s3,"s3://crabby-images/51165/51165d77f49c6be0947abeee2c4db27521cd9cf5" alt=""<!-- --> ] <style> .panel1-hist4-auto { color: black; width: 38.6060606060606%; hight: 32%; float: left; padding-left: 1%; font-size: 80% } .panel2-hist4-auto { color: black; width: 59.3939393939394%; hight: 32%; float: left; padding-left: 1%; font-size: 80% } .panel3-hist4-auto { color: black; width: NA%; hight: 33%; float: left; padding-left: 1%; font-size: 80% } </style> --- class: inverse, center, middle # 5. Themes `theme_` --- count: false ### 5. Themes .panel1-theme-rotate[ ```r # 1. Data gapminder %>% # 2. blank canvas ggplot(aes(x = continent, y = lifeExp)) + # 3. paint geom_boxplot() + # 4. customize: titles labs(x = "Continent", y = "Life Expectancy", title = "How long will you live?", subtitle = "1952 to 2007") + # customize: theme * theme_gray() ``` ] .panel2-theme-rotate[ data:image/s3,"s3://crabby-images/17c78/17c789129d273f3c6b236510a2a222d60f727970" alt=""<!-- --> ] --- count: false ### 5. Themes .panel1-theme-rotate[ ```r # 1. Data gapminder %>% # 2. blank canvas ggplot(aes(x = continent, y = lifeExp)) + # 3. paint geom_boxplot() + # 4. customize: titles labs(x = "Continent", y = "Life Expectancy", title = "How long will you live?", subtitle = "1952 to 2007") + # customize: theme * theme_minimal() ``` ] .panel2-theme-rotate[ data:image/s3,"s3://crabby-images/df406/df406969484be04628290d43941678861cdd5dc6" alt=""<!-- --> ] --- count: false ### 5. Themes .panel1-theme-rotate[ ```r # 1. Data gapminder %>% # 2. blank canvas ggplot(aes(x = continent, y = lifeExp)) + # 3. paint geom_boxplot() + # 4. customize: titles labs(x = "Continent", y = "Life Expectancy", title = "How long will you live?", subtitle = "1952 to 2007") + # customize: theme * theme_bw() ``` ] .panel2-theme-rotate[ data:image/s3,"s3://crabby-images/dabba/dabba54b65bfab5aa68617b79d6de230ee314680" alt=""<!-- --> ] --- count: false ### 5. Themes .panel1-theme-rotate[ ```r # 1. Data gapminder %>% # 2. blank canvas ggplot(aes(x = continent, y = lifeExp)) + # 3. paint geom_boxplot() + # 4. customize: titles labs(x = "Continent", y = "Life Expectancy", title = "How long will you live?", subtitle = "1952 to 2007") + # customize: theme * theme_dark() ``` ] .panel2-theme-rotate[ data:image/s3,"s3://crabby-images/cec57/cec57676574fe03ec490a250ccad248b36579f91" alt=""<!-- --> ] --- count: false ### 5. Themes .panel1-theme-rotate[ ```r # 1. Data gapminder %>% # 2. blank canvas ggplot(aes(x = continent, y = lifeExp)) + # 3. paint geom_boxplot() + # 4. customize: titles labs(x = "Continent", y = "Life Expectancy", title = "How long will you live?", subtitle = "1952 to 2007") + # customize: theme * theme_light() ``` ] .panel2-theme-rotate[ data:image/s3,"s3://crabby-images/7694e/7694ed73b5af66048404bc50db32ffc977babda0" alt=""<!-- --> ] --- count: false ### 5. Themes .panel1-theme-rotate[ ```r # 1. Data gapminder %>% # 2. blank canvas ggplot(aes(x = continent, y = lifeExp)) + # 3. paint geom_boxplot() + # 4. customize: titles labs(x = "Continent", y = "Life Expectancy", title = "How long will you live?", subtitle = "1952 to 2007") + # customize: theme * theme_classic() ``` ] .panel2-theme-rotate[ data:image/s3,"s3://crabby-images/f3af9/f3af97cc40cc12741e7b023eb9b20e1d71b4c21a" alt=""<!-- --> ] <style> .panel1-theme-rotate { color: black; width: 38.6060606060606%; hight: 32%; float: left; padding-left: 1%; font-size: 80% } .panel2-theme-rotate { color: black; width: 59.3939393939394%; hight: 32%; float: left; padding-left: 1%; font-size: 80% } .panel3-theme-rotate { color: black; width: NA%; hight: 33%; float: left; padding-left: 1%; font-size: 80% } </style> --- class: inverse, center, middle # 6. Color --- count: false ### 6. Color .panel1-color-rotate[ ```r # 1. Data gapminder %>% # 2. blank canvas ggplot(aes(x = lifeExp)) + # 3. paint * geom_histogram() + # 4. customize: titles labs(x = "Continent", y = "Life Expectancy", title = "How long will you live?", subtitle = "1952 to 2007") ``` ] .panel2-color-rotate[ data:image/s3,"s3://crabby-images/25d12/25d129577e90183b9c7abc74d75bceb5301ae907" alt=""<!-- --> ] --- count: false ### 6. Color .panel1-color-rotate[ ```r # 1. Data gapminder %>% # 2. blank canvas ggplot(aes(x = lifeExp)) + # 3. paint * geom_histogram(fill = "blue") + # 4. customize: titles labs(x = "Continent", y = "Life Expectancy", title = "How long will you live?", subtitle = "1952 to 2007") ``` ] .panel2-color-rotate[ data:image/s3,"s3://crabby-images/da9d8/da9d8ad0270e2172b2ebb2a4c9ef0d5181b304af" alt=""<!-- --> ] --- count: false ### 6. Color .panel1-color-rotate[ ```r # 1. Data gapminder %>% # 2. blank canvas ggplot(aes(x = lifeExp)) + # 3. paint * geom_histogram(aes(fill = continent)) + # 4. customize: titles labs(x = "Continent", y = "Life Expectancy", title = "How long will you live?", subtitle = "1952 to 2007") ``` ] .panel2-color-rotate[ data:image/s3,"s3://crabby-images/bb384/bb3841f4ff92565b031fa93958da3cf899bffa21" alt=""<!-- --> ] --- count: false ### 6. Color .panel1-color-rotate[ ```r # 1. Data gapminder %>% # 2. blank canvas ggplot(aes(x = lifeExp)) + # 3. paint * geom_histogram(aes(fill = continent), alpha = 0.75) + # 4. customize: titles labs(x = "Continent", y = "Life Expectancy", title = "How long will you live?", subtitle = "1952 to 2007") ``` ] .panel2-color-rotate[ data:image/s3,"s3://crabby-images/ab458/ab4587f90a2fe3f93b522170edc7d4f9d2f54cc0" alt=""<!-- --> ] --- count: false ### 6. Color .panel1-color-rotate[ ```r # 1. Data gapminder %>% # 2. blank canvas ggplot(aes(x = lifeExp)) + # 3. paint * geom_density() + # 4. customize: titles labs(x = "Continent", y = "Life Expectancy", title = "How long will you live?", subtitle = "1952 to 2007") ``` ] .panel2-color-rotate[ data:image/s3,"s3://crabby-images/40014/4001490cdc7eb164690312edd339ba5dbe981085" alt=""<!-- --> ] --- count: false ### 6. Color .panel1-color-rotate[ ```r # 1. Data gapminder %>% # 2. blank canvas ggplot(aes(x = lifeExp)) + # 3. paint * geom_density(fill = "purple") + # 4. customize: titles labs(x = "Continent", y = "Life Expectancy", title = "How long will you live?", subtitle = "1952 to 2007") ``` ] .panel2-color-rotate[ data:image/s3,"s3://crabby-images/f6071/f607192365e83106c691b63278b46721116ed996" alt=""<!-- --> ] --- count: false ### 6. Color .panel1-color-rotate[ ```r # 1. Data gapminder %>% # 2. blank canvas ggplot(aes(x = lifeExp)) + # 3. paint * geom_density(aes(fill = continent)) + # 4. customize: titles labs(x = "Continent", y = "Life Expectancy", title = "How long will you live?", subtitle = "1952 to 2007") ``` ] .panel2-color-rotate[ data:image/s3,"s3://crabby-images/dbb1a/dbb1a6382787a2c04fe96301410c6c51ab057883" alt=""<!-- --> ] --- count: false ### 6. Color .panel1-color-rotate[ ```r # 1. Data gapminder %>% # 2. blank canvas ggplot(aes(x = lifeExp)) + # 3. paint * geom_density(aes(fill = continent), alpha = 0.75) + # 4. customize: titles labs(x = "Continent", y = "Life Expectancy", title = "How long will you live?", subtitle = "1952 to 2007") ``` ] .panel2-color-rotate[ data:image/s3,"s3://crabby-images/4eabe/4eabe10738363e44111547cc11a991bd0757e1cd" alt=""<!-- --> ] --- count: false ### 6. Color .panel1-color-rotate[ ```r # 1. Data gapminder %>% # 2. blank canvas ggplot(aes(x = lifeExp)) + # 3. paint * stat_ecdf() + # 4. customize: titles labs(x = "Continent", y = "Life Expectancy", title = "How long will you live?", subtitle = "1952 to 2007") ``` ] .panel2-color-rotate[ data:image/s3,"s3://crabby-images/b488c/b488c2bea522a5b92bc6f86ce430671fe9ea8a62" alt=""<!-- --> ] --- count: false ### 6. Color .panel1-color-rotate[ ```r # 1. Data gapminder %>% # 2. blank canvas ggplot(aes(x = lifeExp)) + # 3. paint * stat_ecdf(color = "red") + # 4. customize: titles labs(x = "Continent", y = "Life Expectancy", title = "How long will you live?", subtitle = "1952 to 2007") ``` ] .panel2-color-rotate[ data:image/s3,"s3://crabby-images/9ee03/9ee037e1b8864d2a04fb0a04a27ec6431faff41b" alt=""<!-- --> ] --- count: false ### 6. Color .panel1-color-rotate[ ```r # 1. Data gapminder %>% # 2. blank canvas ggplot(aes(x = lifeExp)) + # 3. paint * stat_ecdf(aes(color = continent)) + # 4. customize: titles labs(x = "Continent", y = "Life Expectancy", title = "How long will you live?", subtitle = "1952 to 2007") ``` ] .panel2-color-rotate[ data:image/s3,"s3://crabby-images/90b67/90b6790dc03749f5fd12d456b32defb4dc8bf075" alt=""<!-- --> ] <style> .panel1-color-rotate { color: black; width: 38.6060606060606%; hight: 32%; float: left; padding-left: 1%; font-size: 80% } .panel2-color-rotate { color: black; width: 59.3939393939394%; hight: 32%; float: left; padding-left: 1%; font-size: 80% } .panel3-color-rotate { color: black; width: NA%; hight: 33%; float: left; padding-left: 1%; font-size: 80% } </style> --- # Acknowledgements These slides were made with the `flipbookr` package by [Gina Reynolds](https://github.com/EvaMaeRey/flipbookr). Last update: June 2021 <style type="text/css"> .remark-code{line-height: 1.5; font-size: 80%} @media print { .has-continuation { display: block; } } code.r.hljs.remark-code{ position: relative; overflow-x: hidden; } code.r.hljs.remark-code:hover{ overflow-x:visible; width: 500px; border-style: solid; } </style>