Commit da9d10e

Merge pull request #230 from SISBID/summarization-25-updates
Summarization - a few more small tweaks!
2 parents 0dde09c + cfa6a1a commit da9d10e

3 files changed: +18 -14 lines changed

lecture_notes/Data_Summarization.Rmd

Lines changed: 4 additions & 6 deletions
@@ -212,18 +212,16 @@ Use the [`across`](https://dplyr.tidyverse.org/reference/across.html) function w
 dropouts <- read_delim("https://sisbid.github.io/Data-Wrangling/data/dropouts.txt", delim = "/")
 dropouts %>%
 group_by(ETHNIC) %>%
-summarize(across(
-c(D9, D10, D11, D12),
-~ sum(.x)
-))
+summarize(across(c(D9, D10, D11, D12), ~sum(.x)))
 ```
 
 ## Select different columns based on the data class
 
-Use `?tidyr_tidy_select` functions: `where(is.numeric)`
+Use `?tidyr_tidy_select` functions like `where(is.numeric)`!
 
 ```{r}
 dropouts %>%
+group_by(ETHNIC) %>%
 summarize(across( where(is.numeric), ~ sum(.x, na.rm = TRUE)))
 ```
 
@@ -241,7 +239,7 @@ summary(dropouts)
 - `count(x)`: what unique values do you have?
 - `group_by()`: changes all subsequent functions
 - combine with `summarize()` to get statistics per group
-- combine with `across()` to programmatically select columns
+- combine with `across()` to select several columns
 - `summary(x)`: quantile information
 
 https://sisbid.github.io/Data-Wrangling/labs/data-summarization-lab.Rmd
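
Note on the `where(is.numeric)` hunk: without `group_by(ETHNIC)`, the selection also matches `ETHNIC` itself, since it is stored as a number, so the ethnicity codes get summed into one meaningless total. Once the data is grouped, `across()` leaves the grouping column out of the selection and `summarize()` returns one row per group, which is consistent with the rendered output going from `1 × 18` to `9 × 18`. A minimal sketch of that behavior, assuming dplyr is installed; the `toy` tibble and its values are made up for illustration and are not taken from `dropouts.txt`:

```r
library(dplyr)

# Hypothetical toy data standing in for the dropouts table
toy <- tibble(
  ETHNIC = c(1, 1, 2, 2),
  D9     = c(3, NA, 5, 7),
  D10    = c(1, 2, NA, 4)
)

# Ungrouped: ETHNIC is numeric, so it is summed along with D9 and D10
ungrouped <- toy %>%
  summarize(across(where(is.numeric), ~ sum(.x, na.rm = TRUE)))

# Grouped: across() skips the grouping column, giving one row per ETHNIC value
grouped <- toy %>%
  group_by(ETHNIC) %>%
  summarize(across(where(is.numeric), ~ sum(.x, na.rm = TRUE)))

print(ungrouped)  # one row; the ETHNIC column holds 1 + 1 + 2 + 2 = 6
print(grouped)    # two rows; ETHNIC is kept as the group key
```

The same pattern should carry over to the real table, where the ungrouped call produced the `282304` value in the `ETHNIC` column of the old output.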

lecture_notes/Data_Summarization.html

Lines changed: 14 additions & 8 deletions
@@ -18417,10 +18417,7 @@ <h1 data-config-title><!-- populated from slide_config.json --></h1>
 <pre class = 'prettyprint lang-r'>dropouts &lt;- read_delim(&quot;https://sisbid.github.io/Data-Wrangling/data/dropouts.txt&quot;, delim = &quot;/&quot;)
 dropouts %&gt;%
 group_by(ETHNIC) %&gt;%
-summarize(across(
-c(D9, D10, D11, D12),
-~ sum(.x)
-))</pre>
+summarize(across(c(D9, D10, D11, D12), ~sum(.x)))</pre>
 
 <pre ># A tibble: 9 × 5
 ETHNIC D9 D10 D11 D12
@@ -18437,15 +18434,24 @@ <h1 data-config-title><!-- populated from slide_config.json --></h1>
 
 </article></slide><slide class=""><hgroup><h2>Select different columns based on the data class</h2></hgroup><article id="select-different-columns-based-on-the-data-class">
 
-<p>Use <code>?tidyr_tidy_select</code> functions: <code>where(is.numeric)</code></p>
+<p>Use <code>?tidyr_tidy_select</code> functions like <code>where(is.numeric)</code>!</p>
 
 <pre class = 'prettyprint lang-r'>dropouts %&gt;%
+group_by(ETHNIC) %&gt;%
 summarize(across( where(is.numeric), ~ sum(.x, na.rm = TRUE)))</pre>
 
-<pre ># A tibble: 1 × 18
+<pre ># A tibble: 9 × 18
 ETHNIC E7 E8 E9 E10 E11 E12 EUS ETOT D7 D8
 &lt;dbl&gt; &lt;dbl&gt; &lt;dbl&gt; &lt;dbl&gt; &lt;dbl&gt; &lt;dbl&gt; &lt;dbl&gt; &lt;dbl&gt; &lt;dbl&gt; &lt;dbl&gt; &lt;dbl&gt;
-1 282304 473574 472340 487547 486085 481521 484169 6079 1945401 1453 1138
+1 0 2922 2709 3140 3291 2776 2345 43 11595 66 52
+2 1 2523 2589 2808 2886 2889 3020 33 11636 16 15
+3 2 44199 43898 43343 43757 45840 43001 596 176537 69 37
+4 3 2390 2284 2436 2352 2471 2571 42 9872 9 7
+5 4 12210 12883 13429 14037 14273 14469 233 56441 12 8
+6 5 254745 252583 264302 260201 252458 253193 2680 1032834 682 550
+7 6 26383 26786 28497 28731 28696 30799 485 117208 217 164
+8 7 113498 115139 115593 117592 119308 121771 1829 476093 323 267
+9 9 14704 13469 13999 13238 12810 13000 138 53185 59 38
 # ℹ 7 more variables: D9 &lt;dbl&gt;, D10 &lt;dbl&gt;, D11 &lt;dbl&gt;, D12 &lt;dbl&gt;, DUS &lt;dbl&gt;,
 # DTOT &lt;dbl&gt;, YEAR &lt;dbl&gt;</pre>
 
@@ -18500,7 +18506,7 @@ <h1 data-config-title><!-- populated from slide_config.json --></h1>
 
 <ul>
 <li>combine with <code>summarize()</code> to get statistics per group</li>
-<li>combine with <code>across()</code> to programmatically select columns</li>
+<li>combine with <code>across()</code> to select several columns</li>
 </ul></li>
 <li><code>summary(x)</code>: quantile information</li>
 </ul>
1.34 KB
Binary file not shown.
