Claus Wilke Profile Banner
Claus Wilke Profile
Claus Wilke

@ClausWilke

19,951
Followers
977
Following
2,309
Media
22,354
Statuses

Computational biologist, data scientist, digital artist | he, him | | Opinions are my own and do not represent UT Austin.

Austin, TX
Joined August 2013
Don't wanna be here? Send us removal request.
@ClausWilke
Claus Wilke
3 years
This spring, I'm teaching a new class on data visualization with R. I'm posting all materials as I go. Feel free to follow along. Each lecture has slides and an interactive worksheet. #rstats
Tweet media one
Tweet media two
Tweet media three
Tweet media four
104
3K
13K
@ClausWilke
Claus Wilke
5 years
The complete draft of "Fundamentals of Data Visualization" is now available online.
Tweet media one
Tweet media two
Tweet media three
Tweet media four
58
2K
5K
@ClausWilke
Claus Wilke
6 years
Very excited to announce my latest project, a book on data visualization. Approximately half of the book is written, and all completed chapters are now available as online preview. #rstats #dataviz
Tweet media one
Tweet media two
Tweet media three
Tweet media four
59
1K
4K
@ClausWilke
Claus Wilke
5 years
I've set up a website with annotated examples of how to make publication-quality figures with ggplot2. #rstats #ggplot
Tweet media one
Tweet media two
Tweet media three
Tweet media four
16
1K
3K
@ClausWilke
Claus Wilke
3 years
ggiraph + shiny is fantastic! ~30 lines of code. #dataviz #rstats
23
394
3K
@ClausWilke
Claus Wilke
3 years
Slides on principal components analysis with R. #rstats #dataviz
Tweet media one
Tweet media two
Tweet media three
Tweet media four
13
368
2K
@ClausWilke
Claus Wilke
4 years
Simple function plotting in ggplot2. Coming in the next release, early this summer. #rstats #ggplot2
Tweet media one
Tweet media two
Tweet media three
23
310
2K
@ClausWilke
Claus Wilke
4 years
I also don’t get this whole R vs python thing. They clearly have got very different strengths and weaknesses. R: analyze tabular data and make visualizations. python: strangle small mammals to death and crush their bones.
@MaureenBug
Dr Maureen Berg
4 years
tired: R is bad, you should learn python wired: literally anything else
17
11
295
18
299
2K
@ClausWilke
Claus Wilke
5 years
I received the first hardcopy of Fundamentals of Data Visualization. Preorders should start to ship very soon. You can order your copy here:
Tweet media one
Tweet media two
Tweet media three
Tweet media four
15
247
2K
@ClausWilke
Claus Wilke
4 years
Now on CRAN: ggtext. Improved text rendering support for ggplot2. #rstats
Tweet media one
Tweet media two
Tweet media three
Tweet media four
19
341
2K
@ClausWilke
Claus Wilke
3 years
Slides on redundant coding. #rstats #dataviz
Tweet media one
Tweet media two
Tweet media three
Tweet media four
17
207
2K
@ClausWilke
Claus Wilke
3 years
The most important topic for all ggplot2 users: Getting things into the right order. #rstats #ggplot2
Tweet media one
Tweet media two
Tweet media three
Tweet media four
7
220
1K
@ClausWilke
Claus Wilke
7 years
Over the years, movies have converged to a length of ~100 min. 4 lines of code with ggjoy. #rstats
Tweet media one
31
655
1K
@ClausWilke
Claus Wilke
5 years
Draft of the last missing chapter: Visualizing uncertainty. #dataviz
Tweet media one
Tweet media two
Tweet media three
Tweet media four
11
289
1K
@ClausWilke
Claus Wilke
5 years
As news outlets are creating increasingly complex data analysis and visualization projects, we should start demanding fully reproducible analysis scripts with every story.
@DataProgress
Data for Progress
5 years
We re-created this NYT plot. The chart does not actually show a trendline and the relationship is mostly explained by Social Security.
Tweet media one
Tweet media two
47
346
2K
19
315
1K
@ClausWilke
Claus Wilke
5 years
Markdown text inside ggplot2. #rstats #ggplot2
Tweet media one
Tweet media two
19
224
948
@ClausWilke
Claus Wilke
4 years
New blog post: PCA tidyverse style. I've been struggling with doing PCA using an idiomatic tidyverse approach. Now I think I've figured it out. #rstats #tidyverse
7
200
937
@ClausWilke
Claus Wilke
4 years
The ggplot2 extension gallery now lives at . Please update your bookmarks. The old link is no longer safe to visit. #rstats #ggplot2
Tweet media one
7
242
851
@ClausWilke
Claus Wilke
7 years
4. At that point, a PhD would not be a viable choice anymore, except for the independently wealthy.
22
309
782
@ClausWilke
Claus Wilke
6 years
Now on CRAN: ggridges 0.5, with support for shading by probability, points overlaid on density curves, rainclouds, and rugs. #rstats #dataviz #ggridges
Tweet media one
Tweet media two
Tweet media three
Tweet media four
10
323
789
@ClausWilke
Claus Wilke
7 years
2. Currently, STEM PhDs make about $30K per year and get tuition wavers. They can do their PhD without taking on debt, but barely so.
42
285
717
@ClausWilke
Claus Wilke
5 years
Attention #rstats users: In a few weeks, I'm going to release #cowplot 1.0, and there are going to be some important changes from the current release. I encourage you to check out the development version now and verify things work for you. Thread.
Tweet media one
5
192
739
@ClausWilke
Claus Wilke
7 years
3. If they had to pay taxes on a calculated $50-$60K income (incl. tuition), the actual income would not cover living expenses anymore.
9
299
683
@ClausWilke
Claus Wilke
10 months
@kareem_carr Alternative explanation: 1/3 = 0.33333... 2/3 = 0.66666... 3/3 = 1/3 + 2/3 = 1 3/3 = 0.33333... + 0.66666... = 0.99999... Therefore, 1 = 0.99999...
14
20
705
@ClausWilke
Claus Wilke
4 years
Spatial plotting just improved a lot in the development version of ggplot2. In a nutshell, you can now mix and match regular geoms with `geom_sf()` and `coord_sf()`. If you're doing any geospatial plotting, please test this out. 1/n #rstats #ggplot2
Tweet media one
6
146
699
@ClausWilke
Claus Wilke
5 years
Things are getting real. Hard-copy proofs of "Fundamentals of Data Visualization." #dataviz
Tweet media one
Tweet media two
Tweet media three
Tweet media four
13
74
692
@ClausWilke
Claus Wilke
6 years
Sad to announce I'll have to abandon my project of writing a book on #dataviz entirely in #rstats , with all figures programmatically generated. I just learned it is not possible. Ignore the 21 chapters already online. They are a mirage.
10
66
672
@ClausWilke
Claus Wilke
3 years
Functions and functional programming. #rstats
Tweet media one
Tweet media two
Tweet media three
Tweet media four
9
89
652
@ClausWilke
Claus Wilke
3 years
Interactive plots with ggiraph. #rstats #dataviz
Tweet media one
Tweet media two
Tweet media three
Tweet media four
5
93
647
@ClausWilke
Claus Wilke
4 years
For comparison, the budget of the National Science Foundation, which funds all areas of scientific research in the US, is $7,100,000,000.
@Public_Citizen
Public Citizen
4 years
The NYPD’s budget is $6,000,000,000 That’s more than the city spends on health, homelessness, youth development and workforce development combined.
2K
69K
190K
5
268
633
@ClausWilke
Claus Wilke
7 years
8. The author of the quoted article doesn’t even understand the gravity of this situation.
9
117
532
@ClausWilke
Claus Wilke
6 years
I've now mostly figured out how to implement bivariate color scales in #ggplot2 . The guide box still needs some visual tweaking, though. @lenkiefer @hadleywickham
Tweet media one
Tweet media two
11
110
571
@ClausWilke
Claus Wilke
7 years
7. So, no more STEM PhDs at state schools in the US.
5
194
527
@ClausWilke
Claus Wilke
2 years
This got a lot of attention. I thought everybody knew about the book (and the free website) by now, but maybe not.
@rmarkdown
R Markdown
2 years
You can read the free ebook "Fundamentals of Data Visualization" by Claus O. Wilke on #dataviz with #rstats on the following website of the named author:
Tweet media one
18
794
3K
7
94
545
@ClausWilke
Claus Wilke
6 years
I spent way too much time this weekend writing orthographic projection code. Now that it works, let's celebrate with a spinning globe. #rstats #dataviz
10
49
521
@ClausWilke
Claus Wilke
7 years
ggjoy 0.2.0 is now on CRAN. Plenty of new geoms, stats, features vs. 0.1.0. #rstats #joyplot #ggjoy #ggplot #dataviz
Tweet media one
Tweet media two
Tweet media three
5
193
498
@ClausWilke
Claus Wilke
6 years
I find this quite useful to understand the confidence region of a linear regression.
11
144
494
@ClausWilke
Claus Wilke
6 years
The latest #ggplot2 has a `clip = "off"` option to allow drawing outside of the plot panel. This allows for all sorts of neat plotting tricks. E.g., direct labeling. ("Toyota Corolla" extends beyond the plot area.) #rstats #dataviz
Tweet media one
Tweet media two
11
110
475
@ClausWilke
Claus Wilke
5 years
The next version of the isoband package will be able to draw labeled contour lines. #rstats
Tweet media one
Tweet media two
7
71
470
@ClausWilke
Claus Wilke
7 years
5. Maybe private schools could somehow work around this by charging different tuition for grad students and undergrads.
6
133
434
@ClausWilke
Claus Wilke
7 years
6. But state schools would likely not be able to do that, due to various laws and regulations.
7
136
440
@ClausWilke
Claus Wilke
6 years
I had the evolution once. So painful. The selection I could handle, but the drift is tough. And it's so unpredictable.
@newscientist
New Scientist
6 years
Evolution is making us sick and for the first time we can stop it
Tweet media one
9
61
210
6
68
452
@ClausWilke
Claus Wilke
7 years
I've always wanted to make a plot like this. #rstats #ggplot #cowplot
Tweet media one
Tweet media two
10
129
461
@ClausWilke
Claus Wilke
5 years
Designer: I don't have software to make a pie chart. Manager: Just manually draw two differently colored wedges and write "78%" into the bigger one. Nobody will be able to tell if it's off by a little.
14
69
455
@ClausWilke
Claus Wilke
4 years
This is why we’ve gone away from installing software on student computers in my data science class. Installing python is literally harder than anything you could reasonably do with it (in an intro class) once it’s installed properly.
@daniela_witten
Daniela Witten
4 years
I took my 1st CS course at age 18. I almost dropped it 3 days in b/c I couldn’t install the software. My friend had to help me install it. We are married now. Sometimes I make him install the updates on my Mac, for old times sake. True story.
17
24
631
38
52
447
@ClausWilke
Claus Wilke
2 years
Mood. #rstats
Tweet media one
9
45
440
@ClausWilke
Claus Wilke
3 years
Slides and worksheet on k-means clustering in R. #rstats #dataviz
Tweet media one
Tweet media two
Tweet media three
Tweet media four
3
83
440
@ClausWilke
Claus Wilke
2 years
Kids: "Mom, can we have an AI?" Mom: "We already have an AI at home." AI at home:
Tweet media one
5
44
430
@ClausWilke
Claus Wilke
7 years
Simulate color-vision deficiency in #rstats . Works on any R figure, on the fly. New work with @clairemcwhite . #ggplot @hadleywickham
Tweet media one
9
228
416
@ClausWilke
Claus Wilke
6 years
I have posted two new chapters for my #dataviz book and updated a few others. 1. (new) Visualizing nested proportions
Tweet media one
Tweet media two
Tweet media three
Tweet media four
6
116
405
@ClausWilke
Claus Wilke
4 years
I'm receiving feedback that (paraphrased) installing software "is easy" or "builds character" or "is a critical skill." My response: I doubt you've ever taught programming at scale to non-CS majors. Thread.
@ClausWilke
Claus Wilke
4 years
This is why we’ve gone away from installing software on student computers in my data science class. Installing python is literally harder than anything you could reasonably do with it (in an intro class) once it’s installed properly.
38
52
447
35
71
408
@ClausWilke
Claus Wilke
6 years
When you want to make sure nobody in the audience actually pays any attention to your data. @hadleywickham @kara_woo
14
69
394
@ClausWilke
Claus Wilke
6 years
This is a very nice intuitive explanation of how machine learning works.
3
126
392
@ClausWilke
Claus Wilke
5 years
I prepared step-by-step instructions for reproducing a few of the figures from my book. #rstats #ggplot2 #dataviz
Tweet media one
Tweet media two
Tweet media three
Tweet media four
4
97
362
@ClausWilke
Claus Wilke
4 years
Happy to report that density contours will be much improved in the next ggplot2. #rstats #ggplot
Tweet media one
Tweet media two
Tweet media three
Tweet media four
4
42
367
@ClausWilke
Claus Wilke
7 years
100%. I like to say: The lab environment is more important than the specific science you'll be doing.
6
154
355
@ClausWilke
Claus Wilke
5 years
I just merged support for one of the most frequently requested features into the ggplot2 development branch: Plot titles that span the entire plot. #rstats #ggplot2
Tweet media one
11
41
353
@ClausWilke
Claus Wilke
6 years
It seems that the graduate student tax is in the senate tax bill as well. Whatever universities will do to address this issue, this will cause a resource drain from the US research enterprise. Bye-bye US leadership in STEM.
22
247
333
@ClausWilke
Claus Wilke
4 years
Me: Software installs are an unnecessary distraction in intro data science classes. Cloud computing is it. Women: Yup, agreed. Men:
Tweet media one
19
26
340
@ClausWilke
Claus Wilke
6 years
Before Slack, you would have to look in five different places. After Slack, it's six. 😜
@johnofa
John O'Farrell
6 years
Nice! “Before Slack, you would have to go look in five different places even to find a file,” says @SlackHQ 's @aunder . “Was it a PDF? Was it a file in the cloud? Slack serves as the common denominator that allows you to search across all of them.”"
1
5
41
2
34
343
@ClausWilke
Claus Wilke
3 years
Handling overlapping points #dataviz #rstats
Tweet media one
Tweet media two
Tweet media three
Tweet media four
5
45
317
@ClausWilke
Claus Wilke
7 years
I wrote a minimal R package to make joyplots with ggplot2. Pull requests welcome. #rstats
Tweet media one
Tweet media two
Tweet media three
8
98
313
@ClausWilke
Claus Wilke
6 years
I continue to be amazed that I can make a figure like this one in ~15 lines of #ggplot2 code. (And half of that is theme code to get the styling right. The logic is only 6 lines.) #rstats #dataviz
Tweet media one
11
51
298
@ClausWilke
Claus Wilke
4 years
When your tiny little R package becomes a ggplot2 dependency. #rstats
Tweet media one
3
12
299
@ClausWilke
Claus Wilke
3 years
I'm using xaringan for slides, learnr for interactive worksheets, and distill for the website. All sources are available on github.
4
22
299
@ClausWilke
Claus Wilke
6 months
What is the value of a PhD? The other day I commented on the financial value, but let's dive a little bit deeper into the value in terms of personal growth and development of career skills. (This is long. Click to expand and read.) Does it make sense to…
@ClausWilke
Claus Wilke
6 months
Our students get 6-figure offers straight out of grad school. Attended a defense last week (not my student, but I'm on the committee) where the student has a $210k/yr offer and was contemplating whether she should take it or not.
9
12
168
10
82
291
@ClausWilke
Claus Wilke
1 year
Some thoughts about "chain risk" and art NFTs on Ethereum and Tezos. 1/
36
105
274
@ClausWilke
Claus Wilke
4 years
Anthony Fauci.
Tweet media one
9
70
278
@ClausWilke
Claus Wilke
6 years
Just posted two new chapters for my book. #dataviz 1. Visualizing associations among quantitative variables
Tweet media one
Tweet media two
Tweet media three
Tweet media four
3
72
276
@ClausWilke
Claus Wilke
4 years
All my recent work was so I could make plots like this. @hadley didn't like it when I proposed this originally, but I think it's pretty nifty. #rstats
Tweet media one
Tweet media two
13
28
272
@ClausWilke
Claus Wilke
4 years
I think I'll start a collection. #dataviz
Tweet media one
7
18
272
@ClausWilke
Claus Wilke
4 years
The ggridges package now has a dedicated website. #rstats #ggplot
1
56
272
@ClausWilke
Claus Wilke
5 years
I recently added this error message to ggplot2. But now I have second thoughts. Maybe a more precise error message would have been "object of type 'closure' is not subsettable". #rstats
Tweet media one
10
14
270
@ClausWilke
Claus Wilke
7 years
9. Update: See here for the links to the relevant law.
@ClausWilke
Claus Wilke
7 years
1. Some tax-law updates on the earlier tweets. Tuition waivers are currently tax exempt due to §117(d), see here:
6
34
41
4
97
254
@ClausWilke
Claus Wilke
4 years
The log-scale plot of case fatality rates of COVID-19 vs age highlights an interesting finding: COVID-19 is consistently worse than the seasonal flu, at all ages. There is no age-dependent effect, just an overall shift in outcomes to the worse.
14
124
256
@ClausWilke
Claus Wilke
6 years
Visualizing the bootstrap. #rstats #AnimateAnyday
3
73
257
@ClausWilke
Claus Wilke
7 years
@RVAwonk It would be so helpful if Twitter just made it a policy to mark bots somehow, similar to the blue checkmark.
10
28
216
@ClausWilke
Claus Wilke
5 years
Automatic text reflow inside a box in ggplot2. #rstats #ggtext
7
45
239
@ClausWilke
Claus Wilke
5 years
ICYMI: This is an incredibly important release for anybody who needs to deal with colors. I couldn't have written my book without this package. In particular, lightening and darkening of colors on the fly is so useful.
@AchimZeileis
Achim Zeileis @[email protected]
5 years
New release of #rstats pkg #colorspace : refined and named palettes, #ggplot2 color scales, visualization and assessment, interactive color apps (shiny + Tcl/Tk), color vision deficiency emulation, and much more. #dataviz #endrainbow (1/12)
Tweet media one
Tweet media two
Tweet media three
2
236
718
0
49
239
@ClausWilke
Claus Wilke
6 years
While working on the next ggplot2 release, I've come to believe that for every bug in the ggplot2 code base there's at least one published R package that uses it as a feature.
3
22
236
@ClausWilke
Claus Wilke
6 years
Direct link to the book:
4
65
231
@ClausWilke
Claus Wilke
5 years
World maps, drawn in ggplot2. #rstats #ggplot
Tweet media one
Tweet media two
Tweet media three
Tweet media four
4
44
226
@ClausWilke
Claus Wilke
4 years
I don't have a SoundCloud, but I've written a book on data visualization you can read for free here. It's not about programming, and there's neither R nor python in the book.
3
53
223
@ClausWilke
Claus Wilke
5 years
My talk at #rstudioconf about visualizing uncertainty is now online.
1
58
224
@ClausWilke
Claus Wilke
4 years
The fact that the dplyr function `summarize()` can now generate multiple rows per group is incredibly useful. #rstats #tidyverse
Tweet media one
5
28
221
@ClausWilke
Claus Wilke
3 years
Compound figures. #dataviz #rstats
Tweet media one
Tweet media two
Tweet media three
Tweet media four
1
32
219
@ClausWilke
Claus Wilke
5 years
Scheduled release date is April 15. You can pre-order on Amazon now.
7
39
212
@ClausWilke
Claus Wilke
3 years
Dimension reduction 2: When regular PCA doesn't cut it. #rstats #dataviz
Tweet media one
Tweet media two
Tweet media three
Tweet media four
3
24
214
@ClausWilke
Claus Wilke
4 years
Says the person whose regular tag line is "humans don't understand exponential growth."
@elonmusk
Elon Musk
4 years
The coronavirus panic is dumb
38K
261K
1M
6
22
208
@ClausWilke
Claus Wilke
5 years
I've hit exactly 10,000 citations on Google Scholar today. 🍾🍾🍾
Tweet media one
8
1
205
@ClausWilke
Claus Wilke
6 years
Tweet media one
3
46
203
@ClausWilke
Claus Wilke
5 months
Why is it that most people in tech are so poorly informed about biology? No we couldn't make animals smarter. We literally don't have the technology. We don't even know the biological basis of "being smart." Maybe in 50 years. Not today. Not next year. Not this decade.
@rmcentush
Ryan McEntush
5 months
why aren’t we “uplifting” other species? this is a classic sci fi plot point that I feel like we just skipped over if we wanted to make animals much smarter, we could, no?
613
98
3K
28
29
199
@ClausWilke
Claus Wilke
2 years
@balajis That assumes it's easier to rebuild all physical infrastructure than to get self-driving to work. I wouldn't make that assumption. Self-driving will be solved soon (couple of years at most). 1/n
12
1
189
@ClausWilke
Claus Wilke
5 years
Create ROC and precision-recall curves #tidyverse style. New #rstats package under development by @dariyasydykova .
Tweet media one
4
52
193
@ClausWilke
Claus Wilke
6 years
ggplot2 3.1.0 was released yesterday and it seems nobody has noticed yet. I think that's a good sign. #rstats
2
30
190