Vitamin D and Covid: We don't need to wait for more data. May 15, 2020 I recently posted this graph on Twitter which suggests that Covid-19 mortality is related to latitude: There’s a particular type of person on the internet who sees a graph like this and reaches deep into their data science boot-camp memories to exclaim “Correlation doesn’t imply causation!” or “There’s no randomized clinical trial!” or even “There are differences in testing strategy!” These responses are stupid not because they’re wrong but because they ignore how decision-making works. ...
Covid Tax May 14, 2020 Let’s say you are a market-oriented dictator who is worried about Covid-19 and wants to reduce the size and number of social gatherings. You understand that gatherings generate a lot of value for your population and so you want a mechanism to ban the least valuable gatherings while allowing the high value ones to continue. For example, you would probably want to reduce the number of work karaoke parties while allowing people to go to their best friend’s wedding. ...
Vitamin D and Covid-19 May 3, 2020 In a recent piece about the puzzling ways that Covid-19 has spread across the world the New York Times explores a number of possible theories about why Covid-19 has affected some countries more grievously than others, including “demographics, culture, environment, and the speed of government responses.” I think Vitamin D status should probably be included in this conversation. A tale of two countries Canada and Australia have had pretty similar Covid-19 timelines. ...
Why I use R Dec 30, 2019 They said the war was over… Over the last couple of years prominent members of both the R and Python communities have tried to move past the language wars and support both R and Python workflows. This makes sense intellectually; after all, R and Python are not all that different in the scheme of things, and so we should let people use whichever language they find more productive. This conversation manifests very differently in the workplace, however. ...
Technical debt for data scientists Apr 19, 2019 Technical debt is the process of avoiding work today by promising to do work tomorrow. A team might identify that there’s a small time window for a particular change to be implemented and the only way they can hit that window is to take shortcuts in the development process. They might soberly calculate that the benefits of getting something done now are worth the costs of fixing it later. This kind of technical debt is similar to taking out a mortgage or small business loan. ...
Testing machine learning models with testthat May 1, 2018 Automated testing is a huge part of software development. Once a project reaches a certain level of complexity, the only way that it can be maintained is if it has a set of tests that identify the main functionality and allow you to verify that functionality is intact. Without tests, it’s difficult or impossible to identify where errors are occurring, and to fix those errors without causing further problems. ...
Advice for non-traditional data scientists Aug 29, 2017 I have a pretty strange background for a data scientist. In my career I’ve sold electric razors, worked on credit derivatives during the 2008 financial crash, written market reports on orthopaedic biomaterials, and practiced law. I started programming in R during law school, partly as a way to learn more about data visualization and partly to help analyze youth criminal justice data. Over time I came to enjoy programming more than law and decided to make the switch to data work about three years ago. ...
Why you should work remotely, even if you're not remote May 3, 2017 My last job was as a data scientist at Upworthy, which is a 100% remote company. Prior to starting the position I was worried about whether I could be happy and productive on a remote team. I wondered how project planning would work, whether it would be terribly lonely, and how communication would function when things got hectic. What I discovered is that the company was one of the more efficient and friendly places that I’ve worked, and I think the changes that they have made to accommodate remote work deserve much of the credit. ...
Data Visualization and UI design Apr 13, 2017 Over the past couple of months, I’ve been rebuilding the Shambhala Meditation Timer using React Native and Redux. The idea behind the Shambhala app was to create a kind of modular framework for building meditation timers in order to allow people to create complex timers out of simple components. The three build blocks for a timer are time intervals, gong sounds, and recorded audio contemplations, and the user can stack these building blocks to create whatever kind of meditation session they want. ...
R for Excel Users Feb 2, 2017 Like most people, I first learned to work with numbers through an Excel spreadsheet. After graduating with an undergraduate philosophy degree, I somehow convinced a medical device marketing firm to give me a job writing Excel reports on the orthopedic biomaterials market. When I first started, I remember not knowing how to anything, but after a few months I became fairly proficient with the tool, and was able to build all sorts of useful models. ...