Published on Friday, November 19, 2021 and tagged with research and CS533.
One of my projects this year has been to refactor and upgrade our Book Data Tools to be easier to work with. Along the way I found some bugs in the original data integration, and the process of finding these bugs and assessing their impact forms a useful case study for some of the principles I teach in my data science class.
Published on Sunday, February 7, 2021 and tagged with productivity.
Ritual is a powerful tool for navigating life. David Allen, in Getting Things Done, promotes a weekly review to take stock of your current Next Action lists, identify projects that need to be moved up (or down) the stack, and plan what’s next.
Regular rituals are an important part of my work management, linking together my notebook, runway, and self-care. My particular practices are modeled most closely after the tactical practices of implementing Michael Linenberger’s One Minute To-Do List on paper.
Published on Thursday, February 4, 2021 and tagged with productivity.
In “Holes”, I noted that one of the holes in my work planning and management is a good tool for tracking and planning upcoming work over time. I have now filled that hole with my Runway Document, which also replaces “The Wall”.
I use Jupyter notebooks extensively for data analysis and exploration. It’s fantastic to be able to quickly see output, including plots, and have it all saved and persisted and viewable on GitHub.
However, when it comes time to prepare for publication, I need to save high-resolution and/or vector versions of the plots for use in LaTeX or Word. The display in Jupyter does not have nearly high enough resolution to copy and paste into a document and have it look acceptably good.
Most of my projects, therefore, have a convenience function for plots that are going into the paper. This function saves the plot to disk (in both PDF and 600dpi PNG formats) and returns it so it can also be displayed in Jupyter. That way I don’t have two copies of the plot code — one for saving and one for interactive exploration — that can get out of sync.
Published on Sunday, August 23, 2020 and tagged with teaching and CS533.
This fall is my third time teaching CS533 (Introduction to Data Science). I co-developed the class with Casey Kennington and taught the first offering in 2017.
This intro is a class I had long wanted to create — when I was on the job market the first time around, it was my answer to ‘what class would you like to create?’ — but I haven’t yet really been able to achieve what I wanted with it. This fall I am taking a step back and rebuilding it. I hope it’s successful.
This post discusses my design goals and parameters for the class, and its overall structure. I hope to expand on some specific aspects of it in future posts.