Project Structure#

I would like to defer this discussion to the discussion presented by Cookie Cutter Data Science. The template that I use for my blank_project repository borrows from their template.

The key take-aways from the Cookie Cutter Data Science project structure discussion are these:

  • Other people will thank you

  • You will thank you

  • Data is immutable (pull fresh when you can)

  • Analysis is a DAG (build system)

  • “Build from the environment up” (Use a virtual environment and start sparse)

  • Keep secrets out of version control. Use .env files.