Getting Started With an OPF Data Science Project

The outline below provides some helpful links to kick start the creation of a new data science project with the Operate First Data Science Community, outlining some of our suggested best practices and tool-kits.


Organize and Structure a Project

  • Review our project structure doc and get a sense for the phases of a data science project.

  • Create a public git repository using our project template. Feel free to use the template directly or initialize your project via the cookiecutter tool.

  • Start a github project board to track tasks and share ideas. Find more info and examples here

Set Up Bots and CI Tools

  • Set up the aicoe-ci tools for your repo. These tools can aid in various aspects of your development workflow by running pre-commit checks, build checks, triggering pipelines to build images and more automatically. Find more info on setting up these tools here.

Use Open Data Hub and JupyterHub instances

  • Sign in to JupyterHub here

  • Watch an intro tutorial on how to get started.

  • Learn how to monitor your resources here.

Join the Community!

  • Attend our meetups.

  • Say “Hi” on Slack in the #data-science channel.

  • Open an issue.