Google Colab is a free, cloud-hosted Jupyter Notebook environment where you can write and run Python code directly in your browser. It provides access free of charge to Google Cloud GPUs and TPUs, which is a game-changer for running AI models and simplifies project collaboration.
In December, we shared how the Data Science Agent in Colab creates notebooks for trusted testers using Gemini, removing tedious setup tasks like importing libraries, loading data, and writing boilerplate code. Trusted testers are enthusiastic about the Data Science Agent, reporting they are able to streamline workflows and uncover insights faster than ever before.
Today, we’re excited to bring Data Science Agent to Colab users age 18+ and in select countries and languages. This expands our university partnerships to help research labs save time on data processing and analysis by generating complete, working Colab notebooks from simple natural language descriptions.
2. Add your data: Upload your data file.
3. Describe your goals: Describe what kind of analysis or prototype you want to build in the Gemini side panel (e.g., "Visualize trends," "Build and optimize prediction model", “Fill-in missing values”, “Select the best statistical technique”).
4. Watch the Data Science Agent get to work: Sit back and watch as the necessary code, import libraries, and analysis is generated in a working Colab notebook.
Our Data Science Agent has also landed in 4th place on the DABStep: Data Agent Benchmark for Multi-step Reasoning on HuggingFace, ahead of ReAct agents based on GPT 4.0, Deepseek, Claude 3.5 Haiku, Llama 3.3 70B.
Give it a try by simply uploading some data and outlining your data analysis objectives from the Gemini side panel. You can explore datasets on Kaggle or Data Commons, but here are some sample data and prompts to try:
We hope this transforms your data analysis workflow. We can’t wait to hear what you think, please join our Google Labs Discord community and the #data-science-agent
channel to connect.