From Raw Data to ML-Ready: Dataset Curation with Pandas

Schedule

Tue Feb 24 2026 at 02:00 pm to 04:00 pm

UTC-06:00

Location

John Crerar Library - Kathleen A. Zar Room | Chicago, IL

Advertisement
Presenter: Debasmita Samaddar
About this Event

Data Science and Machine Learning(ML) have come to play a crucial role in a wide range of domains, from biological and physical sciences and  engineering to finance and social science. In practice, however, most ML pipelines begin with datasets that need substantial curation before being used for any meaningful analytic purposes.

This workshop will focus on practical techniques for curating and analyzing datasets. Topics include handling missing values, working with mixed numerical and non-numerical data, and preparing data for downstream Machine Learning tasks. Pandas will be used for exploratory data analysis and data preparation, with scikit-learn introduced to demonstrate how curated data feeds into ML models.

Participants will work through hands-on exercises to explore dataset properties, identify common data quality issues, and develop strategies for transforming raw data into ML-ready inputs. The workshop will be conducted on the Midway HPC system, demonstrating workflows suitable for both local and high-performance computing environments.

  • Do you have a large dataset but aren’t sure how to prepare it for use in a Machine Learning tool?
  • Want to understand your data’s structure and properties before feeding into a Machine Learning pipeline?
  • Are missing, inconsistent, or messy values  breaking your ML pipeline?
  •  Have you heard of or used tools like “Pandas” and “Scikit-learn” but want a clearer, hands-on  understanding of how they fit into data preparation?

If the answer to any of these questions is “yes” – this workshop is for you.

Objectives: 


By the end of this workshop, participants will be able to:

  • understand the core functionalities of Pandas tools and the basic workflow of Scikit-learn.
  • Build an end-to-end  pipeline that transforms raw data into a trained ML model.
  • Apply demonstrated techniques to curate datasets  and train r ML models on their own data 

Level: Intermediate


Duration: 2 hours

Prerequisites: Working knowledge of Python. All participants are encouraged to bring a laptop with a Mac, Linux, or Windows operating system. Having an RCC account will be helpful to perform the exercises on Midway3.


Advertisement

Where is it happening?

John Crerar Library - Kathleen A. Zar Room, 5730 South Ellis Avenue, Chicago, United States

Event Location & Nearby Stays:

Tickets

USD 0.00

Icon
Know what’s Happening Next — before everyone else does.
Research Computing Center

Host or Publisher Research Computing Center

Ask AI if this event suits you:

Discover More Events in Chicago

NIFB Volunteer meal packing
Tue, 24 Feb at 09:00 am NIFB Volunteer meal packing

Northern Illinois Food Bank

VOLUNTEERING
Leadership Mastery:Inspire, Motivate & Lead Like a Pro! in Chicago,  IL
Tue, 24 Feb at 09:00 am Leadership Mastery:Inspire, Motivate & Lead Like a Pro! in Chicago, IL

For venue details reach us at: [email protected]

WORKSHOPS BUSINESS
Sewing Basics Workshop @ IRL1
Tue, 24 Feb at 11:00 am Sewing Basics Workshop @ IRL1

Idea Realization Lab at DePaul University

WORKSHOPS
Galentine\u2019s Day Mini Art Workshops! - Foam Flower Mini Workshop
Tue, 24 Feb at 01:30 pm Galentine’s Day Mini Art Workshops! - Foam Flower Mini Workshop

Hey, I Thought Of You

WORKSHOPS ART
Material Witness
Tue, 24 Feb at 05:00 pm Material Witness

1545 N Western Ave

ART EXHIBITIONS
Sip & Paint: Cheeky Stillife
Tue, 24 Feb at 05:30 pm Sip & Paint: Cheeky Stillife

LMN Wedge Studio & Gallery

ART FINE-ARTS
New Renovations of Adler & Sullivan's 1889 Auditorium Theatre
Tue, 24 Feb at 05:30 pm New Renovations of Adler & Sullivan's 1889 Auditorium Theatre

Charnley-Persky House Museum

ART THEATRE
Andrea Bocelli
Tue, 24 Feb Andrea Bocelli

United Center

CONTESTS TRIPS-ADVENTURES
The Dinner Detective True Crime M**der Mystery Dinner Show - Chicago, IL
Sat, 22 Jun at 06:00 pm The Dinner Detective True Crime M**der Mystery Dinner Show - Chicago, IL

Courtyard by Marriott Chicago Downtown/Magnificent Mile

PERFORMANCES ENTERTAINMENT
Learn Akashic Records Reading
Sat, 28 Sep at 01:00 pm Learn Akashic Records Reading

The Chakra Shoppe

WORKSHOPS HEALTH-WELLNESS
Learn How to Use a Pendulum
Wed, 02 Jun at 07:00 pm Learn How to Use a Pendulum

The Chakra Shoppe

WORKSHOPS ART
Meet and Create! - The Black Light Experience
Fri, 16 Jul at 06:30 pm Meet and Create! - The Black Light Experience

1438 E 52nd St

PARTIES WORKSHOPS
Group Past LIfe Regression
Tue, 05 Oct at 07:00 pm Group Past LIfe Regression

The Chakra Shoppe

WORKSHOPS
Learn to Use Crystals
Wed, 20 Oct at 07:00 pm Learn to Use Crystals

The Chakra Shoppe

BECOME A HOME-BASED TRAVEL AGENT | Burbank, IL
Thu, 24 Mar at 07:00 pm BECOME A HOME-BASED TRAVEL AGENT | Burbank, IL

6520 S Cicero Ave

VIRTUAL
MINDSHOP \u2122| Data Analysis for Management
Wed, 18 May at 07:00 pm MINDSHOP ™| Data Analysis for Management

Your Laptop

WORKSHOPS STORYTELLING
Ayodele Youth Program presents......Youth Dance Classes and Arts and Crafts
Sat, 04 Jun at 11:30 am Ayodele Youth Program presents......Youth Dance Classes and Arts and Crafts

Sherman Park

WORKSHOPS ART
Let\u2019s Read A Play: Come read aloud or listen
Fri, 13 Jan at 07:00 pm Let’s Read A Play: Come read aloud or listen

Green Shirt Studio

ART THEATRE
Espresso  at Home- Chicago
Fri, 03 Feb at 02:00 pm Espresso at Home- Chicago

Counter Culture Coffee Chicago

WORKSHOPS ART
Spinning Babies Birth Preparation Class - Chicago
Sun, 26 Feb at 09:00 am Spinning Babies Birth Preparation Class - Chicago

Birth Center of Chicago

WORKSHOPS

What's Happening Next in Chicago?

Discover Chicago Events