WORKSHOP - Web Scraping: Make the internet your playground

Schedule

Mon Aug 19 2024 at 04:00 pm to 06:00 pm

Location

3 Rue Rossini | Paris, IL

Advertisement
WORKSHOP - Web Scraping: Make the internet your playground
About this Event

Organizers

  • Raphael Vienne, Head of AI at datacraft
  • Rémy Gasmi, Data Scientist Intern at datacraft

Workshop introduction:

Scraping has been more and more recognized since LLMs became a thing, as these models rely on several petabytes of internet data for pre-training, that were extracted from web crawlers.

Every year, the internet produces tons of extremely valuable data. Some individuals might be interested in either collecting relevant data from the internet automatically, or even automate some actions online.

Both of these considerations can be done with scraping.

In this workshop, we will try to introduce participants to scraping, as well as discussing legal considerations regarding this practice.

Workshop summary:

In this workshop, we will:

  1. Introduce scraping libraries as well as legal considerations regarding scraping (when not to scrape).
  2. Start scraping on a simple example (extracting information from a wiki).
  3. Carry out a more complex scraping pipeline (scrape datacraft agenda and incoming events).
  4. Finally, let participants build their own scraping project (on the website of their choice).

If you thought of automating an online task once in a while, or if you’re just curious about scraping with python, this workshop is for you!

Come and benefit from the experience of our team on this domain.


2408-datacraft-Scraping

Advertisement

Where is it happening?

3 Rue Rossini, 3 Rue Rossini, Paris, France

Event Location & Nearby Stays:

Tickets

EUR 0.00

datacraft

Host or Publisher datacraft

It's more fun with friends. Share with friends