This repository contains Jupyter Notebooks for web scraping, transforming and loading flight data from 2 online travel companies.
-
Updated
Apr 28, 2023 - Jupyter Notebook
This repository contains Jupyter Notebooks for web scraping, transforming and loading flight data from 2 online travel companies.
This Repository consists of all the Jupyter Notebook (.ipynb) files, python files, excel sheets which are a part of the BCGX's Gen AI Virtual Job Simulation that is hosted on Forage.
Different notebooks which contain Web Data Extraction using python language.
This repository contains notebooks on various datasets as a practice on data analysis, all notebooks include: Data Cleaning. Data Visualization. Exploratory Data Analysis.
This Jupyter Notebook uses Pandas and data visualization libraries. We'll work with the famous Titanic dataset.
Extract text content from an HTML page, process it, and extract unique words from the processed text. This notebook utilizes various text processing techniques including cleaning, normalization, tokenization, lemmatization or stemming, and stop words removal.
Add a description, image, and links to the data-extraction topic page so that developers can more easily learn about it.
To associate your repository with the data-extraction topic, visit your repo's landing page and select "manage topics."