This project is built during my python stack internship @ Infosys Springboard.
- Python (version: 3.13.0 or above)
- Python IDE (Visual Studio Code / PyCharm / IDLE / Eclipse)
- Libraries : Streamlit, Beatiful Soup, Selenium
A website "DealsHunter" is built using streamlit in python, which scrapes the data from the website DealsHeaven[https://dealsheaven.in/] using beautifulsoup and requests libraries.
In the folder Milestone 1 -> Run app.py:
streamlit run app.py
A status bar is added and the UI is modified for visual appeal. Also, help section is provided.
In the folder Milestone 1 -> Run milestone_1.py:
streamlit run milestone_1.py
The DealsHunter website is enhanced further for better user experience and filtering by category is integrated. The products are displayed with their respective images and other details.
In the folder Milestone 2 -> Run milestone_2_t1.py:
streamlit run milestone_2_t1.py
Using Selenium, we scrape the states and their respective libraries information from the Public Libraries website. Using sqlite3, we store the scraped data in 2 tables, which are related to each other by having common state id. Using selenium, the scraped information of libraries for s specific chosen state is displayed.
In the folder Milestone 2 -> Run milestone_2_t2.py:
streamlit run milestone_2_t2.py
Using selenium, we scrape the job cards from the Behance Job Listings, up until the pages scrolled (here, default 10). Then, a gui is built using streamlit, where a dynamic search bar (which helps you search easier by providing pre-existing options in a drop down), is implemented and the corresponding job listings are displayed as cards. So, the scraper file must be executed before ui file, since it scrapes and stores the data.
In the folder milestone 3 -> Run the scraper file:
python scraper.py
Then run the ui file:
streamlit run ui.py