HTMLParser

This tool parses the source code of an HTML page. Its purpose is to collect information quickly.

Made by AnataarXVI

How it works

HTMLParser will send a request to the indicated website and retrieve information depending on the option chosen.
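The kind of collection described above can be sketched with the Python standard library alone. This is an illustration, not the project's actual code: the names `PageCollector`, `collect` and `fetch` are hypothetical, and the real tool may rely on third-party packages listed in requirements.txt.

```python
from html.parser import HTMLParser
from urllib.request import urlopen


class PageCollector(HTMLParser):
    """Hypothetical collector; not the project's actual class."""

    def __init__(self):
        super().__init__()
        self.comments = []  # text of <!-- ... --> comments
        self.scripts = []   # src attribute, or inline body, of <script> tags
        self.forms = []     # (action, method) pair for each <form>
        self.links = []     # href of each <a>
        self._in_script = False

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "script":
            if attrs.get("src"):
                self.scripts.append(attrs["src"])
            else:
                # Inline script: its body arrives through handle_data.
                self._in_script = True
        elif tag == "form":
            self.forms.append((attrs.get("action") or "", attrs.get("method") or "get"))
        elif tag == "a" and attrs.get("href"):
            self.links.append(attrs["href"])

    def handle_endtag(self, tag):
        if tag == "script":
            self._in_script = False

    def handle_data(self, data):
        if self._in_script and data.strip():
            self.scripts.append(data.strip())

    def handle_comment(self, data):
        self.comments.append(data.strip())


def collect(html):
    """Parse an HTML string and return the collected items by category."""
    parser = PageCollector()
    parser.feed(html)
    parser.close()
    return {
        "comments": parser.comments,
        "scripts": parser.scripts,
        "forms": parser.forms,
        "links": parser.links,
    }


def fetch(url):
    """Download a page and decode it with the charset the server reports."""
    with urlopen(url) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")
```

With these helpers, `collect(fetch(url))` would return one list per category, which roughly corresponds to what the -c, -s, -f and -l options report.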

Installation

Download the project: https://github.com/AnataarXVI/htmlparser.git

Install packages: pip install -r requirements.txt

Complete parsing

[Screenshot: example_image1]

Parsing script tag

[Screenshot: 2022-08-18_20-28]

Make a tree of the website

[Screenshot: 2022-09-11_19-46]
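One way such a directory tree could be derived is by nesting the path segments of the URLs found on a page. The sketch below is an assumption about the approach, not the tool's implementation; `build_tree` and `render` are hypothetical names.

```python
from urllib.parse import urlparse


def build_tree(urls):
    """Nest URL path segments into dictionaries, one level per segment."""
    tree = {}
    for url in urls:
        node = tree
        for segment in urlparse(url).path.split("/"):
            if segment:  # skip the empty strings produced by leading or doubled slashes
                node = node.setdefault(segment, {})
    return tree


def render(tree, indent=0):
    """Return the tree as a list of lines, indented four spaces per level."""
    lines = []
    for name in sorted(tree):
        lines.append("    " * indent + name)
        lines.extend(render(tree[name], indent + 1))
    return lines
```

Calling `print("\n".join(render(build_tree(urls))))` on a list of URLs would print one path segment per line, with children indented under their parent directory.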

Usage

Usage: python htmlparser.py [options] <url>

Options:
  --version      show program's version number and exit
  -h, --help     show this help message and exit
  -a, --all      Parse with all options (-c, -s, -f, -l)
  -c, --comment  Find all comments
  -s, --script   Find all scripts
  -f, --form     Find all forms
  -l, --link     Find all links
  -t, --tree     Make a tree of the website directories

  Examples:
    python htmlparser.py -a <url>
    python htmlparser.py -c <url>
    python htmlparser.py -s <url>
    python htmlparser.py -f <url>
    python htmlparser.py -l <url>
    python htmlparser.py -t <url>

This tool is used to analyze the source code of an HTML page by recovering tags and comments.
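The option set above could be declared roughly as follows. The layout of the help text resembles what Python's optparse module prints, but that is an assumption; the version string is a placeholder, and `build_parser` and `selected_sections` are hypothetical names rather than functions from the project.

```python
from optparse import OptionParser


def build_parser():
    """Declare the options listed in the usage text."""
    parser = OptionParser(
        usage="python htmlparser.py [options] <url>",
        version="%prog 0.0",  # placeholder: the README does not state a version
    )
    parser.add_option("-a", "--all", action="store_true", default=False,
                      help="Parse with all options (-c, -s, -f, -l)")
    parser.add_option("-c", "--comment", action="store_true", default=False,
                      help="Find all comments")
    parser.add_option("-s", "--script", action="store_true", default=False,
                      help="Find all scripts")
    parser.add_option("-f", "--form", action="store_true", default=False,
                      help="Find all forms")
    parser.add_option("-l", "--link", action="store_true", default=False,
                      help="Find all links")
    parser.add_option("-t", "--tree", action="store_true", default=False,
                      help="Make a tree of the website directories")
    return parser


def selected_sections(argv):
    """Return (url, sections to parse, tree flag) for a list of arguments."""
    parser = build_parser()
    options, args = parser.parse_args(argv)
    if len(args) != 1:
        parser.error("exactly one <url> is required")
    names = ["comment", "script", "form", "link"]
    if options.all:
        chosen = names  # -a expands to -c, -s, -f and -l
    else:
        chosen = [name for name in names if getattr(options, name)]
    return args[0], chosen, options.tree
```

Treating -a as shorthand for the four parsing flags keeps the dispatch logic in one place: the rest of the program only needs to look at the returned list of sections and the tree flag.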

Contributing

If you like the project, do not hesitate to share it. I am open to any suggestions for improvement.