Skip to content

Tool to parse information in code source of a html page

License

Notifications You must be signed in to change notification settings

AnataarXVI/HTMLParser

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

HTMLParser

This Tool can parse information in code source of a html page.Its role is to collect information quickly.

Made by AnataarXVI

How it works ?

HTMLParser will send a request to the indicated website and retrieve information depending on the option chosen.

Installation

Download the project : https://github.com/AnataarXVI/htmlparser.git Install packages : pip install -r requirements.txt

Complete parsing

example_image1

Parsing script tag

2022-08-18_20-28

Make a tree of the website

2022-09-11_19-46

Usage

Usage: python htmlparser.py [options] <url>

Options:
  --version      show program's version number and exit
  -h, --help     show this help message and exit
  -a, --all      Parse with all options (-c, -s, -f, -l)
  -c, --comment  Find all comments
  -s, --script   Find all scripts
  -f, --form     Find all forms
  -l, --link     Find all links
  -t, --tree     Make a tree of the website directories

  Examples:
    python htmlparser.py -a <url>
    python htmlparser.py -c <url>
    python htmlparser.py -s <url>
    python htmlparser.py -f <url>
    python htmlparser.py -l <url>
    python htmlparser.py -t <url>

This tool is used to analyze the source code of an html page by recovering tags and comments

Contributing

If you liked the project do not hesitate to share it. I am free to any improvement proposal.

About

Tool to parse information in code source of a html page

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages