Clear Web OSINT Toolkit - Quick start guide

Disclaimer

These tools are intended for authorized (where applicable) and ethical (always) use only.

You are responsible for ensuring your use complies with all applicable laws, regulations, and terms of services.

Somewhat unsurprisingly, the author (I) does (do) not assume liability for any misuse.

Preface:

Scripts are named fairly self-descriptively, they can all be downloaded or one can pick and choose. If picking and choosing, make sure to also find and install the required dependencies from `requirements.txt`, they are labeled per tool.

`c_phone_osint.py` requires two free API keys to generate a full report, but it will work without them (just returns less data). More detailed instructions below and commented into `c_phone_osint.py`.

`c_ip_osint.py` can be supplemented with one free API key, again it will work without (just returns slightly less data). Optional paid API key integration for even more data, but this is again not needed. More detailed instructions below and commented into `c_ip_osint.py`.

You can run all of these without creating any API keys at all, just some of them work a little better with.

Setup

Assuming you have a recent version of python installed on your system:

Create a new folder in your desired location
Clone this repository or copy the desired .py files into that folder
Create a Python virtual environment inside that folder
Activate the virtual environment
Install dependencies:

   pip install -r requirements.txt

Run the tool:

   python <filename>.py

Note: If only using one tool, install only what you need by selecting specific packages from requirements.txt

Tools

Tool	File	Description
Phone OSINT	`c_phone_osint.py`	Phone number intelligence, spam detection, geocoding, clearnet mention scan
Domain OSINT	`c_domain_osint.py`	DNS, WHOIS, SSL, HTTP headers, tech fingerprint, risk score, page crawl, WP scan
IP OSINT	`c_ip_osint.py`	IP geolocation, ASN & routing, passive DNS, reverse IP, port probe, abuse reputation

███████████████████████████████████████████████ ███████████████████████████████████████████████

Phone OSINT Tool

Usage

python c_phone_osint.py

Optional flags

python c_phone_osint.py --save # Saves full report to JSON after scan completes

(sort of) Necessary setup for full functionality!!

This tool supports two optional API keys for additional output

You can set these in `c_phone_osint.py`

Generate an IPQS API key and set IPQS_API_KEY: str = '' # Put it in here
Generate a SerpAPI API key and set SERPAPI_KEY: str = '' # Put it in here

More detailed instructions can be found in the code. Yes it will work without.

Usage flow

You will be prompted for country code, area code, and the rest of the number separately
The main report is generated immediately after
You will then be asked whether to run a clearnet mention scan — default is no
If the scan finds more than 10 results, you will be prompted to save the full results as a .json file

How it works

Uses phonenumbers library to assess number validity, country, region, line type, carrier info, and timezone
Determines spam likelihood with external query to SkipCalls service
Determines rough geocoordinates with external query to Nominatim (openstreetmap)
If valid IPQS API key was provided, additional data will be added to the report like Do Not Call list status
If valid SerpApi API key was provided and clearnet mention scan ran, the tool will query SerpAPIs Google and DuckDuckGo search APIs for mentions of the number in different formats across the clear web

Screenshots

Report if no valid IPQS API key:

What IPQS Key adds to report:

Clearnet mention scan (only works with valid SerpAPI Key):

If more than ten results:

Notes:

Spam likelihood check with SkipCalls is far from perfect and will produce a lot of false negatives, but there are not many truly free services offering programmatic access to large datasets of known spam numbers so it will have to do until I find a better way

phonenumbers library carrier detection iffy particularly for U.S. numbers, paid services would do a better job. But would require payment.

Geocoding is Region/City level not very exact (This is expected)

IPQS can prove to be a very difficult service to deal with. They use aggressive duplicate free account prevention software, potentially making access to the API very difficult without paying them money. When creating an account for the first time, my advice is to do so while not connected to a VPN or public Wifi network, and from a browser like Google or Safari.

███████████████████████████████████████████████

Domain (URL) OSINT Tool

Usage

python c_domain_osint.py

Optional flags

python c_domain_osint.py --save # Saves full report to JSON after scan completes

Usage flow

You will be prompted for the target domain and a max page crawl limit (1-50, default 20)
DNS, WHOIS, SSL, HTTP headers, technology fingerprint, and calculated risk score run automatically
You will be prompted whether to run the URL threat scan
You will be prompted whether to run the port scan, which checks 14 common ports via raw socket against the resolved IP
You will be prompted whether to run the page crawl, which visits up to your specified limit of pages on the target domain, collecting emails, internal links, and external domain references
You will be prompted whether to save the report at the end, or pass --save to skip the prompt

How it works

Resolves A, PTR (reverse DNS), MX, NS, and TXT records via DNS
Fetches WHOIS registration data including registrar, creation date, expiry, and nameservers
Retrieves and parses the SSL/TLS certificate including issuer, expiry, days remaining, and Subject Alternative Names
Fetches HTTP response headers and checks for the presence or absence of key security headers with vulnerability mapping
Probes sensitive paths and reports HTTP status, exposure level, and contextual notes
Evaluates content security policy and
Fingerprints web technologies from headers and HTML including web server, CMS, JS framework, CDN, and hosting provider
If WordPress is detected, extracts the version from up to 4 sources, maps it against known CVEs, and checks for exposed endpoints
Calculates a passive risk score (0-100) derived from already collected data
Optionally runs a port scan against the resolved IP across 14 common ports
Optionally scrapes public data sources for malware, phishing, spam, and risk score data
Optionally crawls the target domain sideways up to the page limit, extracting emails, mapping internal page links, and cataloguing all referenced external domains

Screenshots

Initial prompts:

Main scan:

Sensitive path probe

Common port scan and status

Crawl results (emails, external domains, internal links):

Notes

The crawl uses a 1 second delay between requests out of politeness to the target server. On sites with many pages, hitting the 50 page cap will happen. The tool will warn you if links were detected but the crawl was stopped early due to the page limit, and you can always raise the limit (carefully).

Crawling/scraping the internet/specific websites on the internet may or may not be bad and/or wrong and/or illegal and/or against TOSs it's a subjective matter

As of Apr. 2026, this tool can scrape websites protected by Cloudflare and other CDNs, extracting emails and links.

Technology fingerprinting is based on headers and static HTML. Client-side rendered sites may not reveal their full stack this way, and some detections may be absent or inaccurate if the site actively suppresses identifying headers.

The URL threat scan scrapes public web pages rather than using an API. It may break if page structures change, in which case the tool should report the section as unavailable without affecting the rest of the scan.

Whois data availability varies. Privacy-protected domains will return minimal registrant information, and some TLDs rate-limit or straight up block whois queries entirely.

The calculated risk score is a passive system derived from your own scan data. It is not a substitute for a dedicated threat intelligence feed and should be treated as an indicator rather than a definitive assessment.

███████████████████████████████████████████████

IP OSINT Tool

Usage

python c_ip_osint.py

Optional flags

python c_ip_osint.py --save # Saves full report to JSON after scan completes

This tool supports two optional API keys for additional output

You can set these in `c_ip_osint.py`

Generate an AbuseIPDB API key and set ABUSEIPDB_API_KEY: str = ''
Generate a Shodan API key and set SHODAN_API_KEY: str = ''

Note: Shodan host lookup requires a paid Shodan membership. The free API key tier does not grant access to the /shodan/host/{ip} endpoint. The tool will still function without this key so don't worry. More detailed instructions can be found in the code. Yes it will work without this key or without any keys just less data output

Usage flow

You will be prompted for a target IP address (IPv4 or IPv6)
If the IP is private or loopback, a local interface scan runs and the tool exits
For public IPs, geolocation, ASN, routing, passive DNS, and reverse IP lookup run automatically
AbuseIPDB and Shodan enrichment run automatically if API keys are set (it's okay if they aren't)
You will be prompted whether to run a ICMP ping of 10 packets
You will be prompted whether to run a common port probe
You will be prompted whether to save the report, or pass --save to skip the prompt

Unless the terms of the website change, you should be able to test this tool with 45.33.32.156 or 2600:3c01::f03c:91ff:fe18:bb2f (same thing) which is for http://scanme.nmap.org

Important: this script can be ran 100 times per day per IP before the reverse IP section of the report may stop working, and 1,000 times per day per IP before the geocoding/ASN resolution part will stop working. This is due to Hackertarget and Ipapi free tier limits respectively.

TLDR: just don't run `c_ip_osint.py` more than 100 times in one day.

How it works

Classifies the IP (public/private/loopback, IPv4/IPv6) and performs a reverse DNS lookup
For private or loopback addresses, collects and displays local network interface info (IPs, MACs, subnet masks) and exits
Fetches geolocation, timezone, and ASN data from ipapi.co
Queries RIPEstat for BGP prefix, announcement status, RIR allocation, ASN holder, abuse contacts, and routing peers
Fetches passive DNS records from RIPEstat showing what domain names have historically resolved to the target IP, creates recursive DNS chain
If valid AbuseIPDB key is set, retrieves the abuse confidence score, total reports, distinct reporters, and usage type for the IP
Checks the target IP against the Tor Project's published bulk exit node list
Classifies the host as datacenter/cloud, residential ISP, or mobile carrier based on ASN name, org string, and BGP prefix size, with confidence level and supporting signals
Performs reverse IP lookup by querying HackerTarget, extracting TLS certificate SANs, and cross-referencing Shodan and passive DNS records to enumerate domains hosted on the target. Each discovered domain is subsequently HTTP-probed with a spoofed Host header to confirm live services, surfacing page titles and status codes.
If valid Shodan key is set, retrieves open ports, service banners, hostnames, OS, and tags from Shodan's crawl data
Optionally sends ICMP requests to measure round-trip time and jitter, will display latency and packet loss, with a note if the host appears to be filtering ICMP
Optionally probes ~24 common ports via direct TCP connect with concurrent threads, with banner grabbing on text-protocol ports Performs a multi-source reverse IP lookup by .

Screenshots

Initial scan:

ASN and DNS part of the scan:

Turns out I no longer have access to a Shodan API key, it really doesn't matter all that much not to have one though.

Reverse IP and domain mapping

ICMP pings

Port scan (and save):

Don't do this if you do not own the IP or have been explicitly authorized to scan it!

Notes

Private and loopback IPs (e.g. 192.168.x.x, 10.x.x.x, 127.0.0.1) trigger local mode only.

Geolocation precision is imperfect guesswork and varies by IP and provider. ISP-owned address blocks may only resolve to a country or region level.

Passive DNS coverage depends on what RIPEstat has observed. Many IPs, particularly those not associated with hosted domains, will return no records.

DNS chain resolution depends on a usable reverse DNS hostname. If reverse DNS returns nothing and passive DNS has no records, the chain will not be fetched.

The port probe uses short TCP connect timeouts and concurrent threads to stay fast, and results are indicative not exhaustive. Firewalls that drop rather than reject packets will cause ports to time out silently, possibly causing false negatives.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
assets		assets
.gitignore		.gitignore
LICENSE.txt		LICENSE.txt
README.md		README.md
c_ip_osint.py		c_ip_osint.py
c_phone_osint.py		c_phone_osint.py
c_url_osint.py		c_url_osint.py
exif_tool.py		exif_tool.py
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

Clear Web OSINT Toolkit - Quick start guide

Table of Contents

Disclaimer

These tools are intended for authorized (where applicable) and ethical (always) use only.

You are responsible for ensuring your use complies with all applicable laws, regulations, and terms of services.

Somewhat unsurprisingly, the author (I) does (do) not assume liability for any misuse.

Preface:

Scripts are named fairly self-descriptively, they can all be downloaded or one can pick and choose. If picking and choosing, make sure to also find and install the required dependencies from requirements.txt, they are labeled per tool.

c_phone_osint.py requires two free API keys to generate a full report, but it will work without them (just returns less data). More detailed instructions below and commented into c_phone_osint.py.

c_ip_osint.py can be supplemented with one free API key, again it will work without (just returns slightly less data). Optional paid API key integration for even more data, but this is again not needed. More detailed instructions below and commented into c_ip_osint.py.

You can run all of these without creating any API keys at all, just some of them work a little better with.

Setup

Assuming you have a recent version of python installed on your system:

Tools

Phone OSINT Tool

Usage

Optional flags

(sort of) Necessary setup for full functionality!!

This tool supports two optional API keys for additional output

You can set these in c_phone_osint.py

Usage flow

How it works

Screenshots

Report if no valid IPQS API key:

What IPQS Key adds to report:

Clearnet mention scan (only works with valid SerpAPI Key):

If more than ten results:

Notes:

Spam likelihood check with SkipCalls is far from perfect and will produce a lot of false negatives, but there are not many truly free services offering programmatic access to large datasets of known spam numbers so it will have to do until I find a better way

phonenumbers library carrier detection iffy particularly for U.S. numbers, paid services would do a better job. But would require payment.

Geocoding is Region/City level not very exact (This is expected)

███████████████████████████████████████████████

Domain (URL) OSINT Tool

Usage

Optional flags

Usage flow

How it works

Screenshots

Initial prompts:

Main scan:

Sensitive path probe

Common port scan and status

Crawl results (emails, external domains, internal links):

Notes

Crawling/scraping the internet/specific websites on the internet may or may not be bad and/or wrong and/or illegal and/or against TOSs it's a subjective matter

As of Apr. 2026, this tool can scrape websites protected by Cloudflare and other CDNs, extracting emails and links.

Technology fingerprinting is based on headers and static HTML. Client-side rendered sites may not reveal their full stack this way, and some detections may be absent or inaccurate if the site actively suppresses identifying headers.

The URL threat scan scrapes public web pages rather than using an API. It may break if page structures change, in which case the tool should report the section as unavailable without affecting the rest of the scan.

Whois data availability varies. Privacy-protected domains will return minimal registrant information, and some TLDs rate-limit or straight up block whois queries entirely.

The calculated risk score is a passive system derived from your own scan data. It is not a substitute for a dedicated threat intelligence feed and should be treated as an indicator rather than a definitive assessment.

███████████████████████████████████████████████

IP OSINT Tool

Usage

Optional flags

This tool supports two optional API keys for additional output

You can set these in c_ip_osint.py

Usage flow

Important: this script can be ran 100 times per day per IP before the reverse IP section of the report may stop working, and 1,000 times per day per IP before the geocoding/ASN resolution part will stop working. This is due to Hackertarget and Ipapi free tier limits respectively.

TLDR: just don't run c_ip_osint.py more than 100 times in one day.

How it works

Screenshots

Initial scan:

ASN and DNS part of the scan:

Reverse IP and domain mapping

ICMP pings

Port scan (and save):

Notes

Private and loopback IPs (e.g. 192.168.x.x, 10.x.x.x, 127.0.0.1) trigger local mode only.

Geolocation precision is imperfect guesswork and varies by IP and provider. ISP-owned address blocks may only resolve to a country or region level.

Passive DNS coverage depends on what RIPEstat has observed. Many IPs, particularly those not associated with hosted domains, will return no records.

DNS chain resolution depends on a usable reverse DNS hostname. If reverse DNS returns nothing and passive DNS has no records, the chain will not be fetched.

The port probe uses short TCP connect timeouts and concurrent threads to stay fast, and results are indicative not exhaustive. Firewalls that drop rather than reject packets will cause ports to time out silently, possibly causing false negatives.

A score of 0 does not guarantee an IP is clean, 100 does not mean it is necessarily dirty

About

Topics

Resources

Scripts are named fairly self-descriptively, they can all be downloaded or one can pick and choose. If picking and choosing, make sure to also find and install the required dependencies from `requirements.txt`, they are labeled per tool.

`c_phone_osint.py` requires two free API keys to generate a full report, but it will work without them (just returns less data). More detailed instructions below and commented into `c_phone_osint.py`.

`c_ip_osint.py` can be supplemented with one free API key, again it will work without (just returns slightly less data). Optional paid API key integration for even more data, but this is again not needed. More detailed instructions below and commented into `c_ip_osint.py`.

You can set these in `c_phone_osint.py`

You can set these in `c_ip_osint.py`

TLDR: just don't run `c_ip_osint.py` more than 100 times in one day.

Packages