Last updated: April 26, 2026
OSINT (Open-Source Intelligence) is the reconnaissance layer of every pentest, red-team engagement, and threat-intel investigation. Done well, OSINT precedes any active scanning and produces 60-70% of the information an engagement needs. This article covers practitioner OSINT methodology in 2026 — the toolchain, the data sources, and the workflow that turns “we have a target name” into a comprehensive pre-engagement profile.
The mental model
OSINT for a pentest answers four questions:
- What does the target’s external attack surface look like? (subdomains, IPs, services, exposed admin)
- What technology stack? (CMS, frameworks, cloud provider, vendors)
- What people are publicly identifiable? (employees on LinkedIn, technical leads on GitHub)
- What recent events or context? (acquisitions, security incidents, leadership changes, breach exposure)
The practitioner toolchain
Subdomain enumeration
- Subfinder — fast passive enumeration via 30+ APIs
- amass — comprehensive (passive + active + brute-force)
- assetfinder — quick listings
- chaos.projectdiscovery.io — public dataset of subdomains
- crt.sh — certificate transparency logs (free web)
subfinder -d target.com -all -recursive -o subdomains.txt
amass enum -d target.com -active -src
echo target.com | chaos -silent
Live host discovery
- httpx — fast HTTP probe + tech detection
- naabu — fast port scanner from ProjectDiscovery
- shuffledns — DNS resolver at scale
cat subdomains.txt | httpx -title -tech-detect -status-code -o live.txt
Vulnerability surface
- nuclei — template-based vulnerability scanning, >5000 community templates
- nuclei-templates repo updated continuously
cat live.txt | nuclei -t cves/ -t exposures/ -t misconfiguration/ -severity high,critical
People & organisation
- theHarvester — emails, names, hosts, employees from public sources
- LinkedIn + Hunter.io for email patterns
- haveibeenpwned API — breach exposure for collected emails
- Maltego — graph relationships across data points
GitHub / code-search
- truffleHog / gitleaks — secrets scanning across target’s repos
- GitHub search syntax —
"target.com" filename:.env,"target.com" "AWS_SECRET" - gauplus / waybackurls — URL history from web archives
Cloud-specific
- cloudenum — enumerate cloud assets (S3 buckets, Azure blobs, GCP storage) by name patterns
- aws-cred-scanner — find leaked AWS credentials
- shodan / censys / fofa — internet-wide scan results, search for target’s exposed services
The workflow
- Domain enumeration. Subfinder + amass, dedupe.
- Live discovery. httpx + naabu against the deduplicated list.
- Tech fingerprint. httpx tech-detect, supplemented by Wappalyzer.
- Vulnerability scan. nuclei with high/critical severity templates.
- People & emails. theHarvester + LinkedIn for org chart.
- Code-side. GitHub search for target-domain references; truffleHog on any public repos.
- Cloud assets. cloudenum with target-specific naming patterns.
- Threat-intel context. haveibeenpwned for collected emails; Shodan for exposed services; news search for recent events.
- Synthesise. Build a target profile document with attack surface, tech stack, people of interest, exposure highlights.
Tooling automation
Modern tools chain these into pipelines:
- reconftw — full-stack OSINT and vuln scanning automation
- autorecon — comprehensive enumeration with reporting
- osmedeus — modular reconnaissance framework
Run these for breadth; supplement with manual investigation for depth.
Defender perspective
- Run OSINT against your own organisation. Findings are your external attack surface.
- Subscribe to attack-surface management products (Censys ASM, Microsoft Defender External ASM, BitSight)
- Hunt for leaked credentials in code repositories continuously
- Train developers on secret-management patterns to reduce GitHub leakage
Compliance angle
- SEBI CSCRF — external attack surface awareness expected
- RBI — third-party risk visibility includes vendor OSINT exposure
- DPDP §8(5) — leaked credentials creating personal-data exposure are reasonable-security failures
The takeaway
OSINT before active testing produces 60-70% of an engagement’s intelligence at zero detection cost. The toolchain has matured — ProjectDiscovery suite, modern automation frameworks, cloud-specific enumeration. Every red-team engagement and self-audit should start with a comprehensive OSINT phase. The findings often surprise the target organisation more than the active scanning does.
Get a VAPT scoping call
Senior practitioner-led VAPT — not a checklist run by juniors. CVSS-scored findings, free retest, attestation letter. India's SMBs and SaaS teams.