DNSweeper – Asynchronous Public DNS Auditing Tool

Last Release: 01/10/2019    


Introduction

DNSweeper is a DNS auditing tool that uses asynchronous libraries to query and extract DNS data from public resolvers around the globe. Written in Python, it is designed to extend what you know about domains and subdomains (it works best if you feed it already-scraped subdomains).


As Petr Javorik, the tool's author, mentioned, the idea for this tool came from MassDNS (another DNS bruteforce tool). Initially he just wanted to select reliable public resolvers and feed them to MassDNS:

“You have to feed it with reliable public resolvers, otherwise you will spend significant time cleaning up results. As time went by, I got the idea that I could integrate results from vertical DNS enumeration into other scripting tools without using awk, sed, grep, etc. So I made all results from DNSweeper available as JSON.”

As we’re all aware, some open resolvers (frequently related to Level3 DNS) return results even for non-existent domains (a marketing trick to capitalize on your traffic). So, the main advantage of DNSweeper is that it filters out such “false” positives:

“No subdomains pointing to empty A records are returned. Results from censoring resolvers and other “bad” resolvers are filtered out.

It resolves the enumerated domain (only ONE) via TRUSTED resolvers, which yields a set of trusted A records. It then queries all public resolvers for the enumerated domain and extracts the A records from each of them. Next, it checks whether the A record(s) returned by each particular public resolver belong to the results from the trusted resolvers. If so, that public resolver moves on to the next filtering phase.”

Filtering

“The next filtering phase consists of querying a random subdomain of the enumerated domain (njvwlsjdngbitkep.test-domain.com). If a resolver returns a valid A record, it is discarded.”

With this approach Petr was able to extract roughly 10,000 public resolvers for subsequent bruteforcing and other commands. In later testing phases he noticed that this was still not enough; bruteforcing still produced a few false positives. He solved it with a final fine-filtering step:

“All extracted subdomains are finally queried via trusted resolvers only; if no A records are returned, the subdomain is discarded.”

Filtering resolvers is ongoing research, and he’s very much interested in hearing other ideas on how to improve it, so feel free to contribute. The tool was tested on OS X and Linux, and, as Javorik mentioned, Windows testers are more than welcome.
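Putting the three phases together, here is a minimal sketch of that filtering logic using aiodns (one of DNSweeper's dependencies, see the install section below). The trusted-resolver list and all names are illustrative assumptions, not DNSweeper's actual internals:

import asyncio
import random
import string

import aiodns

# Example trusted resolvers - an assumption, not DNSweeper's actual list.
TRUSTED = ['1.1.1.1', '8.8.8.8']

async def a_records(nameserver, name):
    # Return the set of A records for 'name' as seen by a single resolver.
    resolver = aiodns.DNSResolver(nameservers=[nameserver], timeout=3)
    try:
        return {r.host for r in await resolver.query(name, 'A')}
    except aiodns.error.DNSError:
        return set()

async def filter_resolvers(domain, candidates):
    # Phase 1: build the trusted A-record set for the enumerated domain.
    trusted = set()
    for ns in TRUSTED:
        trusted |= await a_records(ns, domain)

    # Phase 2: keep only resolvers whose answers lie within the trusted set.
    kept = []
    for ns in candidates:
        answers = await a_records(ns, domain)
        if answers and answers <= trusted:
            kept.append(ns)

    # Phase 3: discard resolvers that "resolve" a random, non-existent
    # subdomain (censoring or ad-injecting resolvers).
    probe = ''.join(random.choices(string.ascii_lowercase, k=16)) + '.' + domain
    return [ns for ns in kept if not await a_records(ns, probe)]

if __name__ == '__main__':
    print(asyncio.run(filter_resolvers('test-domain.com', ['9.9.9.9'])))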

Two-stage bruteforce command

Another neat feature of this tool is the two-stage bruteforce command: DNSweeper first bruteforces the given domain (with a huge wordlist) and then recursively bruteforces the previously discovered subdomains (with a smaller wordlist, given via the --bruteforce-recursive option). If the “--bruteforce-recursive FILE” flag is omitted, no recursive bruteforce is performed.
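A hypothetical invocation (the wordlist paths are placeholders, and the -d/-p flags are assumed to mirror those in the enumerate command's help shown below):

$ python3.6 DNSweeper.py bruteforce -d test-domain.com -p huge_wordlist.txt --bruteforce-recursive small_wordlist.txt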

Petr also mentioned that it’s always better to feed DNSweeper a file with already-scraped subdomains. Those subdomains can be hosted on IPs and subnets that would never be discovered via bruteforcing; by not including them, we can miss some ASN records and related subnets. Reverse ASN lookup may not be effective for domains hosted on cloud services, but it can otherwise provide a substantial number of subdomains. It’s definitely worth a try.
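The ASN records DNSweeper reports (see asn_reverse_lookup_asn.json below) follow Team Cymru's format. As an illustration of where such data can come from (this is not DNSweeper's code), here is a minimal origin-ASN lookup via Team Cymru's DNS interface, again using aiodns:

import asyncio

import aiodns

async def origin_asn(ip):
    # Team Cymru's IP-to-ASN zone expects the IPv4 octets reversed:
    # 192.0.79.33 -> 33.79.0.192.origin.asn.cymru.com
    reversed_ip = '.'.join(reversed(ip.split('.')))
    resolver = aiodns.DNSResolver()
    answers = await resolver.query(reversed_ip + '.origin.asn.cymru.com', 'TXT')
    # TXT payload looks like: "2635 | 192.0.79.0/24 | US | arin | 2012-11-20"
    return answers[0].text

if __name__ == '__main__':
    print(asyncio.run(origin_asn('192.0.79.33')))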

Install

Clone DNSweeper repo:

$ git clone https://github.com/MMquant/DNSweeper.git

Install python headers (needed for pycares):

$ apt install python3.7-dev

As for the requirements, you’ll need aiodns and requests. Install them from requirements.txt:

$ pip3 install -r requirements.txt

DNSweeper Usage

Use --help to list available options:

$ python3.6 DNSweeper.py --help
usage: DNSweeper.py [-h]
                    {enumerate,resolvers,bruteforce,forward_lookup,asn_reverse_lookup,update_resolvers}

DNSweeper - Asynchronous public DNS auditing tool

optional arguments:
  -h, --help            show this help message and exit

commands:
  {enumerate,resolvers,bruteforce,forward_lookup,asn_reverse_lookup,update_resolvers}
                        Description
    enumerate           Perform complete enumeration
    resolvers           Output filtered resolvers
    bruteforce          Bruteforce subdomains
    forward_lookup      Perform forward lookup
    asn_reverse_lookup  Perform ASN reverse lookup
    update_resolvers    Update raw resolver list

Sub-options:

$ python3.6 DNSweeper.py enumerate --help
usage: DNSweeper.py enumerate [-h] (-f FILE | -d DOMAIN) [-p FILE] [-r REGEX]
                              [-o DIR_PATH] [--use-cache] [--fast-sweep]
                              [--no-bruteforce] [--bruteforce-recursive FILE]
                              [--exclude FILE] [-v]

The enumerate command prepares filtered public resolvers for the given input
domain. It performs bruteforce subdomain discovery. Then it forward-lookups
each subdomain in all filtered resolvers and extracts corresponding ASN
netblocks. Next it performs a reverse lookup on all IPs from the discovered
ASN netblocks and optionally filters the output with the given REGEX.

optional arguments:
  -h, --help            show this help message and exit
  -f FILE               Path to file with (scraped) subdomains
  -d DOMAIN             Domain (ie. test_domain.com)
  -p FILE               Path to file with bruteforce payload
  -r REGEX              Reverse lookup regex matching pattern
  -o DIR_PATH           Path to directory with results
  --use-cache           Use cached resolvers
  --fast-sweep          Don't sweep all resolvers. For every subdomain return
                        just first valid resolver answer.
  --no-bruteforce       Don't use bruteforce command.
  --bruteforce-recursive FILE
                        Enable recursive bruteforce. Path to payload file must
                        be specified. Use smaller wordlists.
  --exclude FILE        File with subdomains which not to enumerate. (improves
                        speed)
  -v                    Verbosity, -v, -vv, -vvv
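A typical run against a list of scraped subdomains might then look like this (the file names are placeholders):

$ python3.6 DNSweeper.py enumerate -f scraped_subdomains.txt -p wordlist.txt --bruteforce-recursive small_wordlist.txt -o results/ -vv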

Result structure

A results/ directory is created in the current working directory; it contains files with the output of the given command.

results/
├── asn_reverse_lookup_all_ptr.json
├── asn_reverse_lookup_asn.json
├── asn_reverse_lookup_regex_ptr.json
├── bruteforce_result.json
├── filtered_resolvers_result.json
├── enumerate_unique_subdomains.json
├── forward_lookup_result.json
└── forward_lookup_unique_ips.json
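Since everything is plain JSON, feeding the results into other tooling needs no awk/sed/grep, which was the original motivation. A minimal Python example (the file name comes from the tree above; its structure is shown in the examples below):

import json

# forward_lookup_unique_ips.json is a plain JSON list of IP strings.
with open('results/forward_lookup_unique_ips.json') as f:
    ips = json.load(f)

for ip in ips:
    print(ip)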

Performance and Examples

The latest commits include some performance optimization tips and tricks; they require root or admin privileges, so every user should apply them manually before using DNSweeper (see the Optimization section below).

The requests-per-second rate is printed to stdout when -vv or -vvv verbosity is selected. There are basically just two speeds in DNSweeper: with --fast-sweep and without it.

“Resolving speed has high variance and it strongly depends on ISP, NIC and overall network traffic. In my lab I can achieve 120-240 req/s in no --fast-sweep mode and 500-1200 req/s in --fast-sweep mode.”

We’ll try “newyorktimes.com” (saved in a file named dom_r1). Without any optimizations, on our test server (2 CPUs, 8 GB RAM, 250 Mbps):

$ time python3.6 DNSweeper.py enumerate -f dom_r1 -vv --no-bruteforce --use-cache
  • Reverse lookup for 256 IPs / Chunk Size: 24 / Chunk Size List: 11
  • Forward Lookup: 63 req/s
  • ASN Reverse Lookup: between 5 and 130 req/s
  • Total time: 49s

results/forward_lookup_result.json:

[
    {
        "name": "newyorktimes.com",
        "A": [
            "192.0.79.32",
            "192.0.79.33"
        ]
    }
]

results/forward_lookup_unique_ips.json:

[
    "192.0.79.32",
    "192.0.79.33"
]

results/enumerate_unique_subdomains.json:

[]

results/asn_reverse_lookup_asn.json:

[
    {
        "AS": "2635",
        "IP": "192.0.79.33",
        "BGP Prefix": "192.0.79.0/24",
        "CC": "US",
        "Registry": "arin",
        "Allocated": "2012-11-20",
        "Info": "AUTOMATTIC - Automattic, Inc, US"
    }
]

Another quick test: for “blic.rs”, Robtex shows a number of its subdomains, which we saved to a file named subdomains:

  • baneri.blic.rs
  • english.blic.rs
  • gameplanet.blic.rs
  • lillydrogerie.blic.rs
  • mail.blic.rs
  • sport.blic.rs
  • trak-analytics.blic.rs
  • vremeplov.blic.rs
  • www.blic.rs
  • zena.blic.rs
$ time python3.6 DNSweeper.py enumerate -f subdomains -vv --use-cache
  • Subdomains payload: 100k [data/bitquark-subdomains-top100K.txt.ascii]
  • Calculated chunk_size list: 4167
  • Average speed: 6 req/s (8000 processed req / 20 min)

Optimization

In order to get DNSweeper working properly, you should tune up your OS so that it can reliably handle thousands of outgoing TCP connections. The usual system defaults are:

net.ipv4.ip_local_port_range = 32768   61000
net.ipv4.tcp_fin_timeout = 60

This basically means your system cannot consistently sustain more than (61000 - 32768) / 60 ≈ 470 sockets per second.

Reduce the time a socket is held (reference):

$ sysctl net.ipv4.tcp_fin_timeout=30

Increase the local port range used by TCP and UDP to choose local ports:

$ sysctl net.ipv4.ip_local_port_range="15000 61000"

With that you should see over 1500 outbound connections per second ((61000 - 15000) / 30 ≈ 1533).

The default sysctl value for tcp_tw_reuse on a typical Linux box is:

net.ipv4.tcp_tw_reuse=0

This does not allow a connection to reuse a “used” socket (one in TIME_WAIT state) and forces sockets to wait out the complete TIME_WAIT cycle. Recommended:

sysctl net.ipv4.tcp_tw_reuse=1 

This allows sockets in TIME_WAIT state to be cycled and reused quickly.
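Note that values set with sysctl at runtime are lost on reboot. To make them persistent (standard Linux practice, nothing DNSweeper-specific), add them to /etc/sysctl.conf and reload:

net.ipv4.tcp_fin_timeout = 30
net.ipv4.ip_local_port_range = 15000 61000
net.ipv4.tcp_tw_reuse = 1

$ sysctl -p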

Edit the /etc/security/limits.conf file by adding:

<your_username>    soft    nofile    65535
<your_username>    hard    nofile    65535

You might need to restart your system (or log out and back in) after making these changes. To check whether you can open enough concurrent TCP connections, run

$ ulimit -Hn

which should be >=25000.

Conclusion

The reliability DNSweeper offers comes at a high cost: speed. It carefully works its way through DNS waters, so if you need to scan a large number of domains/subdomains fast, this probably isn’t the tool for you.
