How to Build a Threat Hunting Program from Scratch
Reactive detection — waiting for your SIEM to fire an alert — misses a predictable class of threats: patient adversaries who operate within alert thresholds, living-off-the-land attackers who blend into normal traffic, and compromised insiders who use legitimate access. Threat hunting exists to find these threats before they cause damage.
But most 'threat hunting' in practice is ad hoc: an analyst searches for something suspicious, finds nothing, and moves on. There is no structured hypothesis, no repeatable workflow, and no way to tell whether an empty result means nothing was there or the hunt was inadequate.
This guide covers what a real hunting program looks like: structured methodology, data requirements, tooling, team structure, and the metrics that tell you whether your program is actually improving detection.
Prerequisites: Data Sources and Telemetry Requirements
You cannot hunt for what you cannot see. Before investing in a hunting program, assess your telemetry coverage against the data sources required to detect the techniques you plan to hunt for.
The minimum viable telemetry baseline for an effective hunting program includes: endpoint process execution logs with command-line arguments (Sysmon event ID 1, or EDR process telemetry), network connections with process-to-connection mapping (Sysmon event ID 3), authentication events with source IP and target system (Windows Security events 4624, 4625, and 4648), PowerShell script block logging (event ID 4104), DNS query logs with source host, and authentication events from your identity provider (Okta, Azure AD, etc.).
Map your available data sources against ATT&CK's data source model before defining your hunting scope. If your telemetry does not include process access events (Sysmon event ID 10), you cannot reliably hunt for credential dumping from LSASS. If you do not have DNS query logs correlated to endpoints, you cannot hunt for C2 beaconing via DNS. The data sources you are missing define the techniques you cannot hunt for — and therefore the gaps in your coverage that hunting cannot fill.
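As a sketch, this gap analysis can be expressed as a simple lookup. The event IDs follow the examples above; the technique selection, source keys, and function name are illustrative assumptions, not a standard mapping.

```python
# Illustrative sketch: map ATT&CK techniques to the telemetry they require,
# then report which techniques fall outside hunting scope given the sources
# you actually have. Source keys and the function name are hypothetical.

REQUIRED_SOURCES = {
    "T1003.001 LSASS credential dumping":    {"sysmon_10"},           # process access
    "T1071.004 C2 over DNS":                 {"dns_query_per_host"},  # DNS tied to endpoint
    "T1078 Valid Accounts lateral movement": {"win_4624"},            # logon events
    "T1059.001 PowerShell abuse":            {"ps_4104"},             # script block logging
}

def huntable(available):
    """Split techniques into in-scope and out-of-scope for hunting."""
    in_scope, gaps = [], []
    for technique, needed in REQUIRED_SOURCES.items():
        (in_scope if needed <= available else gaps).append(technique)
    return in_scope, gaps

in_scope, gaps = huntable({"win_4624", "ps_4104"})
# gaps now lists the techniques this telemetry cannot support hunting for
```

The out-of-scope list doubles as a telemetry roadmap: each entry names the data source whose onboarding would expand hunting coverage.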
Building a Hypothesis-Driven Hunting Methodology
A hunting hypothesis is a falsifiable statement about adversary behavior that you can test against your data. 'Check if there is any malware' is not a hypothesis. 'Adversaries using T1078 (Valid Accounts) for lateral movement would authenticate to multiple workstations within a short time window using the same account — let's search for accounts with more than 5 distinct authentication targets within 60 minutes' is a hypothesis.
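The lateral-movement hypothesis above can be tested with a short analytic. This is a minimal plain-Python sketch of the logic a SIEM query would express; the event shape (account, target host, timestamp) and function name are assumptions for illustration.

```python
# Sketch of the hypothesis test: flag accounts that authenticate to more
# than THRESHOLD distinct targets within any WINDOW-second span.
from collections import defaultdict

WINDOW = 60 * 60   # 60-minute window, in seconds
THRESHOLD = 5      # distinct authentication targets

def flag_accounts(events):
    """events: iterable of (account, target_host, epoch_seconds) tuples."""
    by_account = defaultdict(list)
    for account, target, ts in events:
        by_account[account].append((ts, target))
    flagged = set()
    for account, items in by_account.items():
        items.sort()
        for i, (start_ts, _) in enumerate(items):
            # distinct targets reached within WINDOW of this event
            targets = {t for ts, t in items[i:] if ts - start_ts <= WINDOW}
            if len(targets) > THRESHOLD:
                flagged.add(account)
                break
    return flagged

# Hypothetical data: one account sweeping six workstations in five minutes.
suspects = flag_accounts([("svc-backup", f"ws{i}", i * 60) for i in range(6)])
```

In production the same logic lives in your SIEM query language; the point is that the hypothesis maps directly to a concrete, falsifiable analytic.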
Effective hypotheses come from three sources.

Threat intelligence: what techniques are threat actors targeting your industry currently using? IOCs and TTPs from recent campaigns against your sector become hypothesis inputs.

ATT&CK coverage gaps: what techniques in your ATT&CK coverage map are you not detecting with automated rules? Hunt for those manually.

Environmental knowledge: what would anomalous behavior look like in your specific environment? A hunt that works for a financial services company may not be relevant for a manufacturing organization where OT system communication patterns are completely different.
Document hypotheses before executing hunts. Each hunt entry should include: the hypothesis statement, the ATT&CK technique it targets, the data sources required, the query or analytic used, the timeframe searched, the findings (including true negative conclusions), and any new automated detections created from the hunt results. This documentation creates an audit trail and prevents the same hunt from being run repeatedly by different analysts without building on prior work.
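The hunt log described above can be kept as structured records rather than free-form notes. A minimal sketch, where the field names mirror the checklist in the text and the class itself is illustrative:

```python
# Illustrative hunt-log record; fields follow the documentation checklist
# described in the text. The class and sample values are hypothetical.
from dataclasses import dataclass, field

@dataclass
class HuntEntry:
    hypothesis: str
    attack_technique: str            # e.g. "T1078"
    data_sources: list
    query: str                       # the analytic actually executed
    timeframe: str                   # period of data searched
    findings: str                    # include true-negative conclusions
    new_detections: list = field(default_factory=list)

entry = HuntEntry(
    hypothesis="Same account authenticates to >5 workstations within 60 minutes",
    attack_technique="T1078",
    data_sources=["win_4624"],
    query="<SIEM query used for the hunt>",
    timeframe="last 90 days",
    findings="No matches; telemetry confirmed complete for the window",
)
```

Keeping entries structured makes the later program metrics (coverage rate, conversion rate) trivially computable from the log itself.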
Tooling and Analyst Skill Requirements
Threat hunting does not require specialized tools beyond what most mature security programs already have. The minimum toolset is: a SIEM or data lake with sufficient retention (90 days minimum, 12 months preferred), an EDR with rich process and network telemetry, and a query interface that supports complex analytical queries.
For SIEM-based hunting, Splunk's Search Processing Language (SPL), Elastic's Event Query Language (EQL), and Microsoft Sentinel's Kusto Query Language (KQL) all support the types of behavioral analytics required for structured hunting. EQL deserves specific mention for endpoint hunting: its sequence operator allows queries that express multi-event attack chains (event A followed by event B on the same host within a time window) that are difficult to express in other query languages.
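To make the sequence idea concrete, here is a plain-Python sketch of the behavior EQL's sequence operator provides (event A followed by event B on the same host within a time window). This mimics the semantics for illustration only; the event shape and predicate signatures are assumptions, not Elastic's implementation.

```python
# Sketch of an EQL-style "sequence by host with maxspan": report hosts where
# an event matching pred_a is followed by one matching pred_b within max_span.
def sequence_by_host(events, pred_a, pred_b, max_span):
    """events: dicts with 'host' and 'ts' keys; predicates take an event dict."""
    pending = {}   # host -> timestamp of most recent A-event
    hits = set()
    for ev in sorted(events, key=lambda e: e["ts"]):
        host = ev["host"]
        if pred_a(ev):
            pending[host] = ev["ts"]
        if pred_b(ev) and host in pending and ev["ts"] - pending[host] <= max_span:
            hits.add(host)
    return hits

# Hypothetical chain: Office app spawns PowerShell, then an outbound connection.
evts = [
    {"host": "h1", "ts": 0,  "proc": "winword.exe->powershell.exe"},
    {"host": "h1", "ts": 30, "proc": "powershell.exe", "net": True},
    {"host": "h2", "ts": 0,  "proc": "powershell.exe", "net": True},
]
hits = sequence_by_host(
    evts,
    pred_a=lambda e: "->powershell" in e.get("proc", ""),
    pred_b=lambda e: e.get("net", False),
    max_span=60,
)
```

Host h2 has the outbound connection but no preceding spawn event, so only h1 matches; that ordering constraint is exactly what flat per-event queries struggle to express.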
Analyst skill requirements are the more significant constraint for most programs. Effective hunters need: proficiency with the chosen query language, deep knowledge of Windows, macOS, or Linux internals (depending on hunting scope), familiarity with attacker tools and techniques (Mimikatz, Cobalt Strike, LOLBAS), and the ability to distinguish anomalous from merely unusual in your specific environment. These skills take months to develop. Plan for six to twelve months of structured development time for analysts transitioning from reactive alert triage to proactive hunting.
Measuring Hunting Program Effectiveness
The most common failure mode in hunting programs is operating without metrics — running hunts, finding nothing, and declaring that nothing was found without being able to distinguish 'nothing was there' from 'the hunt was not thorough enough.'
Track four metrics for hunting program health.

Hunt coverage rate: what percentage of your ATT&CK coverage gap has been hunted at least once in the past quarter? This measures whether the program is systematically working through priorities or just hunting opportunistically.

Hypothesis-to-detection conversion rate: what percentage of completed hunts result in a new automated detection rule? Low conversion rates indicate either excellent existing coverage (unlikely for a new program) or hunts that are not generating actionable findings.

Mean dwell time before hunt detection: when a hunt does uncover active adversary activity, how long had the adversary been present before detection? Trending this metric downward over time demonstrates program impact.

False negative audit rate: periodically run known-malicious techniques in a controlled test environment and check whether your detections fire. If hunts repeatedly come up clean for techniques where test executions confirm the detection fires, your telemetry coverage is solid; if test executions reveal detection gaps, the hunt findings are unreliable.
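The first two metrics fall directly out of a structured hunt log. A minimal sketch, where the field names and sample data are illustrative and only the formulas follow the definitions above:

```python
# Sketch of two hunting-program metrics; record shapes are hypothetical.

def coverage_rate(gap_techniques, hunted_techniques):
    """Share of the ATT&CK coverage gap hunted at least once this quarter."""
    gap = set(gap_techniques)
    if not gap:
        return 1.0
    return len(gap & set(hunted_techniques)) / len(gap)

def conversion_rate(hunts):
    """Share of completed hunts that produced a new automated detection."""
    done = [h for h in hunts if h["status"] == "complete"]
    if not done:
        return 0.0
    return sum(1 for h in done if h["new_detections"]) / len(done)

# Hypothetical quarter: four gap techniques, two hunted; two hunts completed,
# one of which yielded a new detection rule.
rate = coverage_rate(["T1003", "T1071", "T1078", "T1059"], ["T1078", "T1059"])
conv = conversion_rate([
    {"status": "complete", "new_detections": ["rule-001"]},
    {"status": "complete", "new_detections": []},
])
```

Both values land at 0.5 for this sample quarter, which is the kind of number worth trending rather than reading in isolation.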
Report hunting program metrics to security leadership quarterly alongside detection program metrics. Hunting ROI is demonstrated by showing that proactive hunts uncovered threats that automated detections would have missed — and by trending that gap over time as hunting findings drive new detection development.
The bottom line
A hunting program produces two outputs: findings from current adversary activity in your environment, and improved automated detection coverage for the future. Teams that treat hunting as only the former miss most of the long-term value. Every successful hunt should close a gap in your automated detection layer so that the same technique does not require manual hunting next time. Over 12 to 24 months, a well-run program systematically eliminates its own highest-value work by converting hunt findings into detections.
Frequently asked questions
What is the difference between threat hunting and threat detection?
Threat detection is reactive: automated rules and analytics alert on known-bad patterns. Threat hunting is proactive: analysts actively search for adversary activity using behavioral hypotheses, looking specifically for threats that automated detections would miss. Hunting operates on the assumption that adversaries may already be present in the environment without having triggered any alerts. The two capabilities are complementary — hunting findings should continuously improve automated detection coverage.
How much time should analysts spend on threat hunting vs alert triage?
For most SOC teams, the right ratio depends on alert volume and existing detection coverage maturity. Teams with high alert volume and immature detection coverage should invest in improving detection quality before building a hunting program — hunting on top of alert fatigue is not sustainable. For teams with manageable alert queues and mature detection coverage, allocating 20 to 30% of senior analyst time to structured hunting is a reasonable starting point. This percentage can increase as hunting drives automation that reduces alert triage burden.
What data retention do you need for threat hunting?
90 days is the minimum retention needed to hunt for threats with normal dwell times. 12 months is the preferred retention for hunting APT-style intrusions with long dwell times and retroactive IOC-based hunting after new threat intelligence is received. 24 months of retention allows hunting against newly published CVE exploitation techniques to check whether exploitation occurred before the public disclosure. Retention cost modeling should account for different tiers: keep high-fidelity telemetry (endpoint, identity, DNS) in hot storage and route lower-value sources to cold storage queried only on demand.
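The tiering trade-off can be sketched with a toy cost model. The per-GB prices and volumes below are made-up assumptions for illustration; only the hot/cold split follows the text.

```python
# Toy steady-state retention cost model: data retained for hot_days sits in
# hot storage, the next cold_days in cold storage. Prices are per GB-month
# and entirely hypothetical.
def monthly_storage_cost(gb_per_day, hot_days, cold_days, hot_price, cold_price):
    """Steady-state monthly bill: retained volume per tier times tier price."""
    return gb_per_day * (hot_days * hot_price + cold_days * cold_price)

# Hypothetical example: 50 GB/day, 90 days hot + 275 days cold (~12 months).
cost = monthly_storage_cost(gb_per_day=50, hot_days=90, cold_days=275,
                            hot_price=0.10, cold_price=0.01)
```

Even with invented prices, the shape of the result holds: the cold tier carries three times the data for a fraction of the cost, which is what makes 12-month retention feasible for retroactive IOC hunts.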
Do I need a dedicated threat hunting team or can analysts hunt part-time?
Most organizations start with part-time hunting by senior analysts and grow into dedicated hunters as the program matures and proves value. Dedicated full-time hunters are justified when the organization has sufficient telemetry breadth and depth to generate meaningful hunting hypotheses continuously, when alert triage is sufficiently automated that senior analysts have dedicated time available, and when hunting findings are consistently producing new detections rather than clean results. Part-time hunting programs that lack structured methodology and documentation often produce neither findings nor program improvements.