MCPThreatHive: Automated Threat Intelligence for MCP Ecosystems

Introduction

MCPThreatHive is an attempt to systematize security for Model Context Protocol (MCP) ecosystems by turning it from a set of separate tools into a full-fledged pipeline for vulnerability discovery.

Unlike classic tools, the system is not focused on one-off checks of MCP servers, but on continuous threat intelligence collection, automatic classification, and building relationships between threats.

The main idea is to treat MCP security as a dynamic environment where threats constantly appear, combine with each other, and require ecosystem-level analysis rather than analysis of individual components.

The paper highlights three fundamental limitations of existing MCP security solutions:

Compositional attack modeling is not taken into account
Attacks emerge not from one tool, but from their combination.
No continuous vulnerability scanning
Most tools operate as one-time analysis at a specific point in time.
No unified classification
Separate frameworks are used, but there is no unified layer.

MCPThreatHive is positioned as a system that closes all three gaps at the same time.

Architecture

The system is built as a four-stage pipeline:

Intelligence Gathering
Data collection from:
- RSS (ArXiv, security blogs)
- NVD
- GitHub
- Web
AI Threat Analysis
LLM:
- classifies threats (MCP-38)
- maps them to STRIDE and OWASP
- assesses risk
Structured Storage
- SQL database
- Knowledge graph (Neo4j)
Visualization & Risk Planning
- Threat matrix
- 3D visualization
- Risk planning

In practice, this architecture reflects a transition from a scanner to an analytics platform, while the paper itself highlights a key MCP feature: the attack happens not at the code level, but at the semantic level.

“System architecture”

The main attack classes are also identified:

Indirect Prompt Injection
Tool Description Poisoning
Parasitic Tool Chains
Preference Manipulation
Dynamic Mutation

The key idea is that an LLM makes decisions based on text, therefore any text can become an attack.

MCP-38

One of the main advantages of the system is its use of the MCP-38 taxonomy, specifically the “MCP-17” model (parasitic tool chains):

The attack is divided into stages:

Ingestion (instruction injection)
Collection (data collection)
Disclosure (exfiltration)

This makes it possible to analyze tool chains rather than individual calls, which is critical for an agentic system.

Knowledge Graph

Instead of classic tables, a graph is used:

Node types:

Threat
Tool
CVE
Mitigation
Intelligence Item

Edge types:

EXPLOITS
CHAINS_INTO
MITIGATED_BY

This makes it possible to find attacks in several steps, analyze dependencies, and build attack chains, which fully justifies the graph-based approach.

Scoring

The following is used to calculate the “weight” of an attack:

Factors are calculated:
- Impact
- Success rate
- Persistence
- Ease of exploitation
MCP-specific multipliers are introduced:
- semantic attacks
- multi-tool chains
- low observability
The final score is derived from 0 to 10:
- Low
- Medium
- High
- Critical

Limitations

The paper explicitly acknowledges weaknesses:

- hallucinations
- classification errors
- instability
- a large output budget is required (about 12k tokens)
- problems with non-standard sources
- multilingual data
- aggressive descriptions may be treated as attacks

Summary

MCPThreatHive is a strong conceptual step toward systematizing MCP threats and analyzing compositional attacks. Moving to a graph-centric security model gives an obvious advantage for building multi-component attacks. The development and use of a custom threat model, MCP-38, is also worth noting, as it creates a foundation for deep and long-term study of this area.

However, the early stage of the repository and its high dependence on LLMs introduce their own limitations and are unlikely to allow the current system to reach the level of a runtime protection mechanism or a full product solution.

The project is well suited for research teams, security architects, and threat modeling tasks. In practice, it should be used as an audit tool on top of existing solutions.