10. Data Poisoning / Model Drift via Auto-Learning
10. Data Poisoning / Model Drift via Auto-Learning
If MCP agents fine-tune on user input without filtering, malicious prompt sequences can shift behavior over time.
Impact: Gradual normalization of insecure responses or biased outputs.
Mitigation:
- No auto-training on live, unsanitized data
- Add poisoned input detection + human-in-the-loop review