10. Data Poisoning / Model Drift via Auto-Learning

10. Data Poisoning / Model Drift via Auto-Learning

If MCP agents fine-tune on user input without filtering, malicious prompt sequences can shift behavior over time.

Impact: Gradual normalization of insecure responses or biased outputs.

Mitigation: 

  • No auto-training on live, unsanitized data
  • Add poisoned input detection + human-in-the-loop review
ON THIS PAGE