ForgeAwareness
Advanced Secure AI/Course player
0 of 6 complete0%
Module 13 min

AI Fine-Tuning and Training Risks

TL;DR

What happens when you fine-tune a model on proprietary data. Data leakage and guardrails.

AI Fine-Tuning and Training Risks

Fine-tuning is powerful: you train a model on your data to get better results on your specific task. It's also risky: you're exposing your data to a third-party model provider.

Risk 1: Data Leakage in Fine-Tuning

When you fine-tune ChatGPT on your proprietary data, OpenAI sees that data. They might use it for training. They might log it. They might be hacked.

The damage: Your proprietary methods, customer data, trade secrets—all in a model provider's database.

Risk 2: Model Extraction

An attacker fine-tunes the model with their own data, then uses it to extract your training data. It's possible and has been demonstrated in research.

Risk 3: Dependency on Third-Party Models

You fine-tune on GPT-4. OpenAI changes the pricing, the API, the terms. You're stuck. Your fine-tuned model is useless if you can't access the base model.

What to Do Instead

  • Use internal models if you need to fine-tune on proprietary data
  • Use open-source models (llama, mistral) that you control
  • Minimize the data you fine-tune on—use only what's necessary
  • Redact sensitive information before fine-tuning
  • Encrypt data in transit and at rest
  • Check the provider's contract—what do they do with your data?

Principle: Fine-tuning on proprietary data is powerful but risky. Only do it if you control the model or trust the provider completely.

Knowledge check

Knowledge check 1

What's the biggest risk of fine-tuning a public AI model on your proprietary data?