ForgeAwareness
Advanced Secure AI/Course player
0 of 6 complete0%
Module 14 min

AI Fine-Tuning and Training Risks

TL;DR

What happens when you fine-tune a model on your company data. Data leakage, model extraction, and how to stay safe.

AI Fine-Tuning and Training Risks

Fine-tuning is powerful: you train a model on your data to get better results on your specific task. It's also risky: you're exposing your company data to a third-party model provider.

Risk 1: Data Leakage in Fine-Tuning

When you fine-tune ChatGPT on your company data, OpenAI sees that data. They might use it for training. They might log it. They might be hacked.

The damage: your company's proprietary methods, customer data, trade secrets — all in a model provider's database.

Risk 2: Model Extraction

An attacker fine-tunes the model with their own data, then uses it to extract your company's training data. It's possible. It's been demonstrated in research.

Risk 3: Dependency on Third-Party Models

You fine-tune on GPT-4. OpenAI changes the pricing, the API, the terms. You're stuck. Your fine-tuned model is useless if you can't access the base model.

What to Do Instead

  • Use your approved chat platform if you need to fine-tune on your company data
  • Use open-source models (llama, mistral) that you control
  • Minimize the data you fine-tune on — use only what's necessary
  • Redact sensitive information before fine-tuning
  • Encrypt data in transit and at rest
  • Check the provider's contract — what do they do with your data?

Principle: Fine-tuning on proprietary data is powerful but risky. Only do it if you control the model or trust the provider completely.

Knowledge check

Knowledge check 1

What's the biggest risk of fine-tuning a public AI model on your company's proprietary data?