https://openai.com/blog/gpt-3-5-turbo-fine-tuning-and-api-updates Close [ ] Search Submit Skip to main content Site Navigation * Research + Overview + Index * Product + Overview + ChatGPT + GPT-4 + DALL*E 2 + Customer stories + Safety standards + API data privacy + Pricing * Developers + Overview + Documentation + API reference + Examples * Safety * Company + About + Blog + Careers + Charter + Security Search Navigation quick links * Log in * Sign up Menu Mobile Navigation Close Site Navigation Research Product Developers Safety Company Quick Links * Log in * Sign up [ ] Search Submit GPT-3.5 Turbo fine-tuning and API updates Developers can now bring their own data to customize GPT-3.5 Turbo for their use cases. Gpt 3.5 Turbo Fine Tuning And Api Updates Illustration: Ruby Chen August 22, 2023 Authors * Andrew Peng * Michael Wu * John Allard * Logan Kilpatrick * Steven Heidel Fine-tuning for GPT-3.5 Turbo is now available, with fine-tuning for GPT-4 coming this fall. This update gives developers the ability to customize models that perform better for their use cases and run these custom models at scale. Early tests have shown a fine-tuned version of GPT-3.5 Turbo can match, or even outperform, base GPT-4-level capabilities on certain narrow tasks. As with all our APIs, data sent in and out of the fine-tuning API is owned by the customer and is not used by OpenAI, or any other organization, to train other models. Fine-tuning use cases Since the release of GPT-3.5 Turbo, developers and businesses have asked for the ability to customize the model to create unique and differentiated experiences for their users. With this launch, developers can now run supervised fine-tuning to make this model perform better for their use cases. In our private beta, fine-tuning customers have been able to meaningfully improve model performance across common use cases, such as: * Improved steerability: Fine-tuning allows businesses to make the model follow instructions better, such as making outputs terse or always responding in a given language. For instance, developers can use fine-tuning to ensure that the model always responds in German when prompted to use that language. * Reliable output formatting: Fine-tuning improves the model's ability to consistently format responses--a crucial aspect for applications demanding a specific response format, such as code completion or composing API calls. A developer can use fine-tuning to more reliably convert user prompts into high-quality JSON snippets that can be used with their own systems. * Custom tone: Fine-tuning is a great way to hone the qualitative feel of the model output, such as its tone, so it better fits the voice of businesses' brands. A business with a recognizable brand voice can use fine-tuning for the model to be more consistent with their tone. In addition to increased performance, fine-tuning also enables businesses to shorten their prompts while ensuring similar performance. Fine-tuning with GPT-3.5-Turbo can also handle 4k tokens--double our previous fine-tuned models. Early testers have reduced prompt size by up to 90% by fine-tuning instructions into the model itself, speeding up each API call and cutting costs. Fine-tuning is most powerful when combined with other techniques such as prompt engineering, information retrieval, and function calling. Check out our fine-tuning guide to learn more. Support for fine-tuning with function calling and gpt-3.5-turbo-16k will be coming later this fall. Fine-tuning steps Step 1 Prepare your data { "messages": [ { "role": "system", "content": "You are an assistant that occasionally misspells words" }, { "role": "user", "content": "Tell me a story." }, { "role": "assistant", "content": "One day a student went to schoool." } ] } Step 2 Upload files curl -https://api.openai.com/v1/files \ -H "Authorization: Bearer $OPENAI_API_KEY" \ -F "purpose=fine-tune" \ -F "file=@path_to_your_file" Step 3 Create a fine-tuning job curl https://api.openai.com/v1/fine_tuning/jobs \ -H "Content-Type: application/json" \ -H "Authorization: Bearer $OPENAI_API_KEY" \ -d '{ "training_file": "TRAINING_FILE_ID", "model": "gpt-3.5-turbo-0613", }' Once a model finishes the fine-tuning process, it is available to be used in production right away and has the same shared rate limits as the underlying model. Step 4 Use a fine-tuned model curl https://api.openai.com/v1/chat/completions \ -H "Content-Type: application/json" \ -H "Authorization: Bearer $OPENAI_API_KEY" \ -d '{ "model": "ft:gpt-3.5-turbo:org_id", "messages": [ { "role": "system", "content": "You are an assistant that occasionally misspells words" }, { "role": "user", "content": "Hello! What is fine-tuning?" } ] }' We will also be debuting a fine-tuning UI in the near future, which will give developers easier access to information about ongoing fine-tuning jobs, completed model snapshots, and more. Safety It is very important to us that the deployment of fine-tuning is safe. To preserve the default model's safety features through the fine-tuning process, fine-tuning training data is passed through our Moderation API and a GPT-4 powered moderation system to detect unsafe training data that conflict with our safety standards. Pricing Fine-tuning costs are broken down into two buckets: the initial training cost and usage cost: * Training: $0.008 / 1K Tokens * Usage input: $0.012 / 1K Tokens * Usage output: $0.016 / 1K Tokens For example, a gpt-3.5-turbo fine-tuning job with a training file of 100,000 tokens that is trained for 3 epochs would have an expected cost of $2.40. Updated GPT-3 models In July, we announced that the original GPT-3 base models (ada, babbage, curie, and davinci) would be turned off on January 4th, 2024. Today, we are making babbage-002 and davinci-002 available as replacements for these models, either as base or fine-tuned models. Customers can access those models by querying the Completions API. These models can be fine-tuned with our new API endpoint /v1/ fine_tuning/jobs. This new endpoint offers pagination and more extensibility to support the future evolution of the fine-tuning API. Transitioning from /v1/fine-tunes to the updated endpoint is straightforward and more details can be found in our new fine-tuning guide. This deprecates the old /v1/fine-tunes endpoint, which will be turned off on January 4th, 2024. Pricing for base and fine-tuned GPT-3 models is as follows: Base models Fine-tuned models Model Input tokens Output tokens Training Input tokens Output tokens babbage-002 $0.0004 / 1K tokens $0.0004 / 1K tokens $0.0004 / 1K tokens $0.0016 / 1K tokens $0.0016 / 1K tokens davinci-002 $0.002 / 1K tokens $0.002 / 1K tokens $0.006 / 1K tokens $0.012 / 1K tokens $0.012 / 1K tokens Authors * Andrew Peng View all articles * Michael Wu View all articles * John Allard View all articles * Logan Kilpatrick View all articles * Steven Heidel View all articles Acknowledgments Andrea Vallone, Arvind Neelakantan, Cindy Yong, Colin Jarvis, Denny Jin, Florencia Leoni Aleman, Henry Head, Ilan Bigio, Jeff Harris, Jessica Shieh, Kim Malfacini, Lauren Workman, Lilian Weng, Olivier Godement, Sherwin Wu, Shyamal Anadkat, Vik Goel, Yuchen He Research * Overview * Index Product * Overview * ChatGPT * GPT-4 * DALL*E 2 * Customer stories * Safety standards * API data privacy * Pricing Safety * Overview Company * About * Blog * Careers * Charter * Security OpenAI (c) 2015 - 2023Terms & policiesPrivacy policyBrand guidelines Social * Twitter * YouTube * GitHub * SoundCloud * LinkedIn Back to top