GPT-4o is the latest and most advanced language model developed by OpenAI, building upon their previous GPT-4 and GPT-4 Turbo models. The "o" in the name stands for "omni", reflecting the model's expanded multimodal capabilities. Performance Improvements: - Twice as fast as GPT-4 Turbo - 50% cheaper for both input and output tokens ($5 per million input, $15 per million output) - 5x higher rate limit up to 10 million tokens per minute When compared with [[GPT-3.5]] it is better at everything: writing, summarizing, coding, etc. Accessibility: - Available through ChatGPT, OpenAI API, Microsoft Azure, and potentially other platforms - Free access options for individual users, with paid tiers for enterprises A version of GPT-4 is the base of [[Bing Copilot]], a freely available dialogue system associated with Microsoft's Bing search engine. As of July 2024, what is the API cost of OpenAI GPT models per million words? The pricing structure for OpenAI's GPT API versions — GPT-3.5, GPT-4, and GPT-4o — operates on a token-based model, with charges based on the number of tokens processed. As of the latest update, the pricing per 1 million input tokens is as follows: - GPT-3.5: 0.47 EUR - GPT-4: 27.9 EUR - GPT-4o: 4.65 EUR For output tokens, the pricing per 1 million tokens is: - GPT-3.5: 1.40 EUR - GPT-4: 55.80 EUR - GPT-4o: 13.95 EUR [[GPT-3.5]] < [[Hands-on LLMs]]/[[3 Instruction-Tuned LLM Systems]] > [[Bing Copilot]]