Skip to main content

OpenAI DevDay SF

Today at DevDay SF, OpenAI is launching a bunch of new capabilities to the platform.

Realtime API

The new Realtime API, now in public beta, allows paid developers to create low-latency, speech-to-speech experiences in apps, similar to ChatGPT’s Advanced Voice Mode. It supports real-time streaming of audio inputs and outputs, offering more natural and responsive conversations. Alongside this, an update to the Chat Completions API introduces audio input and output, supporting multimodal interactions with text or audio responses. These updates simplify the process for developers by consolidating speech recognition, text processing, and speech synthesis into a single API call, enhancing use cases like customer support and language learning.

Prompt Caching

Prompt Caching, introduced today, allows developers to reuse recently seen input tokens across multiple API calls, reducing both costs and latency. This feature provides a 50% discount and faster prompt processing times. Prompt Caching is automatically applied to the latest versions of GPT-4o, GPT-4o mini, o1-preview, o1-mini, and their fine-tuned counterparts.

Model Distillation

OpenAI is introducing a new Model Distillation offering, providing developers with an integrated workflow to manage the entire distillation pipeline directly within the platform. Model distillation fine-tunes smaller, cost-efficient models using outputs from more capable models, improving performance at a lower cost. This suite simplifies the previously complex, multi-step process with three key features: Stored Completions, which captures input-output pairs to build datasets for fine-tuning; Evals, a tool for custom performance evaluations; and seamless integration with OpenAI’s fine-tuning services. This offering reduces manual effort and streamlines model optimization.

Vision Fine-Tuning

OpenAI has introduced vision fine-tuning on GPT-4o, allowing developers to fine-tune the model using images in addition to text. This enhances the model’s image understanding capabilities, enabling applications such as improved visual search, better object detection for autonomous systems, and more accurate medical image analysis. While many developers have used text-only fine-tuning to improve task-specific performance, the addition of image fine-tuning addresses the limitations of text-based models for more complex, visual tasks.

For more news like this: thenextaitool.com/news


Comments

Popular posts from this blog

DeepSeek AI Unveils Revolutionary Reasoning Model

 China's DeepSeek AI Launches Transparent, High-Performance Model, Signaling a New Era in AI Innovation and Competition In a significant advancement from China's tech sector, DeepSeek AI has launched the DeepSeek-R1-Lite-Preview , an innovative AI reasoning model set to rival major players like OpenAI. This tool represents a substantial improvement in AI analytical capabilities, especially for tasks that involve complex reasoning and decision-making. By exceeding benchmarks such as AIME and MATH, DeepSeek-R1-Lite-Preview demonstrates its effectiveness and reliability in solving intricate problems. So, what differentiates DeepSeek ? Transparency. Users can observe its reasoning process in real-time, providing clear insights that aid in making informed decisions. Researchers and developers are excited because they anticipate open-source models and an API, which will allow them to integrate DeepSeek ’s capabilities into their own projects, fostering numerous opportunities for i...

ChatGPT's Mysterious Block on 'David Mayer' Draws Public Curiosity

Unveiling The Mystery of ChatGPT's Hesitance To Refer To 'David Mayer' And Its Wider Implications For AI Transparency In a curious twist, ChatGPT , the popular AI chatbot developed by OpenAI -maybe we should say CloseAI- has reportedly been displaying an unusual reluctance to mention the name 'David Mayer'. Users from various online platforms have voiced their concerns, confirming that when prompted with this name, ChatGPT tends to either cut conversations short or misdirect them, offering alternative answers that skirt around the main topic. Despite the growing buzz and speculation, OpenAI has yet to release an official statement explaining this behavior. The name 'David Mayer' might not ring a bell for everyone, but the context surrounding the mystery reveals links that suggest deeper layers. David de Rothschild, a prominent figure often referred to as David Mayer de Rothschild, is a British adventurer, environmentalist, and film producer. He is notably a...

Meta AI Releases Llama 3.3 70B Instruct

 Explore Llama 3.3 70B Instruct As It Sets New Standards In AI With Enhanced Reasoning, Multilingual, And Cost-Efficient Features Meta has introduced the Llama 3.3 70B Instruct, an advanced AI model that sets a new benchmark in reasoning, coding, and following instructions. As one of the most adaptable open models available, Llama 3.3 70B brings forth impressive capabilities with a wide array of applications. Enhanced Functionality and Multilingual Support This model shines in producing structured outputs, especially in step-by-step reasoning and JSON formatting, ensuring reliability and accuracy for developers. With support for eight major languages, including English, French, and Hindi, it aims to facilitate global communication. Revolutionizing Software Development The improvements in coding encompass extensive language support, better error handling, and comprehensive feedback, enabling developers to enhance their productivity. Llama 3.3 ’s task-aware tool usage optimizes re...