gpt-oss: OpenAI’s New Open-Weight Model for Azure and Windows AI
Microsoft and OpenAI have just revealed a significant update for Windows and Azure users: gpt-oss, a new open-weight AI model family built to run on Microsoft’s platforms. This release marks the first time in years that OpenAI is offering a model developers can freely build on and customize inside the Azure AI Foundry and Windows AI ecosystem.
Why does this matter? We’re entering a phase where AI isn’t just for cloud giants. With gpt-oss, developers, startups, and even students can experiment locally on Windows or scale in the cloud through Azure. It’s designed for flexibility: you can fine-tune it, deploy it in hybrid setups, and integrate it into apps faster.
We’ll explore what gpt-oss offers, how it fits Microsoft’s AI plans, and why this could shape the next wave of AI tools on Windows.
What is gpt-oss?
gpt-oss is an open-weight language model. The full model weights have been made available to the public under the Apache 2.0 license.
There are two sizes:
- gpt-oss-120b (~120 billion parameters)
- gpt-oss-20b (~20 billion parameters)
On benchmark results, gpt-oss performs close to OpenAI’s closed models, including o4-mini and o3-mini.
It supports reasoning, coding, web browsing, math, and agent tasks.
Thanks to its chain-of-thought design, it shows its reasoning steps, making it more transparent and easier to monitor for safety.
OpenAI and Microsoft Partnership Context
OpenAI and Microsoft have maintained a strong collaboration since 2019. Microsoft powers OpenAI’s models on Azure supercomputers and integrates them into products like Copilot and GitHub.
gpt-oss is now available through Azure AI Foundry and Windows AI Foundry. Microsoft CEO Satya Nadella said this hybrid AI setup shows innovation in action: users can run models on their own machine or in Azure’s cloud.
This launch reflects Microsoft’s strategy to integrate multiple model providers, including DeepSeek’s R1 and xAI’s Grok, into Azure AI Foundry.
Key Features and Capabilities of gpt‑oss
Performance & Size:
- gpt-oss-120b is highly capable yet can run on a single 80 GB GPU.
- gpt-oss-20b can run on a PC with 16 GB of memory.
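As a back-of-envelope check on those hardware claims, here is a rough VRAM estimate, assuming the roughly 4-bit (MXFP4) quantization the gpt-oss weights ship with; the 1.2x overhead factor for activations and KV cache is an illustrative assumption, not an official figure:

```python
def approx_vram_gb(n_params_b: float, bits_per_param: float,
                   overhead: float = 1.2) -> float:
    """Rough VRAM estimate: parameter bytes times an overhead factor
    for activations and KV cache (assumed, not an official figure)."""
    param_bytes = n_params_b * 1e9 * bits_per_param / 8
    return param_bytes * overhead / 1e9

# gpt-oss ships with ~4-bit (MXFP4) quantized MoE weights
print(round(approx_vram_gb(120, 4), 1))  # 72.0 -> fits a single 80 GB GPU
print(round(approx_vram_gb(20, 4), 1))   # 12.0 -> fits a 16 GB PC
```

The numbers line up with the stated requirements; at full 16-bit precision the 120b model would instead need several hundred gigabytes.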
Windows AI Integration:
- Developers can deploy locally with Foundry Local (part of Windows AI Foundry) for offline use on Windows PCs.
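Foundry Local exposes an OpenAI-compatible REST endpoint, so a request to a locally running gpt-oss-20b looks like any chat-completion call. The port and model id below are illustrative assumptions; check your own install (for example, via the Foundry Local CLI) for the actual values:

```python
import json

# Assumed local endpoint and model id -- both are illustrative;
# your Foundry Local install may use a different port and alias.
BASE_URL = "http://localhost:5273/v1/chat/completions"

def build_chat_request(prompt: str, model: str = "gpt-oss-20b") -> str:
    """Return the JSON body for an OpenAI-compatible chat completion call."""
    body = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "max_tokens": 256,
    }
    return json.dumps(body)

payload = build_chat_request("Summarize the Apache 2.0 license in one line.")
print(json.loads(payload)["model"])  # gpt-oss-20b
```

You would POST this body to `BASE_URL` with any HTTP client; because the API shape is OpenAI-compatible, the same code can later target an Azure-hosted endpoint by changing only the URL and credentials.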
Azure-native deployment:
- Azure AI Foundry provides built-in tools to fine-tune and deploy gpt-oss models.
Customization:
- Users can inspect the open weights and adapt the model to specialized tasks through fine-tuning.
- LoRA-style adaptation can reduce compute needs for customization.
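To illustrate why LoRA-style adaptation cuts compute: instead of updating a full weight matrix, you train two small low-rank factors and leave the pretrained weight frozen. This NumPy sketch shows the general idea with toy dimensions; it is not OpenAI’s or Azure’s fine-tuning API:

```python
import numpy as np

# Toy LoRA sketch: instead of updating a full d x d weight matrix,
# train two low-rank factors B (d x r) and A (r x d), with r << d.
d, r = 1024, 8
rng = np.random.default_rng(0)
W = rng.standard_normal((d, d))          # frozen pretrained weight
A = rng.standard_normal((r, d)) * 0.01   # trainable
B = np.zeros((d, r))                     # trainable, zero-init so the delta starts at 0

def forward(x):
    # Effective weight is W + B @ A; only A and B would receive gradients.
    return x @ (W + B @ A).T

full_params = d * d
lora_params = 2 * d * r
print(lora_params / full_params)  # 0.015625 -> ~64x fewer trainable params
```

With these toy dimensions, the adapter trains about 16 thousand parameters instead of a million per layer, which is the source of the compute savings the bullet above refers to.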
Security and compliance:
- OpenAI performed internal misuse testing and external audits to assess risk levels.
- Its chain-of-thought transparency aids in safety monitoring.
Open‑Source Strategy: How “OSS” Is Implemented
The “oss” in gpt-oss means open weights, not fully open-source code or training data. The weights are provided under the Apache 2.0 license, which allows users to run, modify, distribute, and use the model commercially.
This doesn’t include training code or datasets, but still allows fine-tuning and inspection.
Compared with other open-weight LLMs such as Meta’s Llama or Mistral’s models, gpt-oss focuses on reasoning and enterprise-grade deployment.
Benefits for Developers and Enterprises
Developers:
- Run gpt-oss on your own PC or on Azure.
- The 20b version fits on many consumer machines.
- It brings powerful AI tooling within reach without recurring cloud costs.
Enterprises:
- A hybrid deployment means better data control and lower latency.
- Pricing is flexible: run locally or buy compute via Azure.
- AWS now also offers gpt-oss on Bedrock and SageMaker, expanding its availability.
Innovation:
- Open access means startups and students can build on gpt-oss.
- We expect faster development of new apps and agents thanks to this flexibility.
Potential Challenges and Limitations
Compute requirement:
- The 120b version needs a high-memory GPU (around 80 GB of VRAM).
- Not all users can run it easily.
Safety risk:
- Open weights make misuse possible.
- Controls help, but risk remains without proper oversight.
Competition:
- Rivals like Meta’s Llama, DeepSeek’s R1, and Mistral offer similar options, sometimes faster or smaller.
Enterprise adoption:
- Integrating gpt-oss into existing systems takes work.
- Training staff and fine-tuning models adds time and cost.
Future Outlook and Roadmap
We expect OpenAI and Microsoft to continue improving gpt-oss:
- Multimodal support (vision, audio) may be added.
- Deeper integration with Windows Copilot and Azure ML tools.
- Possible lightweight versions for edge AI devices.
Microsoft may expand gpt-oss into future Copilot and Windows features.
OpenAI’s move suggests more open-weight releases ahead. That could drive broad global innovation.
Conclusion
gpt-oss offers a rare blend of power and openness. It opens doors for developers and enterprises to build custom AI tools on Windows or Azure. With two model sizes, transparent reasoning, and enterprise-ready deployment, it bridges the gap between closed proprietary models and fully open systems.
We expect gpt-oss to spur innovation, especially where offline, secure, and customizable AI is key. If you’re building AI tools in Microsoft ecosystems, gpt-oss is worth exploring.
FAQs
What is gpt-oss?
OpenAI introduced gpt-oss, an AI model released with open weights. It comes in two sizes, 120b and 20b, and works on Azure and Windows for developers and enterprises.
How are OpenAI and Microsoft Azure related?
OpenAI builds ChatGPT, and Microsoft Azure hosts its models in the cloud. This partnership lets users access OpenAI’s models, including GPT-4 and gpt-oss, directly through Azure AI services.
How does gpt-oss relate to ChatGPT?
ChatGPT runs on OpenAI’s recent proprietary models (such as GPT-4.1), which are more accurate, faster, and support reasoning tasks. gpt-oss is a distinct open-weight release, yet it integrates smoothly with both the Azure and Windows ecosystems.
Disclaimer:
This content is for informational purposes only and is not professional advice. Always conduct your own research.