Microsoft Phi is a family of small language models (SLMs) that deliver strong performance relative to their compact size. Ranging from roughly 1.3B to 14B parameters, Phi models are designed for scenarios where larger models are impractical due to latency, cost, or hardware constraints.
Key use cases center on running AI without cloud dependency: Phi models are used by mobile developers, hardware manufacturers, and enterprises deploying at the edge. Phi-3.5 Mini (3.8B) is reported to match or exceed many 7B models on standard benchmarks.
Phi models are released under the MIT license, which permits free use in commercial applications; the main condition is that the copyright and license notice be kept with copies of the software.
microsoft/phi-*).

# Ollama (simplest)
ollama run phi3.5

# Transformers (install from the shell)
pip install transformers torch

# Then, in Python:
from transformers import pipeline
pipe = pipeline('text-generation', model='microsoft/Phi-3.5-mini-instruct')
pipe('Hello!')

Pricing: Model weights are free (MIT license). No API key needed for local use. Azure AI Studio endpoints are billed per token through your Azure subscription. Self-hosting requires only modest GPU resources (4-8GB VRAM for smaller variants).
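To see roughly where a VRAM figure like the one above comes from, a common back-of-the-envelope estimate multiplies the parameter count by the bytes stored per parameter. The sketch below is only that: the function name and the precision table are illustrative assumptions rather than part of any Phi tooling, and the result covers weights only (KV cache, activations, and runtime overhead add to it, and real quantized formats carry some extra metadata).

```python
# Rough, weights-only memory estimate: parameters x bytes per parameter.
# Assumption: idealized storage sizes; actual quantized formats add overhead.
BYTES_PER_PARAM = {"fp32": 4.0, "fp16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_memory_gb(num_params: float, precision: str) -> float:
    """Approximate memory for the weights alone, in GB (10**9 bytes)."""
    return num_params * BYTES_PER_PARAM[precision] / 1e9


# Phi-3.5 Mini has 3.8B parameters (figure taken from the text above).
for precision in ("fp16", "int8", "int4"):
    gb = weight_memory_gb(3.8e9, precision)
    print(f"Phi-3.5 Mini (3.8B) at {precision}: ~{gb:.1f} GB for weights")
```

Under these assumptions the 3.8B model needs about 7.6 GB at 16-bit precision and about 1.9 GB at 4-bit, which is broadly consistent with the 4-8GB range once runtime overhead is included.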