Updated guide · 2026

How to access Qwen - chat, apps & API

Three ways to use Alibaba's Qwen AI: a free browser chat, native desktop and mobile apps, and an OpenAI-compatible API. This guide walks through each path and includes a step-by-step install guide for the API.

No credit card to start chatting · ~5 minutes to a working API key · OpenAI-compatible

3 ways to access Qwen · Free chat, no signup · 1M + 1M free API tokens · ~5 min to first API call

Pick your path

Three ways to access Qwen

Which door you walk through depends on one question: are you here to chat, or here to build?

Qwen is Alibaba Cloud's family of large language models, and there is no single "Qwen app" you have to install before you can use it. Instead there are three distinct front doors, each suited to a different kind of person. Most people only ever need one of them, so the first job is figuring out which one is yours.

💬

Qwen Chat

FOR: everyone

The free web chatbot at chat.qwen.ai. No install, no payment, no API key. Open it in any browser and start typing - upload PDFs, paste screenshots, search the web.

⏱ Under 30 seconds · 🆓 Free

🖥️

Desktop & mobile apps

FOR: daily users

Native apps for Windows, macOS, Linux, iOS, and Android. Same account and chat history as the web, plus voice input and camera-based vision on mobile.

⏱ A few minutes · 🆓 Free

🔑

The Qwen API

FOR: developers

Programmatic access via Alibaba Cloud Model Studio (DashScope). OpenAI-compatible, so existing code works with a base-URL swap. This is what the install guide below covers.

⏱ ~5 minutes · 🆓 Free tier, then pay-per-token

Not sure? If you just want to ask the AI questions, write, or analyse documents and images, use Qwen Chat - skip everything technical. If you are a developer building Qwen into your own app, script, or product, head to the install guide. The two are completely separate; you do not need an API key to use the chat.
The easiest path

Access Qwen Chat in your browser

The fastest way to use Qwen needs no download, no install, and no payment - you can be chatting in under thirty seconds.

Qwen Chat is the official conversational interface for the Qwen models, built and maintained by Alibaba's Tongyi Lab. It is a free, web-based workspace where you can hold long multi-turn conversations, upload documents and images, generate and run code, search the live web with citations, and even produce interactive artifacts - all in the same window, with no setup required. For the overwhelming majority of people, this is the only access method they will ever need.

1

Open chat.qwen.ai

Go to chat.qwen.ai in any modern browser. It works on desktop, tablet, and phone - no app required. You can begin typing immediately as a guest.

2

Sign in (optional, but recommended)

Continue as a guest, or sign in with Google, GitHub, Apple, or email. Signing in is free and unlocks conversation history that syncs across devices, file uploads, and higher usage limits.

3

Pick a model

The default is the latest flagship Qwen model - the smartest all-rounder. Use the model picker at the top to switch to a specialist when it helps: a coding-tuned model for programming, a vision model for images, or a math model for proofs and equations.

4

Start working

Type a question, drag in a PDF, paste a screenshot, or enable web search for anything time-sensitive. Everything happens in one continuous conversation, so you can mix text, files, and images freely.

Try this first. Drag a PDF into the chat window and ask it to summarize the three most important points. It showcases Qwen Chat's multimodal strength better than any feature tour - and confirms your access is working end to end.
Everywhere you work

Desktop & mobile apps

The same account, chats, and prompts sync automatically across every platform - so you can start on your laptop and finish on your phone.

If you use Qwen often, a native app is more convenient than keeping a browser tab open. The desktop apps give you a dedicated window, keyboard shortcuts, and quicker file drops; the mobile apps add voice input and camera-based vision Q&A, so you can photograph a whiteboard or a document and ask about it on the spot. Whichever platform you choose, signing in with the same account keeps your full conversation history in sync.

| Platform | How to get it |
| --- | --- |
| Web | chat.qwen.ai |
| Windows | Desktop app |
| macOS | Desktop app |
| Linux | AppImage |
| iOS | App Store |
| Android | Google Play |
| Android APK | Direct download |
| Chrome | Extension |

Installing the desktop app is the usual three-step affair: download the installer for your operating system from the official Qwen site, run it (on macOS, drag the app into your Applications folder; on Linux, mark the AppImage executable and launch it), then sign in with your Qwen account. On mobile, search for the official Qwen app on the App Store or Google Play, or sideload the APK on Android if you prefer. Always download from the official source rather than a third-party mirror, since unofficial builds can be tampered with.

Verify the source. Only install Qwen apps from the official website or the official App Store / Google Play listings. AI tools are a popular target for lookalike scam apps that harvest credentials. If a download link or app name looks even slightly off, stop and confirm it against the official Qwen site first.
Install guide - for developers

Set up the Qwen API, step by step

From zero to a working API call in about five minutes - get a key, install the SDK, point it at the right endpoint, and send your first request.

The Qwen API is delivered through Alibaba Cloud Model Studio, the unified developer platform for everything Qwen. Under the hood the API surface is called DashScope - you will see both names in the documentation, and they refer to the same service. The key thing for developers is that the API is OpenAI-compatible: it works with the standard openai Python package, the OpenAI JS SDK, LangChain, LiteLLM, and anything else that speaks the OpenAI protocol. Migrating existing code usually means changing just three things - the base URL, the API key, and the model name.

Part 1 · Get your API key

1

Create an Alibaba Cloud account

Go to alibabacloud.com and sign up with a valid email and phone number for verification. Use the international site unless you are explicitly targeting mainland China deployment.

2

Activate Model Studio

Open the Model Studio product page and click Activate, then accept the Terms of Service. This step also enables your free quota - 1 million input tokens plus 1 million output tokens, valid for 90 days on the Singapore (International) region.

3

Open the API Keys page

In the Model Studio console, find the sidebar item labelled API Keys (sometimes shown as Key Management).

4

Create and copy your key

Click Create API Key, optionally add a description to track which app it belongs to, then copy the key immediately. It starts with sk-. Store it in a password manager, a .env file, or your platform's secrets manager - and never commit it to a public Git repository.

Set your key as an environment variable so you never have to hardcode it:

# macOS / Linux - current session only
export DASHSCOPE_API_KEY="sk-your-key-here"

# Make it permanent - add to ~/.bashrc or ~/.zshrc
echo 'export DASHSCOPE_API_KEY="sk-your-key-here"' >> ~/.bashrc

# Windows PowerShell - current session only
$env:DASHSCOPE_API_KEY = "sk-your-key-here"

# Windows - persist for future sessions (takes effect in new terminals)
setx DASHSCOPE_API_KEY "sk-your-key-here"

Regional keys are not interchangeable. A key created on the Singapore endpoint will fail authentication on the Beijing or US endpoint, and vice versa. If you hit a 401 error, a wrong base URL is more likely the cause than a bad key. Pick the region closest to your users and stick with it.

Part 2 · Pick your endpoint

Model Studio is deployed in four regions, each with its own endpoint and its own keys. Choose the one closest to your users - or whichever your compliance team approves - and use that base URL everywhere.

| Region | Base URL | Best for |
| --- | --- | --- |
| Singapore (International) | dashscope-intl.aliyuncs.com/compatible-mode/v1 | Default for non-China teams · has the free quota |
| US (Virginia) | dashscope-us.aliyuncs.com/compatible-mode/v1 | Lowest latency for US teams |
| China (Beijing) | dashscope.aliyuncs.com/compatible-mode/v1 | Mainland China deployments |
| Hong Kong | cn-hongkong.dashscope.aliyuncs.com/compatible-mode/v1 | Hong Kong region |
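One way to "use that base URL everywhere" is to keep the four endpoints in a single lookup, so the region is chosen once rather than pasted into each script. This is only a sketch: the short region labels and the `base_url_for` helper are our own names, and the `https://` scheme is assumed for all four hosts, matching the one this guide uses for the Singapore endpoint in its first request example.

```python
# Hosts and paths copied from the endpoint table; the dict keys are our own labels.
QWEN_BASE_URLS = {
    "singapore": "https://dashscope-intl.aliyuncs.com/compatible-mode/v1",
    "us": "https://dashscope-us.aliyuncs.com/compatible-mode/v1",
    "beijing": "https://dashscope.aliyuncs.com/compatible-mode/v1",
    "hongkong": "https://cn-hongkong.dashscope.aliyuncs.com/compatible-mode/v1",
}


def base_url_for(region: str) -> str:
    """Look up the base URL for a region label, ignoring case and spaces."""
    try:
        return QWEN_BASE_URLS[region.strip().lower()]
    except KeyError:
        known = ", ".join(sorted(QWEN_BASE_URLS))
        raise ValueError(
            f"Unknown region {region!r}; expected one of: {known}"
        ) from None
```

A script would then pass `base_url=base_url_for("singapore")` when creating its client, together with a key issued for that same region.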

Part 3 · Install the SDK

For almost everyone, the OpenAI-compatible interface is the right choice - install the official openai package and you are done. Use a virtual environment to keep things tidy:

# (optional) create and activate a virtual environment
python -m venv venv
source venv/bin/activate        # Windows: venv\Scripts\activate

# install the OpenAI-compatible SDK
pip install openai

# OR - only if you need DashScope-specific features
# (batch invocation, advanced multimodal, real-time speech)
pip install dashscope

Which SDK? The OpenAI-compatible interface covers about 95% of use cases - standard chat, streaming, function calling, JSON mode, vision input. Only reach for the native dashscope package if you specifically need its extras like 50%-discounted batch invocation or real-time speech.

First requests

Your first Qwen API calls

With a key, an endpoint, and the SDK installed, every standard feature works exactly as it does with OpenAI.

Your first request

import os
from openai import OpenAI

client = OpenAI(
    api_key=os.getenv("DASHSCOPE_API_KEY"),
    base_url="https://dashscope-intl.aliyuncs.com/compatible-mode/v1",
)

completion = client.chat.completions.create(
    model="qwen-plus",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Who are you?"}
    ]
)

print(completion.choices[0].message.content)

Streaming responses

For chatbot-style apps, stream tokens as they are generated:

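# Reuses the `client` created in "Your first request" above.
stream = client.chat.completions.create(
    model="qwen-plus",
    messages=[{"role": "user", "content": "Explain async I/O simply."}],
    stream=True,
)

for chunk in stream:
    # Skip chunks that carry no choices or no new text.
    if chunk.choices and chunk.choices[0].delta.content:
        print(chunk.choices[0].delta.content, end="", flush=True)
print()  # end the streamed output with a newline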

Vision input (multimodal)

For image tasks, switch to a vision-capable model and add an image block to the message:

# Reuses the same `client`; only the model and the message content change.
response = client.chat.completions.create(
    model="qwen-vl-plus",
    messages=[{
        "role": "user",
        "content": [
            {"type": "image_url",
             "image_url": {"url": "https://example.com/chart.png"}},
            {"type": "text", "text": "Extract this chart's data as JSON."}
        ]
    }]
)
print(response.choices[0].message.content)
That's it. Streaming, function calling, JSON mode, system prompts, multi-turn history, and image inputs all work through this same OpenAI-compatible client. To migrate an existing OpenAI project, you change only the base URL, the key, and the model name - nothing else.
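Multi-turn history is nothing more than the growing `messages` list, resent with each request. The sketch below shows one way to maintain it and cap its length; the helper name `add_turn` and the 20-message default are arbitrary choices for illustration, not anything Qwen or the SDK prescribes.

```python
from typing import Dict, List

Message = Dict[str, str]


def add_turn(history: List[Message], role: str, content: str,
             max_messages: int = 20) -> List[Message]:
    """Return a new history with one message appended and old turns trimmed.

    A leading system message is always kept; only the oldest user/assistant
    messages are dropped once max_messages is exceeded.
    """
    if max_messages < 2:
        raise ValueError("max_messages must be at least 2")
    updated = history + [{"role": role, "content": content}]
    if len(updated) <= max_messages:
        return updated
    if updated[0]["role"] == "system":
        # Keep the system prompt plus the most recent messages.
        return [updated[0]] + updated[-(max_messages - 1):]
    return updated[-max_messages:]
```

In use, you would call `add_turn(history, "user", question)`, pass the result as `messages=`, then call `add_turn` again with role `"assistant"` and the reply text before the next question.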
Models & pricing

Choosing a model & what it costs

One of Qwen's strongest selling points is price - often an order of magnitude cheaper than GPT-4-class endpoints, with a free tier large enough to actually prototype on.

You access the whole Qwen catalogue through the same key and endpoint - you simply change the model string. A good default is a balanced general model; reach for cheaper high-volume models when latency and cost matter more than peak quality, a reasoning flagship for the hardest problems, and the specialists for code, vision, or math. Here is a snapshot of popular options on the International endpoint:

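| Model | Input ($ per 1M tokens) | Output ($ per 1M tokens) | Context | Use for |
| --- | --- | --- | --- | --- |
| Qwen-Max | $1.04 | $4.16 | 262K | Reasoning-heavy work |
| Qwen-Plus | $0.26 | $0.78 | 1M | Best general default |
| Qwen-Turbo | $0.05 | $0.20 | 1M | High-volume, cheap |
| Qwen-Flash | from $0.033 | from $0.13 | 1M | Tiered, lowest cost |
| Qwen-Coder | ~$0.30 | ~$1.50 | 128K | Programming |
| Qwen-VL | varies by model | varies by model | large | Vision / images |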

Two billing nuances are worth knowing. First, several models are priced in tiers based on the input size of each request - a short request and a near-maxed-out one fall into different rate brackets, so keeping requests lean saves money. Second, batch invocation gets a 50% discount on both input and output tokens, ideal for non-real-time workloads like overnight document processing or dataset labelling, at the cost of asynchronous (non-instant) results.
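To make the arithmetic concrete, here is a rough estimator built on the snapshot prices quoted in this section. It is a sketch under simplifying assumptions: it applies one flat rate per model and ignores the tiered brackets just described, and the lowercase dictionary keys are our own labels, so check exact model strings and current rates before relying on it.

```python
# USD per million tokens as (input, output), from this guide's price snapshot.
# These are not live prices.
PRICES_PER_MILLION = {
    "qwen-max": (1.04, 4.16),
    "qwen-plus": (0.26, 0.78),
    "qwen-turbo": (0.05, 0.20),
}

BATCH_DISCOUNT = 0.5  # batch invocation: 50% off both input and output


def estimate_cost(model: str, input_tokens: int, output_tokens: int,
                  batch: bool = False) -> float:
    """Estimate the USD cost of a workload at flat (non-tiered) rates."""
    price_in, price_out = PRICES_PER_MILLION[model]
    cost = (input_tokens * price_in + output_tokens * price_out) / 1_000_000
    return cost * BATCH_DISCOUNT if batch else cost
```

For example, 2 million input tokens and 500,000 output tokens on the Qwen-Plus row work out to about $0.91 at the listed rates, or roughly half that as a batch job.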

About the free tier. New Model Studio accounts on the Singapore region get 1M input + 1M output tokens free for 90 days - a combined cap shared across most Qwen models, enough to thoroughly prototype before you pay anything. It does not apply to the US or mainland China regions, and it expires whether or not you use it, so don't sit on it. Prices change often; always confirm current rates on the official DashScope pricing page before committing a production workload.
When things break

Troubleshooting common errors

Most access problems trace back to one of a small handful of causes - here's how to fix them fast.

401 Unauthorized

Almost always one of three things: the wrong base URL for your key's region (Singapore keys fail on Beijing endpoints), an environment variable that isn't actually loaded in your shell, or a sub-workspace key without model permissions. Check the URL first.

429 Too Many Requests

You've hit a rate limit - DashScope applies both requests-per-minute and requests-per-second caps. Implement exponential backoff in production, and request a quota increase from the Model Studio console once you have a billing history.
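Exponential backoff can be sketched as a small generic wrapper. Everything here is illustrative: the helper name, the default delays, and the jitter range are assumptions rather than documented DashScope recommendations, and the exception types to retry on are passed in so the block does not depend on any particular SDK.

```python
import random
import time
from typing import Callable, Tuple, Type, TypeVar

T = TypeVar("T")


def call_with_backoff(
    func: Callable[[], T],
    retry_on: Tuple[Type[BaseException], ...],
    max_attempts: int = 5,
    base_delay: float = 1.0,
    max_delay: float = 30.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call func(), retrying on the given exceptions with exponential backoff."""
    for attempt in range(max_attempts):
        try:
            return func()
        except retry_on:
            if attempt == max_attempts - 1:
                raise  # out of attempts: surface the original error
            # Delay doubles each attempt (1s, 2s, 4s, ...), capped at max_delay.
            delay = min(max_delay, base_delay * (2 ** attempt))
            # Jitter spreads out retries from clients that failed together.
            sleep(delay + random.uniform(0, delay / 2))
    raise RuntimeError("max_attempts must be at least 1")
```

With the `openai` package you would typically pass `retry_on=(openai.RateLimitError,)` and wrap the request in a lambda, for example `call_with_backoff(lambda: client.chat.completions.create(...), (openai.RateLimitError,))`. The client also accepts a `max_retries` argument if its built-in retry behaviour is enough for your case.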

Key starts with sk- but still fails

Confirm the key is from the same region as your base URL, that Model Studio is actually activated on the account, and that you copied the full key without trailing spaces. Regenerate the key if in doubt.
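Some of these checks can be automated before a request is ever sent. The sketch below is a hypothetical preflight helper (the name and messages are ours) that looks only at the local configuration. It cannot tell which region a key was issued in, so a clean result does not rule out a region mismatch.

```python
from typing import List
from urllib.parse import urlparse

# Hosts taken from the endpoint table earlier in this guide.
KNOWN_HOSTS = (
    "dashscope-intl.aliyuncs.com",
    "dashscope-us.aliyuncs.com",
    "dashscope.aliyuncs.com",
    "cn-hongkong.dashscope.aliyuncs.com",
)


def preflight(api_key: str, base_url: str) -> List[str]:
    """Return a list of likely configuration problems (empty if none found)."""
    problems = []
    if not api_key:
        problems.append("API key is empty: is the environment variable loaded?")
    elif api_key != api_key.strip():
        problems.append("API key has leading or trailing whitespace.")
    elif not api_key.startswith("sk-"):
        problems.append("API key does not start with 'sk-'.")
    # hostname is None when the scheme (https://) is missing from the URL.
    host = urlparse(base_url).hostname or ""
    if host not in KNOWN_HOSTS:
        problems.append(f"Unrecognised endpoint host: {host!r}")
    if not base_url.rstrip("/").endswith("/compatible-mode/v1"):
        problems.append("Base URL should end with /compatible-mode/v1.")
    return problems
```

Printing the returned list at startup gives a quick first pass before you regenerate a key.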

Model not found / invalid model

The model string may be wrong or unavailable in your region - check the official model catalogue for exact names. Preview models in particular can have names that change, so verify against current docs.

Free quota exhausted unexpectedly

The 1M + 1M free allowance is a combined cap across most models and expires after 90 days. Reasoning models also emit far more output tokens than you might expect, which burns the output half quickly. Check usage in the console.
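If you want a local running total between console checks, OpenAI-style responses report token counts on a `usage` object. The tally below is a rough sketch: the class name is ours, it assumes every response you feed it carries `usage.prompt_tokens` and `usage.completion_tokens`, and the console remains the authoritative figure.

```python
from dataclasses import dataclass
from typing import Tuple

FREE_INPUT_TOKENS = 1_000_000   # free-tier figures quoted in this guide
FREE_OUTPUT_TOKENS = 1_000_000


@dataclass
class UsageTally:
    """Running total of tokens consumed, fed from each response's usage data."""
    input_tokens: int = 0
    output_tokens: int = 0

    def add(self, usage) -> None:
        # OpenAI-style responses expose prompt_tokens and completion_tokens.
        self.input_tokens += usage.prompt_tokens
        self.output_tokens += usage.completion_tokens

    def remaining(self) -> Tuple[int, int]:
        """Return (input, output) free tokens left, never below zero."""
        return (
            max(0, FREE_INPUT_TOKENS - self.input_tokens),
            max(0, FREE_OUTPUT_TOKENS - self.output_tokens),
        )
```

After each non-streaming call you would run `tally.add(completion.usage)` and log `tally.remaining()`.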

Can't reach the endpoint

The API can be called from anywhere - AWS, GCP, Azure, your laptop - so a connection failure is usually a local network or firewall issue, or a region your network can't reach. Try a different network to isolate it.
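A quick way to separate network problems from API problems is to test whether a plain TCP connection to the endpoint host opens at all. This sketch uses only the Python standard library; the helper name and the five-second timeout are arbitrary choices.

```python
import socket


def can_connect(host: str, port: int = 443, timeout: float = 5.0) -> bool:
    """Return True if a TCP connection to host:port opens within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers DNS failures, refused connections, and timeouts alike.
        return False
```

For example, `can_connect("dashscope-intl.aliyuncs.com")` returning False on one network and True on another points at a local firewall or routing issue rather than at your key or code.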

Where to look next. The official documentation lives on the Alibaba Cloud help center, and the QwenLM organization on GitHub hosts working code examples for every major model - often clearer than the formal docs when something is ambiguous. Bookmark the error-code reference page; it lists every 4xx/5xx code with remediation steps.
FAQ

Frequently asked questions

Do I need to install anything to use Qwen?
No. To chat with Qwen, just open chat.qwen.ai in any browser - no install, no account, no payment required. Installation only comes into play if you want a native desktop or mobile app, or if you're a developer setting up the API SDK. Those are optional paths, not prerequisites.
Is accessing Qwen free?
Qwen Chat is free with a generous daily limit and access to nearly every feature. The API has a free tier - 1 million input plus 1 million output tokens, valid for 90 days on the Singapore region - after which you pay only for the tokens you use. A paid Pro tier for the chat exists for unlimited usage, but it's optional.
Do I need an API key to use Qwen Chat?
No - these are completely separate. The web chat and the apps require no API key at all; you just sign in (or use guest mode). An API key is only for developers calling Qwen programmatically through Model Studio / DashScope.
How long does it take to get API access?
About five minutes. Create an Alibaba Cloud account, activate Model Studio (which enables the free quota), open the API Keys page, create a key, and set it as an environment variable. Then install the openai package and you can send your first request immediately.
Is the Qwen API really OpenAI-compatible?
Yes. It works with the official openai Python and JS SDKs, LangChain, LlamaIndex, LiteLLM, and any tool that speaks the OpenAI protocol. To migrate an existing project you change just three things - the base URL, the API key, and the model name. Streaming, function calling, and JSON mode all behave identically.
Why does my key return a 401 error?
Usually the base URL doesn't match your key's region - Singapore keys won't authenticate on Beijing or US endpoints. The other common causes are an environment variable that isn't loaded in your current shell, or a sub-workspace key lacking model permissions. Check the URL first, then the variable.
Can I access Qwen from outside Alibaba Cloud?
Yes. There's no requirement to use any other Alibaba Cloud service. You can call the API from AWS, GCP, Azure, your laptop, or anywhere else - all that matters is which regional endpoint you target and whether your network can reach it.
Which platforms have a Qwen app?
Qwen Chat runs on the web plus native apps for Windows, macOS, and Linux (AppImage), and mobile apps for iOS and Android (with a direct APK option). There's also a Chrome extension. The same account syncs your conversations across all of them.
Are my conversations used to train Qwen?
In Qwen Chat you can toggle "Improve the model for everyone" off in privacy settings to opt out. For the API on standard paid tiers, requests are not used for training by default per the Model Studio Terms of Service; sensitive use cases can request a Data Processing Agreement through Alibaba Cloud sales.