🧠

开源LLM

📂 llm ⏱ 1 min 143 words

--- title: "开源LLM" description: "开源大语言模型生态综述，涵盖Llama、Qwen、Mistral等主流模型" tags: ["开源LLM", "Llama", "Qwen", "Mistral", "LLM生态"] category: "llm" icon: "🧠"

开源LLM

开源大语言模型是指模型权重、训练代码或训练数据公开可用的LLM。开源生态的蓬勃发展打破了闭源模型的垄断，推动了AI技术的民主化，让企业和研究者能够自由地使用、修改和部署大语言模型。

主流开源模型

Llama系列

Meta推出的Llama系列是目前生态最丰富的开源模型。Llama 2提供了7B到70B参数的多种规格，并开放商用许可。Llama 3进一步扩展到405B参数，性能大幅提升，在多项基准测试中接近闭源模型水平。

Qwen系列

阿里巴巴推出的Qwen系列支持多语言，在中英文任务上表现出色。Qwen2.5提供了从0.5B到72B的多种规格，覆盖端侧部署到大规模推理的全场景需求。

Mistral系列

Mistral AI以效率著称，Mistral 7B引入滑动窗口注意力机制，在小参数量下实现优异性能。Mixtral 8x7B是开源MoE模型的代表，以较低的计算成本获得接近大模型的性能。

其他重要模型

DeepSeek以高性价比著称，Phi系列证明了小模型也能有强大能力，Gemma是Google的轻量级开源模型，Yi系列在中文任务上表现突出。

使用开源模型

通过Hugging Face加载模型

from transformers import AutoTokenizer, AutoModelForCausalLM

model_name = "Qwen/Qwen2.5-7B-Instruct"
tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype="auto",
    device_map="auto",
    trust_remote_code=True
)

messages = [
    {"role": "user", "content": "用Python实现快速排序算法"}
]
text = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
inputs = tokenizer([text], return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=512)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))

通过Ollama本地运行

# 安装Ollama后，拉取并运行模型
ollama pull qwen2.5:7b
ollama run qwen2.5:7b

# 通过API调用
curl http://localhost:11434/api/chat -d '{
  "model": "qwen2.5:7b",
  "messages": [{"role": "user", "content": "解释量子计算的基本原理"}],
  "stream": false
}'

使用vLLM高性能推理

from vllm import LLM, SamplingParams

llm = LLM(model="Qwen/Qwen2.5-7B-Instruct", tensor_parallel_size=2)
sampling_params = SamplingParams(temperature=0.7, max_tokens=512)

prompts = [
    "请介绍一下中国的四大发明",
    "用Python写一个HTTP服务器",
    "解释什么是机器学习"
]
outputs = llm.generate(prompts, sampling_params)
for output in outputs:
    print(output.outputs[0].text)

开源生态工具

Hugging Face提供模型托管和训练平台，llama.cpp支持跨平台CPU推理，vLLM专注高吞吐推理服务，Ollama让本地模型运行变得简单。这些工具构成了完整的开源LLM工具链。

开源的价值

可审计性：检查模型行为和偏见
可定制性：根据需求修改和微调
数据隐私：本地运行保护数据安全
成本控制：避免API调用费用
研究推动：促进学术研究和技术创新

挑战与趋势

开源LLM面临算力成本、安全治理和商业可持续性等挑战。未来趋势包括模型效率提升、端侧部署优化，以及开源与闭源的混合模式。随着技术进步，开源模型正在缩小与闭源模型的差距。

﻿--- title: "开源LLM" description: "开源大语言模型生态综述，涵盖Llama、Qwen、Mistral等主流模型" tags: ["开源LLM", "Llama", "Qwen", "Mistral", "LLM生态"] category: "llm" icon: "🧠"

开源LLM

主流开源模型

Llama系列

Qwen系列

Mistral系列

其他重要模型

使用开源模型

通过Hugging Face加载模型

通过Ollama本地运行

使用vLLM高性能推理

开源生态工具

开源的价值

挑战与趋势

--- title: "开源LLM" description: "开源大语言模型生态综述，涵盖Llama、Qwen、Mistral等主流模型" tags: ["开源LLM", "Llama", "Qwen", "Mistral", "LLM生态"] category: "llm" icon: "🧠"