
Commit 2010508

DOC: small fixes for doc (#3294)

1 parent e96ac6b commit 2010508
10 files changed: +75 -41 lines changed

doc/source/getting_started/installation.rst (+2)

@@ -61,6 +61,8 @@ Currently, supported models include:
 - ``QwQ-32B-Preview``, ``QwQ-32B``
 - ``marco-o1``
 - ``fin-r1``
+- ``seallms-v3``
+- ``skywork-or1-preview``
 - ``gemma-it``, ``gemma-2-it``, ``gemma-3-1b-it``
 - ``orion-chat``, ``orion-chat-rag``
 - ``c4ai-command-r-v01``

doc/source/locale/zh_CN/LC_MESSAGES/models/virtualenv.po (+33 -20)

@@ -8,7 +8,7 @@ msgid ""
 msgstr ""
 "Project-Id-Version: Xinference \n"
 "Report-Msgid-Bugs-To: \n"
-"POT-Creation-Date: 2025-04-19 00:37+0800\n"
+"POT-Creation-Date: 2025-04-19 14:42+0000\n"
 "PO-Revision-Date: YEAR-MO-DA HO:MI+ZONE\n"
 "Last-Translator: FULL NAME <EMAIL@ADDRESS>\n"
 "Language: zh_CN\n"
@@ -17,7 +17,7 @@ msgstr ""
 "MIME-Version: 1.0\n"
 "Content-Type: text/plain; charset=utf-8\n"
 "Content-Transfer-Encoding: 8bit\n"
-"Generated-By: Babel 2.14.0\n"
+"Generated-By: Babel 2.16.0\n"
 
 #: ../../source/models/virtualenv.rst:5
 msgid "Model Virtual Environments"
@@ -41,9 +41,10 @@ msgid ""
 "latest version of ``transformers``. This version mismatch leads to "
 "dependency conflicts."
 msgstr ""
-"一些模型在发布后不再维护,其依赖的库版本也保持在较旧的状态。例如,``GOT-OCR2`` 模型仍依赖于 "
-"``transformers`` 4.37.2。如果将该库升级为新版本,模型将无法正常运行;"
-"而许多新模型又需要最新版本的 ``transformers``。这种版本差异会导致依赖冲突。"
+"一些模型在发布后不再维护,其依赖的库版本也保持在较旧的状态。例如,``GOT-"
+"OCR2`` 模型仍依赖于 ``transformers`` 4.37.2。如果将该库升级为新版本,模型"
+"将无法正常运行;而许多新模型又需要最新版本的 ``transformers``。这种版本"
+"差异会导致依赖冲突。"
 
 #: ../../source/models/virtualenv.rst:20
 msgid "Solution"
@@ -56,64 +57,76 @@ msgid ""
 msgstr "为了解决这个问题,我们引入了 **模型虚拟空间** 功能。"
 
 #: ../../source/models/virtualenv.rst:24
+msgid "Install requirements for this functionality via"
+msgstr "通过以下命令安装该功能所需的依赖"
+
+#: ../../source/models/virtualenv.rst:33
 msgid ""
 "Enable by setting environment variable "
 "``XINFERENCE_ENABLE_VIRTUAL_ENV=1``."
 msgstr "通过设置环境变量 ``XINFERENCE_ENABLE_VIRTUAL_ENV=1`` 启用该功能。"
 
-#: ../../source/models/virtualenv.rst:26
+#: ../../source/models/virtualenv.rst:35
 msgid "Example usage:"
 msgstr "使用示例:"
 
-#: ../../source/models/virtualenv.rst:38
+#: ../../source/models/virtualenv.rst:47
 msgid "This feature requires internet access or a self-hosted PyPI mirror."
 msgstr "该功能需要联网,或使用自建的 PyPI 镜像服务。"
 
-#: ../../source/models/virtualenv.rst:42
+#: ../../source/models/virtualenv.rst:49
+msgid "Xinference will by default inherit the config for current pip."
+msgstr "Xinference 默认会继承当前 pip 的配置。"
+
+#: ../../source/models/virtualenv.rst:53
 msgid ""
 "The model virtual environment feature is disabled by default (i.e., "
 "XINFERENCE_ENABLE_VIRTUAL_ENV is set to 0)."
 msgstr ""
-"模型虚拟空间功能默认处于关闭状态(即 ``XINFERENCE_ENABLE_VIRTUAL_ENV`` 的默认值为 0)。"
+"模型虚拟空间功能默认处于关闭状态(即 ``XINFERENCE_ENABLE_VIRTUAL_ENV`` 的"
+"默认值为 0)。"
 
-#: ../../source/models/virtualenv.rst:44
+#: ../../source/models/virtualenv.rst:55
 msgid "It will be enabled by default starting from Xinference v2.0.0."
 msgstr "该功能将在 Xinference v2.0.0 起默认开启。"
 
-#: ../../source/models/virtualenv.rst:46
+#: ../../source/models/virtualenv.rst:57
 msgid ""
 "When enabled, Xinference will automatically create a dedicated virtual "
 "environment for each model when it is loaded, and install its specific "
 "dependencies there. This prevents dependency conflicts between models, "
 "allowing them to run in isolation without affecting one another."
 msgstr ""
-"启用该功能后,Xinference 会在加载模型时自动为其创建专属的虚拟环境,并在其中安装对应依赖。"
-"这可避免模型之间的依赖冲突,确保各模型在相互隔离的环境中独立运行。"
+"启用该功能后,Xinference 会在加载模型时自动为其创建专属的虚拟环境,并在"
+"其中安装对应依赖。这可避免模型之间的依赖冲突,确保各模型在相互隔离的环境"
+"中独立运行。"
 
-#: ../../source/models/virtualenv.rst:51
+#: ../../source/models/virtualenv.rst:62
 msgid "Supported Models"
 msgstr "支持的模型"
 
-#: ../../source/models/virtualenv.rst:53
+#: ../../source/models/virtualenv.rst:64
 msgid "Currently, this feature supports the following models:"
 msgstr "当前,该功能支持以下模型:"
 
-#: ../../source/models/virtualenv.rst:55
+#: ../../source/models/virtualenv.rst:66
 msgid ":ref:`GOT-OCR2 <models_builtin_got-ocr2_0>`"
 msgstr ":ref:`GOT-OCR2 <models_builtin_got-ocr2_0>`"
 
-#: ../../source/models/virtualenv.rst:56
+#: ../../source/models/virtualenv.rst:67
 msgid ":ref:`Qwen2.5-omni <models_llm_qwen2.5-omni>`"
 msgstr ":ref:`Qwen2.5-omni <models_llm_qwen2.5-omni>`"
 
-#: ../../source/models/virtualenv.rst:59
+#: ../../source/models/virtualenv.rst:70
 msgid "Storage Location"
 msgstr "存储位置"
 
-#: ../../source/models/virtualenv.rst:61
+#: ../../source/models/virtualenv.rst:72
 msgid ""
 "By default, the model’s virtual environment is stored under path: :ref:`"
 "XINFERENCE_HOME <environments_xinference_home>` / virtualenv / {model_"
 "name}"
 msgstr ""
-"默认情况下,模型的虚拟环境存储在以下路径::ref:`XINFERENCE_HOME <environments_xinference_home>` / virtualenv / {model_name}"
+"默认情况下,模型的虚拟环境存储在以下路径::ref:`XINFERENCE_HOME <"
+"environments_xinference_home>` / virtualenv / {model_name}"
+
doc/source/models/builtin/llm/internvl3.rst (+14 -14)

@@ -20,7 +20,7 @@ Model Spec 1 (pytorch, 1 Billion)
 - **Model Format:** pytorch
 - **Model Size (in billions):** 1
 - **Quantizations:** 8-bit, none
-- **Engines**: Transformers
+- **Engines**: vLLM, Transformers
 - **Model ID:** OpenGVLab/InternVL3-1B
 - **Model Hubs**: `Hugging Face <https://huggingface.co/OpenGVLab/InternVL3-1B>`__, `ModelScope <https://modelscope.cn/models/OpenGVLab/InternVL3-1B>`__

@@ -36,7 +36,7 @@ Model Spec 2 (awq, 1 Billion)
 - **Model Format:** awq
 - **Model Size (in billions):** 1
 - **Quantizations:** Int4
-- **Engines**: Transformers
+- **Engines**: vLLM, Transformers
 - **Model ID:** OpenGVLab/InternVL3-1B-AWQ
 - **Model Hubs**: `Hugging Face <https://huggingface.co/OpenGVLab/InternVL3-1B-AWQ>`__, `ModelScope <https://modelscope.cn/models/OpenGVLab/InternVL3-1B-AWQ>`__

@@ -52,7 +52,7 @@ Model Spec 3 (pytorch, 2 Billion)
 - **Model Format:** pytorch
 - **Model Size (in billions):** 2
 - **Quantizations:** 8-bit, none
-- **Engines**: Transformers
+- **Engines**: vLLM, Transformers
 - **Model ID:** OpenGVLab/InternVL3-2B
 - **Model Hubs**: `Hugging Face <https://huggingface.co/OpenGVLab/InternVL3-2B>`__, `ModelScope <https://modelscope.cn/models/OpenGVLab/InternVL3-2B>`__

@@ -68,7 +68,7 @@ Model Spec 4 (awq, 2 Billion)
 - **Model Format:** awq
 - **Model Size (in billions):** 2
 - **Quantizations:** Int4
-- **Engines**: Transformers
+- **Engines**: vLLM, Transformers
 - **Model ID:** OpenGVLab/InternVL3-2B-AWQ
 - **Model Hubs**: `Hugging Face <https://huggingface.co/OpenGVLab/InternVL3-2B-AWQ>`__, `ModelScope <https://modelscope.cn/models/OpenGVLab/InternVL3-2B-AWQ>`__

@@ -84,7 +84,7 @@ Model Spec 5 (pytorch, 8 Billion)
 - **Model Format:** pytorch
 - **Model Size (in billions):** 8
 - **Quantizations:** 8-bit, none
-- **Engines**: Transformers
+- **Engines**: vLLM, Transformers
 - **Model ID:** OpenGVLab/InternVL3-8B
 - **Model Hubs**: `Hugging Face <https://huggingface.co/OpenGVLab/InternVL3-8B>`__, `ModelScope <https://modelscope.cn/models/OpenGVLab/InternVL3-8B>`__

@@ -100,7 +100,7 @@ Model Spec 6 (awq, 8 Billion)
 - **Model Format:** awq
 - **Model Size (in billions):** 8
 - **Quantizations:** Int4
-- **Engines**: Transformers
+- **Engines**: vLLM, Transformers
 - **Model ID:** OpenGVLab/InternVL3-8B-AWQ
 - **Model Hubs**: `Hugging Face <https://huggingface.co/OpenGVLab/InternVL3-8B-AWQ>`__, `ModelScope <https://modelscope.cn/models/OpenGVLab/InternVL3-8B-AWQ>`__

@@ -116,7 +116,7 @@ Model Spec 7 (pytorch, 9 Billion)
 - **Model Format:** pytorch
 - **Model Size (in billions):** 9
 - **Quantizations:** 8-bit, none
-- **Engines**: Transformers
+- **Engines**: vLLM, Transformers
 - **Model ID:** OpenGVLab/InternVL3-9B
 - **Model Hubs**: `Hugging Face <https://huggingface.co/OpenGVLab/InternVL3-9B>`__, `ModelScope <https://modelscope.cn/models/OpenGVLab/InternVL3-9B>`__

@@ -132,7 +132,7 @@ Model Spec 8 (awq, 9 Billion)
 - **Model Format:** awq
 - **Model Size (in billions):** 9
 - **Quantizations:** Int4
-- **Engines**: Transformers
+- **Engines**: vLLM, Transformers
 - **Model ID:** OpenGVLab/InternVL3-9B-AWQ
 - **Model Hubs**: `Hugging Face <https://huggingface.co/OpenGVLab/InternVL3-9B-AWQ>`__, `ModelScope <https://modelscope.cn/models/OpenGVLab/InternVL3-9B-AWQ>`__

@@ -148,7 +148,7 @@ Model Spec 9 (pytorch, 14 Billion)
 - **Model Format:** pytorch
 - **Model Size (in billions):** 14
 - **Quantizations:** 8-bit, none
-- **Engines**: Transformers
+- **Engines**: vLLM, Transformers
 - **Model ID:** OpenGVLab/InternVL3-14B
 - **Model Hubs**: `Hugging Face <https://huggingface.co/OpenGVLab/InternVL3-14B>`__, `ModelScope <https://modelscope.cn/models/OpenGVLab/InternVL3-14B>`__

@@ -164,7 +164,7 @@ Model Spec 10 (awq, 14 Billion)
 - **Model Format:** awq
 - **Model Size (in billions):** 14
 - **Quantizations:** Int4
-- **Engines**: Transformers
+- **Engines**: vLLM, Transformers
 - **Model ID:** OpenGVLab/InternVL3-14B-AWQ
 - **Model Hubs**: `Hugging Face <https://huggingface.co/OpenGVLab/InternVL3-14B-AWQ>`__, `ModelScope <https://modelscope.cn/models/OpenGVLab/InternVL3-14B-AWQ>`__

@@ -180,7 +180,7 @@ Model Spec 11 (pytorch, 38 Billion)
 - **Model Format:** pytorch
 - **Model Size (in billions):** 38
 - **Quantizations:** 8-bit, none
-- **Engines**: Transformers
+- **Engines**: vLLM, Transformers
 - **Model ID:** OpenGVLab/InternVL3-38B
 - **Model Hubs**: `Hugging Face <https://huggingface.co/OpenGVLab/InternVL3-38B>`__, `ModelScope <https://modelscope.cn/models/OpenGVLab/InternVL3-38B>`__

@@ -196,7 +196,7 @@ Model Spec 12 (awq, 38 Billion)
 - **Model Format:** awq
 - **Model Size (in billions):** 38
 - **Quantizations:** Int4
-- **Engines**: Transformers
+- **Engines**: vLLM, Transformers
 - **Model ID:** OpenGVLab/InternVL3-38B-AWQ
 - **Model Hubs**: `Hugging Face <https://huggingface.co/OpenGVLab/InternVL3-38B-AWQ>`__, `ModelScope <https://modelscope.cn/models/OpenGVLab/InternVL3-38B-AWQ>`__

@@ -212,7 +212,7 @@ Model Spec 13 (pytorch, 78 Billion)
 - **Model Format:** pytorch
 - **Model Size (in billions):** 78
 - **Quantizations:** 8-bit, none
-- **Engines**: Transformers
+- **Engines**: vLLM, Transformers
 - **Model ID:** OpenGVLab/InternVL3-78B
 - **Model Hubs**: `Hugging Face <https://huggingface.co/OpenGVLab/InternVL3-78B>`__, `ModelScope <https://modelscope.cn/models/OpenGVLab/InternVL3-78B>`__

@@ -228,7 +228,7 @@ Model Spec 14 (awq, 78 Billion)
 - **Model Format:** awq
 - **Model Size (in billions):** 78
 - **Quantizations:** Int4
-- **Engines**: Transformers
+- **Engines**: vLLM, Transformers
 - **Model ID:** OpenGVLab/InternVL3-78B-AWQ
 - **Model Hubs**: `Hugging Face <https://huggingface.co/OpenGVLab/InternVL3-78B-AWQ>`__, `ModelScope <https://modelscope.cn/models/OpenGVLab/InternVL3-78B-AWQ>`__

doc/source/models/builtin/llm/seallms-v3.rst (+2 -2)

@@ -20,7 +20,7 @@ Model Spec 1 (pytorch, 1_5 Billion)
 - **Model Format:** pytorch
 - **Model Size (in billions):** 1_5
 - **Quantizations:** none
-- **Engines**: Transformers
+- **Engines**: vLLM, Transformers
 - **Model ID:** SeaLLMs/SeaLLMs-v3-1.5B-Chat
 - **Model Hubs**: `Hugging Face <https://huggingface.co/SeaLLMs/SeaLLMs-v3-1.5B-Chat>`__, `ModelScope <https://modelscope.cn/models/SeaLLMs/SeaLLMs-v3-1.5B-Chat>`__

@@ -36,7 +36,7 @@ Model Spec 2 (pytorch, 7 Billion)
 - **Model Format:** pytorch
 - **Model Size (in billions):** 7
 - **Quantizations:** none
-- **Engines**: Transformers
+- **Engines**: vLLM, Transformers
 - **Model ID:** SeaLLMs/SeaLLMs-v3-7B-Chat
 - **Model Hubs**: `Hugging Face <https://huggingface.co/SeaLLMs/SeaLLMs-v3-7B-Chat>`__, `ModelScope <https://modelscope.cn/models/SeaLLMs/SeaLLMs-v3-7B-Chat>`__

doc/source/models/builtin/llm/skywork-or1-preview.rst (+3 -3)

@@ -20,7 +20,7 @@ Model Spec 1 (pytorch, 32 Billion)
 - **Model Format:** pytorch
 - **Model Size (in billions):** 32
 - **Quantizations:** none
-- **Engines**: Transformers
+- **Engines**: vLLM, Transformers
 - **Model ID:** Skywork/Skywork-OR1-32B-Preview
 - **Model Hubs**: `Hugging Face <https://huggingface.co/Skywork/Skywork-OR1-32B-Preview>`__, `ModelScope <https://modelscope.cn/models/Skywork/Skywork-OR1-32B-Preview>`__

@@ -36,7 +36,7 @@ Model Spec 2 (gptq, 32 Billion)
 - **Model Format:** gptq
 - **Model Size (in billions):** 32
 - **Quantizations:** Int4, int8
-- **Engines**: Transformers
+- **Engines**: vLLM, Transformers
 - **Model ID:** JunHowie/Skywork-OR1-32B-Preview-GPTQ-{quantization}
 - **Model Hubs**: `Hugging Face <https://huggingface.co/JunHowie/Skywork-OR1-32B-Preview-GPTQ-{quantization}>`__, `ModelScope <https://modelscope.cn/models/JunHowie/Skywork-OR1-32B-Preview-GPTQ-{quantization}>`__

@@ -52,7 +52,7 @@ Model Spec 3 (pytorch, 7 Billion)
 - **Model Format:** pytorch
 - **Model Size (in billions):** 7
 - **Quantizations:** none
-- **Engines**: Transformers
+- **Engines**: vLLM, Transformers
 - **Model ID:** Skywork/Skywork-OR1-7B-Preview
 - **Model Hubs**: `Hugging Face <https://huggingface.co/Skywork/Skywork-OR1-7B-Preview>`__, `ModelScope <https://modelscope.cn/models/Skywork/Skywork-OR1-7B-Preview>`__

doc/source/models/virtualenv.rst (+11)

@@ -21,6 +21,15 @@ Solution
 
 To address this issue, we have introduced the **Model Virtual Environment** feature.
 
+Install requirements for this functionality via
+
+.. code-block:: bash
+
+   # all
+   pip install 'xinference[all]'
+   # or virtualenv
+   pip install 'xinference[virtualenv]'
+
 Enable by setting environment variable ``XINFERENCE_ENABLE_VIRTUAL_ENV=1``.

@@ -37,6 +46,8 @@ Example usage:
 
 This feature requires internet access or a self-hosted PyPI mirror.
 
+Xinference will by default inherit the config for current pip.
+
 .. note::
 
    The model virtual environment feature is disabled by default (i.e., XINFERENCE_ENABLE_VIRTUAL_ENV is set to 0).
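The enable switch added in this file is just an environment variable read at load time. As a rough illustration only (a hypothetical helper, not Xinference's actual implementation), such a gate could look like:

```python
import os

def virtualenv_enabled(default: bool = False) -> bool:
    """Hypothetical sketch: report whether the model virtual environment
    feature is switched on via XINFERENCE_ENABLE_VIRTUAL_ENV.

    Per the docs in this commit, the feature is off by default (value 0)
    and will become on by default starting from Xinference v2.0.0.
    """
    raw = os.environ.get("XINFERENCE_ENABLE_VIRTUAL_ENV")
    if raw is None:
        return default
    return raw.strip().lower() in {"1", "true", "yes", "on"}

os.environ["XINFERENCE_ENABLE_VIRTUAL_ENV"] = "1"
print(virtualenv_enabled())  # True
```

The accepted value set beyond ``1`` is an assumption here; the documentation itself only mentions ``0`` and ``1``.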

doc/source/user_guide/backends.rst (+2)

@@ -78,6 +78,8 @@ Currently, supported model includes:
 - ``QwQ-32B-Preview``, ``QwQ-32B``
 - ``marco-o1``
 - ``fin-r1``
+- ``seallms-v3``
+- ``skywork-or1-preview``
 - ``gemma-it``, ``gemma-2-it``, ``gemma-3-1b-it``
 - ``orion-chat``, ``orion-chat-rag``
 - ``c4ai-command-r-v01``

setup.cfg (+2)

@@ -274,6 +274,8 @@ doc =
     timm
 benchmark =
     psutil
+virtualenv =
+    uv
 
 [options.entry_points]
 console_scripts =
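The new ``virtualenv`` extra (backed by ``uv``) pairs with the install instructions added to doc/source/models/virtualenv.rst in this same commit:

```shell
# install only the virtual-environment dependencies (pulls in uv)
pip install 'xinference[virtualenv]'
# or install everything
pip install 'xinference[all]'
```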

xinference/model/llm/llm_family.json (+3 -1)

@@ -7946,7 +7946,9 @@
         "virtualenv": {
             "packages": [
                 "git+https://github.com/huggingface/[email protected]",
-                "numpy==1.26.4"
+                "numpy==1.26.4",
+                "qwen_omni_utils",
+                "soundfile"
             ]
         }
     },
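A per-model ``virtualenv`` block like the one above lists the extra pip requirements to install into that model's isolated environment. A hypothetical sketch of how such a block might be consumed (field names mirror the JSON in this commit; the helper itself is not Xinference code, and the pinned ``transformers`` URL is omitted because its version is elided in the diff):

```python
import json

# Excerpt mirroring the "virtualenv" block added in this commit.
family_entry = json.loads("""
{
  "virtualenv": {
    "packages": [
      "numpy==1.26.4",
      "qwen_omni_utils",
      "soundfile"
    ]
  }
}
""")

def virtualenv_packages(entry: dict) -> list:
    """Collect the pip requirements to install into the model's own env.

    Returns an empty list when the model declares no "virtualenv" block.
    """
    return entry.get("virtualenv", {}).get("packages", [])

print(virtualenv_packages(family_entry))
# ['numpy==1.26.4', 'qwen_omni_utils', 'soundfile']
```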

xinference/model/llm/llm_family_modelscope.json (+3 -1)

@@ -5704,7 +5704,9 @@
         "virtualenv": {
             "packages": [
                 "git+https://github.com/huggingface/[email protected]",
-                "numpy==1.26.4"
+                "numpy==1.26.4",
+                "qwen_omni_utils",
+                "soundfile"
             ]
         }
     },

0 commit comments