Windows安装 OmniParser 2 屏幕解析模型

一、环境准备

1.安装 Python 建议使用版本：3.12

2.安装 Anaconda

Shell Session

winget install --Id Anaconda.Anaconda3

1.克隆项目

Shell Session

git clone <https://github.com/microsoft/OmniParser>
cd OmniParser

2.创建conda环境

Shell Session

conda create -n "omni" python==3.12
conda activate omni
pip install -r requirements.txt

3.下载视觉模型

Shell Session

git clone <https://huggingface.co/microsoft/OmniParser-v2.0> weights

4.修改模型路径

Python

"microsoft/Florence-2-base"  # 或huggingface其他模型

5.启动演示

Shell Session

python .\\gradio_demo.py

1.确保已安装CUDA驱动（未安装则默认使用CPU）
2.AMD显卡需确认ROCm环境配置
3.如遇路径错误，建议使用绝对路径
4.建议通过conda单独安装PyTorch：

Shell Session

conda install pytorch torchvision torchaudio -c pytorch

1.移除了冗余的clean命令（实测安装过程无需清理）
2.补充了AMD显卡的注意事项
3.添加了PyTorch独立安装建议
4.优化了路径说明，避免环境变量错误

遇到问题可优先检查：