TreeVGR-7B
license:apache-2.0
by HaochenWang
Image Model
OTHER
7B params
New
155 downloads
Early-stage
Edge AI: Mobile, Laptop, Server
16GB+ RAM
Quick Summary
This repository contains the TreeVGR-7B model, as presented in the paper Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology.
Device Compatibility
Mobile: 4-6GB RAM
Laptop: 16GB RAM
Server: GPU
Minimum Recommended: 7GB+ RAM
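As a rough cross-check of the RAM figures above, the sketch below estimates the memory needed just to hold the weights of a 7B-parameter model at a few common precisions. The parameter count comes from this page; the precisions, the helper name `weight_memory_gb`, and the choice to ignore activations, KV cache, and runtime overhead are assumptions made for illustration. The page does not state which precision each device tier assumes.

```python
# Rough estimate of weight-only memory for a model with a given parameter count.
# Assumption: memory = parameters * bytes per parameter. Activations, KV cache,
# and framework overhead are ignored, so real usage will be higher.

PARAMS = 7_000_000_000  # "7B params" from the model card (nominal, not exact)

# Bytes per parameter for a few common precisions (illustrative choices).
BYTES_PER_PARAM = {"fp16/bf16": 2.0, "int8": 1.0, "4-bit": 0.5}


def weight_memory_gb(num_params: int, bytes_per_param: float) -> float:
    """Return weight storage in decimal gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bytes_per_param / 1e9


if __name__ == "__main__":
    for name, nbytes in BYTES_PER_PARAM.items():
        print(f"{name:>9}: {weight_memory_gb(PARAMS, nbytes):.1f} GB")
```

Under these assumptions the estimates come to about 14 GB, 7 GB, and 3.5 GB. These are in the same range as the 16GB, 7GB+, and 4-6GB figures listed above, although that correspondence is a guess rather than something the page confirms.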
Code Examples
Installation (bash)
git clone https://github.com/Haochen-Wang409/TreeVGR
cd TreeVGR
pip3 install -r requirements.txt
pip3 install flash-attn --no-build-isolation -v
Usage (bash)
python3 inference_treebench.py
Output (text)
Perception/Attributes 18/29=62.07
Perception/Material 7/13=53.85
Perception/Physical State 19/23=82.61
Perception/Object Retrieval 10/16=62.5
Perception/OCR 42/68=61.76
Reasoning/Perspective Transform 19/85=22.35
Reasoning/Ordering 20/57=35.09
Reasoning/Contact and Occlusion 25/41=60.98
Reasoning/Spatial Containment 20/29=68.97
Reasoning/Comparison 20/44=45.45
==> Overall 200/405=49.38
==> Mean IoU: 43.3
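The per-category lines above can be checked against the reported overall score with a few lines of Python. The sketch below is not part of the TreeVGR repository: the regular expression and the helper names `parse_results` and `accuracy` are hypothetical, and the only data used is the ten category lines printed above.

```python
import re

# Per-category lines copied from the evaluation output above.
RESULTS = """\
Perception/Attributes 18/29=62.07
Perception/Material 7/13=53.85
Perception/Physical State 19/23=82.61
Perception/Object Retrieval 10/16=62.5
Perception/OCR 42/68=61.76
Reasoning/Perspective Transform 19/85=22.35
Reasoning/Ordering 20/57=35.09
Reasoning/Contact and Occlusion 25/41=60.98
Reasoning/Spatial Containment 20/29=68.97
Reasoning/Comparison 20/44=45.45
"""

# Matches "<group>/<category name> <correct>/<total>=<percent>".
LINE_RE = re.compile(
    r"^(?P<group>\w+)/(?P<name>.+?) (?P<correct>\d+)/(?P<total>\d+)=[\d.]+$"
)


def parse_results(text):
    """Return a list of (group, name, correct, total) tuples."""
    rows = []
    for line in text.strip().splitlines():
        m = LINE_RE.match(line.strip())
        if m:
            rows.append(
                (m["group"], m["name"], int(m["correct"]), int(m["total"]))
            )
    return rows


def accuracy(rows):
    """Pool counts across rows and return (correct, total, percent)."""
    correct = sum(r[2] for r in rows)
    total = sum(r[3] for r in rows)
    return correct, total, 100.0 * correct / total


if __name__ == "__main__":
    rows = parse_results(RESULTS)
    for group in ("Perception", "Reasoning"):
        c, t, pct = accuracy([r for r in rows if r[0] == group])
        print(f"{group}: {c}/{t}={pct:.2f}")
    c, t, pct = accuracy(rows)
    print(f"Overall: {c}/{t}={pct:.2f}")
```

Pooling the counts reproduces the reported 200/405, so the overall score is a question-weighted average rather than the mean of the ten category percentages (which would be roughly 55.6). Large categories such as Perspective Transform, with 85 questions, therefore pull the overall number down more than small ones.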