TreeVGR-7B

155
4
license:apache-2.0
by
HaochenWang
Image Model
OTHER
7B params
New
155 downloads
Early-stage
Edge AI:
Mobile
Laptop
Server
16GB+ RAM
Mobile
Laptop
Server
Quick Summary

This repository contains the TreeVGR-7B model, as presented in the paper Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology.

Device Compatibility

Mobile
4-6GB RAM
Laptop
16GB RAM
Server
GPU
Minimum Recommended
7GB+ RAM

Code Examples

Installationbash
git clone https://github.com/Haochen-Wang409/TreeVGR
cd TreeVGR
pip3 install -r requirements.txt
pip3 install flash-attn --no-build-isolation -v
Installationbash
git clone https://github.com/Haochen-Wang409/TreeVGR
cd TreeVGR
pip3 install -r requirements.txt
pip3 install flash-attn --no-build-isolation -v
Installationbash
git clone https://github.com/Haochen-Wang409/TreeVGR
cd TreeVGR
pip3 install -r requirements.txt
pip3 install flash-attn --no-build-isolation -v
Installationbash
git clone https://github.com/Haochen-Wang409/TreeVGR
cd TreeVGR
pip3 install -r requirements.txt
pip3 install flash-attn --no-build-isolation -v
Installationbash
git clone https://github.com/Haochen-Wang409/TreeVGR
cd TreeVGR
pip3 install -r requirements.txt
pip3 install flash-attn --no-build-isolation -v
Installationbash
git clone https://github.com/Haochen-Wang409/TreeVGR
cd TreeVGR
pip3 install -r requirements.txt
pip3 install flash-attn --no-build-isolation -v
Installationbash
git clone https://github.com/Haochen-Wang409/TreeVGR
cd TreeVGR
pip3 install -r requirements.txt
pip3 install flash-attn --no-build-isolation -v
Installationbash
git clone https://github.com/Haochen-Wang409/TreeVGR
cd TreeVGR
pip3 install -r requirements.txt
pip3 install flash-attn --no-build-isolation -v
Installationbash
git clone https://github.com/Haochen-Wang409/TreeVGR
cd TreeVGR
pip3 install -r requirements.txt
pip3 install flash-attn --no-build-isolation -v
Installationbash
git clone https://github.com/Haochen-Wang409/TreeVGR
cd TreeVGR
pip3 install -r requirements.txt
pip3 install flash-attn --no-build-isolation -v
Installationbash
git clone https://github.com/Haochen-Wang409/TreeVGR
cd TreeVGR
pip3 install -r requirements.txt
pip3 install flash-attn --no-build-isolation -v
Installationbash
git clone https://github.com/Haochen-Wang409/TreeVGR
cd TreeVGR
pip3 install -r requirements.txt
pip3 install flash-attn --no-build-isolation -v
Installationbash
git clone https://github.com/Haochen-Wang409/TreeVGR
cd TreeVGR
pip3 install -r requirements.txt
pip3 install flash-attn --no-build-isolation -v
Installationbash
git clone https://github.com/Haochen-Wang409/TreeVGR
cd TreeVGR
pip3 install -r requirements.txt
pip3 install flash-attn --no-build-isolation -v
Installationbash
git clone https://github.com/Haochen-Wang409/TreeVGR
cd TreeVGR
pip3 install -r requirements.txt
pip3 install flash-attn --no-build-isolation -v
Installationbash
git clone https://github.com/Haochen-Wang409/TreeVGR
cd TreeVGR
pip3 install -r requirements.txt
pip3 install flash-attn --no-build-isolation -v
Installationbash
git clone https://github.com/Haochen-Wang409/TreeVGR
cd TreeVGR
pip3 install -r requirements.txt
pip3 install flash-attn --no-build-isolation -v
Installationbash
git clone https://github.com/Haochen-Wang409/TreeVGR
cd TreeVGR
pip3 install -r requirements.txt
pip3 install flash-attn --no-build-isolation -v
Installationbash
git clone https://github.com/Haochen-Wang409/TreeVGR
cd TreeVGR
pip3 install -r requirements.txt
pip3 install flash-attn --no-build-isolation -v
Installationbash
git clone https://github.com/Haochen-Wang409/TreeVGR
cd TreeVGR
pip3 install -r requirements.txt
pip3 install flash-attn --no-build-isolation -v
Installationbash
git clone https://github.com/Haochen-Wang409/TreeVGR
cd TreeVGR
pip3 install -r requirements.txt
pip3 install flash-attn --no-build-isolation -v
Installationbash
git clone https://github.com/Haochen-Wang409/TreeVGR
cd TreeVGR
pip3 install -r requirements.txt
pip3 install flash-attn --no-build-isolation -v
Installationbash
git clone https://github.com/Haochen-Wang409/TreeVGR
cd TreeVGR
pip3 install -r requirements.txt
pip3 install flash-attn --no-build-isolation -v
Installationbash
git clone https://github.com/Haochen-Wang409/TreeVGR
cd TreeVGR
pip3 install -r requirements.txt
pip3 install flash-attn --no-build-isolation -v
Installationbash
git clone https://github.com/Haochen-Wang409/TreeVGR
cd TreeVGR
pip3 install -r requirements.txt
pip3 install flash-attn --no-build-isolation -v
Installationbash
git clone https://github.com/Haochen-Wang409/TreeVGR
cd TreeVGR
pip3 install -r requirements.txt
pip3 install flash-attn --no-build-isolation -v
Installationbash
git clone https://github.com/Haochen-Wang409/TreeVGR
cd TreeVGR
pip3 install -r requirements.txt
pip3 install flash-attn --no-build-isolation -v
Installationbash
git clone https://github.com/Haochen-Wang409/TreeVGR
cd TreeVGR
pip3 install -r requirements.txt
pip3 install flash-attn --no-build-isolation -v
Installationbash
git clone https://github.com/Haochen-Wang409/TreeVGR
cd TreeVGR
pip3 install -r requirements.txt
pip3 install flash-attn --no-build-isolation -v
Installationbash
git clone https://github.com/Haochen-Wang409/TreeVGR
cd TreeVGR
pip3 install -r requirements.txt
pip3 install flash-attn --no-build-isolation -v
Installationbash
git clone https://github.com/Haochen-Wang409/TreeVGR
cd TreeVGR
pip3 install -r requirements.txt
pip3 install flash-attn --no-build-isolation -v
Usagebash
python3 inference_treebench.py
Usagebash
python3 inference_treebench.py
Usagebash
python3 inference_treebench.py
Usagebash
python3 inference_treebench.py
Usagebash
python3 inference_treebench.py
Usagebash
python3 inference_treebench.py
Usagebash
python3 inference_treebench.py
Usagebash
python3 inference_treebench.py
Usagebash
python3 inference_treebench.py
Usagebash
python3 inference_treebench.py
Usagebash
python3 inference_treebench.py
Usagebash
python3 inference_treebench.py
Usagebash
python3 inference_treebench.py
Usagebash
python3 inference_treebench.py
Usagebash
python3 inference_treebench.py
Usagebash
python3 inference_treebench.py
Usagebash
python3 inference_treebench.py
Usagebash
python3 inference_treebench.py
Usagebash
python3 inference_treebench.py
Usagebash
python3 inference_treebench.py
Usagebash
python3 inference_treebench.py
Usagebash
python3 inference_treebench.py
Usagebash
python3 inference_treebench.py
Usagebash
python3 inference_treebench.py
Usagebash
python3 inference_treebench.py
Usagebash
python3 inference_treebench.py
Usagebash
python3 inference_treebench.py
Usagebash
python3 inference_treebench.py
Usagebash
python3 inference_treebench.py
Usagebash
python3 inference_treebench.py
Usagebash
python3 inference_treebench.py
text
Perception/Attributes 18/29=62.07
Perception/Material 7/13=53.85
Perception/Physical State 19/23=82.61
Perception/Object Retrieval 10/16=62.5
Perception/OCR 42/68=61.76
Reasoning/Perspective Transform 19/85=22.35
Reasoning/Ordering 20/57=35.09
Reasoning/Contact and Occlusion 25/41=60.98
Reasoning/Spatial Containment 20/29=68.97
Reasoning/Comparison 20/44=45.45
==> Overall 200/405=49.38
==> Mean IoU: 43.3
text
Perception/Attributes 18/29=62.07
Perception/Material 7/13=53.85
Perception/Physical State 19/23=82.61
Perception/Object Retrieval 10/16=62.5
Perception/OCR 42/68=61.76
Reasoning/Perspective Transform 19/85=22.35
Reasoning/Ordering 20/57=35.09
Reasoning/Contact and Occlusion 25/41=60.98
Reasoning/Spatial Containment 20/29=68.97
Reasoning/Comparison 20/44=45.45
==> Overall 200/405=49.38
==> Mean IoU: 43.3
text
Perception/Attributes 18/29=62.07
Perception/Material 7/13=53.85
Perception/Physical State 19/23=82.61
Perception/Object Retrieval 10/16=62.5
Perception/OCR 42/68=61.76
Reasoning/Perspective Transform 19/85=22.35
Reasoning/Ordering 20/57=35.09
Reasoning/Contact and Occlusion 25/41=60.98
Reasoning/Spatial Containment 20/29=68.97
Reasoning/Comparison 20/44=45.45
==> Overall 200/405=49.38
==> Mean IoU: 43.3
text
Perception/Attributes 18/29=62.07
Perception/Material 7/13=53.85
Perception/Physical State 19/23=82.61
Perception/Object Retrieval 10/16=62.5
Perception/OCR 42/68=61.76
Reasoning/Perspective Transform 19/85=22.35
Reasoning/Ordering 20/57=35.09
Reasoning/Contact and Occlusion 25/41=60.98
Reasoning/Spatial Containment 20/29=68.97
Reasoning/Comparison 20/44=45.45
==> Overall 200/405=49.38
==> Mean IoU: 43.3
text
Perception/Attributes 18/29=62.07
Perception/Material 7/13=53.85
Perception/Physical State 19/23=82.61
Perception/Object Retrieval 10/16=62.5
Perception/OCR 42/68=61.76
Reasoning/Perspective Transform 19/85=22.35
Reasoning/Ordering 20/57=35.09
Reasoning/Contact and Occlusion 25/41=60.98
Reasoning/Spatial Containment 20/29=68.97
Reasoning/Comparison 20/44=45.45
==> Overall 200/405=49.38
==> Mean IoU: 43.3
text
Perception/Attributes 18/29=62.07
Perception/Material 7/13=53.85
Perception/Physical State 19/23=82.61
Perception/Object Retrieval 10/16=62.5
Perception/OCR 42/68=61.76
Reasoning/Perspective Transform 19/85=22.35
Reasoning/Ordering 20/57=35.09
Reasoning/Contact and Occlusion 25/41=60.98
Reasoning/Spatial Containment 20/29=68.97
Reasoning/Comparison 20/44=45.45
==> Overall 200/405=49.38
==> Mean IoU: 43.3
text
Perception/Attributes 18/29=62.07
Perception/Material 7/13=53.85
Perception/Physical State 19/23=82.61
Perception/Object Retrieval 10/16=62.5
Perception/OCR 42/68=61.76
Reasoning/Perspective Transform 19/85=22.35
Reasoning/Ordering 20/57=35.09
Reasoning/Contact and Occlusion 25/41=60.98
Reasoning/Spatial Containment 20/29=68.97
Reasoning/Comparison 20/44=45.45
==> Overall 200/405=49.38
==> Mean IoU: 43.3
text
Perception/Attributes 18/29=62.07
Perception/Material 7/13=53.85
Perception/Physical State 19/23=82.61
Perception/Object Retrieval 10/16=62.5
Perception/OCR 42/68=61.76
Reasoning/Perspective Transform 19/85=22.35
Reasoning/Ordering 20/57=35.09
Reasoning/Contact and Occlusion 25/41=60.98
Reasoning/Spatial Containment 20/29=68.97
Reasoning/Comparison 20/44=45.45
==> Overall 200/405=49.38
==> Mean IoU: 43.3
text
Perception/Attributes 18/29=62.07
Perception/Material 7/13=53.85
Perception/Physical State 19/23=82.61
Perception/Object Retrieval 10/16=62.5
Perception/OCR 42/68=61.76
Reasoning/Perspective Transform 19/85=22.35
Reasoning/Ordering 20/57=35.09
Reasoning/Contact and Occlusion 25/41=60.98
Reasoning/Spatial Containment 20/29=68.97
Reasoning/Comparison 20/44=45.45
==> Overall 200/405=49.38
==> Mean IoU: 43.3
text
Perception/Attributes 18/29=62.07
Perception/Material 7/13=53.85
Perception/Physical State 19/23=82.61
Perception/Object Retrieval 10/16=62.5
Perception/OCR 42/68=61.76
Reasoning/Perspective Transform 19/85=22.35
Reasoning/Ordering 20/57=35.09
Reasoning/Contact and Occlusion 25/41=60.98
Reasoning/Spatial Containment 20/29=68.97
Reasoning/Comparison 20/44=45.45
==> Overall 200/405=49.38
==> Mean IoU: 43.3
text
Perception/Attributes 18/29=62.07
Perception/Material 7/13=53.85
Perception/Physical State 19/23=82.61
Perception/Object Retrieval 10/16=62.5
Perception/OCR 42/68=61.76
Reasoning/Perspective Transform 19/85=22.35
Reasoning/Ordering 20/57=35.09
Reasoning/Contact and Occlusion 25/41=60.98
Reasoning/Spatial Containment 20/29=68.97
Reasoning/Comparison 20/44=45.45
==> Overall 200/405=49.38
==> Mean IoU: 43.3
text
Perception/Attributes 18/29=62.07
Perception/Material 7/13=53.85
Perception/Physical State 19/23=82.61
Perception/Object Retrieval 10/16=62.5
Perception/OCR 42/68=61.76
Reasoning/Perspective Transform 19/85=22.35
Reasoning/Ordering 20/57=35.09
Reasoning/Contact and Occlusion 25/41=60.98
Reasoning/Spatial Containment 20/29=68.97
Reasoning/Comparison 20/44=45.45
==> Overall 200/405=49.38
==> Mean IoU: 43.3
text
Perception/Attributes 18/29=62.07
Perception/Material 7/13=53.85
Perception/Physical State 19/23=82.61
Perception/Object Retrieval 10/16=62.5
Perception/OCR 42/68=61.76
Reasoning/Perspective Transform 19/85=22.35
Reasoning/Ordering 20/57=35.09
Reasoning/Contact and Occlusion 25/41=60.98
Reasoning/Spatial Containment 20/29=68.97
Reasoning/Comparison 20/44=45.45
==> Overall 200/405=49.38
==> Mean IoU: 43.3
text
Perception/Attributes 18/29=62.07
Perception/Material 7/13=53.85
Perception/Physical State 19/23=82.61
Perception/Object Retrieval 10/16=62.5
Perception/OCR 42/68=61.76
Reasoning/Perspective Transform 19/85=22.35
Reasoning/Ordering 20/57=35.09
Reasoning/Contact and Occlusion 25/41=60.98
Reasoning/Spatial Containment 20/29=68.97
Reasoning/Comparison 20/44=45.45
==> Overall 200/405=49.38
==> Mean IoU: 43.3
text
Perception/Attributes 18/29=62.07
Perception/Material 7/13=53.85
Perception/Physical State 19/23=82.61
Perception/Object Retrieval 10/16=62.5
Perception/OCR 42/68=61.76
Reasoning/Perspective Transform 19/85=22.35
Reasoning/Ordering 20/57=35.09
Reasoning/Contact and Occlusion 25/41=60.98
Reasoning/Spatial Containment 20/29=68.97
Reasoning/Comparison 20/44=45.45
==> Overall 200/405=49.38
==> Mean IoU: 43.3
text
Perception/Attributes 18/29=62.07
Perception/Material 7/13=53.85
Perception/Physical State 19/23=82.61
Perception/Object Retrieval 10/16=62.5
Perception/OCR 42/68=61.76
Reasoning/Perspective Transform 19/85=22.35
Reasoning/Ordering 20/57=35.09
Reasoning/Contact and Occlusion 25/41=60.98
Reasoning/Spatial Containment 20/29=68.97
Reasoning/Comparison 20/44=45.45
==> Overall 200/405=49.38
==> Mean IoU: 43.3
text
Perception/Attributes 18/29=62.07
Perception/Material 7/13=53.85
Perception/Physical State 19/23=82.61
Perception/Object Retrieval 10/16=62.5
Perception/OCR 42/68=61.76
Reasoning/Perspective Transform 19/85=22.35
Reasoning/Ordering 20/57=35.09
Reasoning/Contact and Occlusion 25/41=60.98
Reasoning/Spatial Containment 20/29=68.97
Reasoning/Comparison 20/44=45.45
==> Overall 200/405=49.38
==> Mean IoU: 43.3
text
Perception/Attributes 18/29=62.07
Perception/Material 7/13=53.85
Perception/Physical State 19/23=82.61
Perception/Object Retrieval 10/16=62.5
Perception/OCR 42/68=61.76
Reasoning/Perspective Transform 19/85=22.35
Reasoning/Ordering 20/57=35.09
Reasoning/Contact and Occlusion 25/41=60.98
Reasoning/Spatial Containment 20/29=68.97
Reasoning/Comparison 20/44=45.45
==> Overall 200/405=49.38
==> Mean IoU: 43.3
text
Perception/Attributes 18/29=62.07
Perception/Material 7/13=53.85
Perception/Physical State 19/23=82.61
Perception/Object Retrieval 10/16=62.5
Perception/OCR 42/68=61.76
Reasoning/Perspective Transform 19/85=22.35
Reasoning/Ordering 20/57=35.09
Reasoning/Contact and Occlusion 25/41=60.98
Reasoning/Spatial Containment 20/29=68.97
Reasoning/Comparison 20/44=45.45
==> Overall 200/405=49.38
==> Mean IoU: 43.3
text
Perception/Attributes 18/29=62.07
Perception/Material 7/13=53.85
Perception/Physical State 19/23=82.61
Perception/Object Retrieval 10/16=62.5
Perception/OCR 42/68=61.76
Reasoning/Perspective Transform 19/85=22.35
Reasoning/Ordering 20/57=35.09
Reasoning/Contact and Occlusion 25/41=60.98
Reasoning/Spatial Containment 20/29=68.97
Reasoning/Comparison 20/44=45.45
==> Overall 200/405=49.38
==> Mean IoU: 43.3
text
Perception/Attributes 18/29=62.07
Perception/Material 7/13=53.85
Perception/Physical State 19/23=82.61
Perception/Object Retrieval 10/16=62.5
Perception/OCR 42/68=61.76
Reasoning/Perspective Transform 19/85=22.35
Reasoning/Ordering 20/57=35.09
Reasoning/Contact and Occlusion 25/41=60.98
Reasoning/Spatial Containment 20/29=68.97
Reasoning/Comparison 20/44=45.45
==> Overall 200/405=49.38
==> Mean IoU: 43.3
text
Perception/Attributes 18/29=62.07
Perception/Material 7/13=53.85
Perception/Physical State 19/23=82.61
Perception/Object Retrieval 10/16=62.5
Perception/OCR 42/68=61.76
Reasoning/Perspective Transform 19/85=22.35
Reasoning/Ordering 20/57=35.09
Reasoning/Contact and Occlusion 25/41=60.98
Reasoning/Spatial Containment 20/29=68.97
Reasoning/Comparison 20/44=45.45
==> Overall 200/405=49.38
==> Mean IoU: 43.3
text
Perception/Attributes 18/29=62.07
Perception/Material 7/13=53.85
Perception/Physical State 19/23=82.61
Perception/Object Retrieval 10/16=62.5
Perception/OCR 42/68=61.76
Reasoning/Perspective Transform 19/85=22.35
Reasoning/Ordering 20/57=35.09
Reasoning/Contact and Occlusion 25/41=60.98
Reasoning/Spatial Containment 20/29=68.97
Reasoning/Comparison 20/44=45.45
==> Overall 200/405=49.38
==> Mean IoU: 43.3
text
Perception/Attributes 18/29=62.07
Perception/Material 7/13=53.85
Perception/Physical State 19/23=82.61
Perception/Object Retrieval 10/16=62.5
Perception/OCR 42/68=61.76
Reasoning/Perspective Transform 19/85=22.35
Reasoning/Ordering 20/57=35.09
Reasoning/Contact and Occlusion 25/41=60.98
Reasoning/Spatial Containment 20/29=68.97
Reasoning/Comparison 20/44=45.45
==> Overall 200/405=49.38
==> Mean IoU: 43.3
text
Perception/Attributes 18/29=62.07
Perception/Material 7/13=53.85
Perception/Physical State 19/23=82.61
Perception/Object Retrieval 10/16=62.5
Perception/OCR 42/68=61.76
Reasoning/Perspective Transform 19/85=22.35
Reasoning/Ordering 20/57=35.09
Reasoning/Contact and Occlusion 25/41=60.98
Reasoning/Spatial Containment 20/29=68.97
Reasoning/Comparison 20/44=45.45
==> Overall 200/405=49.38
==> Mean IoU: 43.3
text
Perception/Attributes 18/29=62.07
Perception/Material 7/13=53.85
Perception/Physical State 19/23=82.61
Perception/Object Retrieval 10/16=62.5
Perception/OCR 42/68=61.76
Reasoning/Perspective Transform 19/85=22.35
Reasoning/Ordering 20/57=35.09
Reasoning/Contact and Occlusion 25/41=60.98
Reasoning/Spatial Containment 20/29=68.97
Reasoning/Comparison 20/44=45.45
==> Overall 200/405=49.38
==> Mean IoU: 43.3
text
Perception/Attributes 18/29=62.07
Perception/Material 7/13=53.85
Perception/Physical State 19/23=82.61
Perception/Object Retrieval 10/16=62.5
Perception/OCR 42/68=61.76
Reasoning/Perspective Transform 19/85=22.35
Reasoning/Ordering 20/57=35.09
Reasoning/Contact and Occlusion 25/41=60.98
Reasoning/Spatial Containment 20/29=68.97
Reasoning/Comparison 20/44=45.45
==> Overall 200/405=49.38
==> Mean IoU: 43.3
text
Perception/Attributes 18/29=62.07
Perception/Material 7/13=53.85
Perception/Physical State 19/23=82.61
Perception/Object Retrieval 10/16=62.5
Perception/OCR 42/68=61.76
Reasoning/Perspective Transform 19/85=22.35
Reasoning/Ordering 20/57=35.09
Reasoning/Contact and Occlusion 25/41=60.98
Reasoning/Spatial Containment 20/29=68.97
Reasoning/Comparison 20/44=45.45
==> Overall 200/405=49.38
==> Mean IoU: 43.3
text
Perception/Attributes 18/29=62.07
Perception/Material 7/13=53.85
Perception/Physical State 19/23=82.61
Perception/Object Retrieval 10/16=62.5
Perception/OCR 42/68=61.76
Reasoning/Perspective Transform 19/85=22.35
Reasoning/Ordering 20/57=35.09
Reasoning/Contact and Occlusion 25/41=60.98
Reasoning/Spatial Containment 20/29=68.97
Reasoning/Comparison 20/44=45.45
==> Overall 200/405=49.38
==> Mean IoU: 43.3
text
Perception/Attributes 18/29=62.07
Perception/Material 7/13=53.85
Perception/Physical State 19/23=82.61
Perception/Object Retrieval 10/16=62.5
Perception/OCR 42/68=61.76
Reasoning/Perspective Transform 19/85=22.35
Reasoning/Ordering 20/57=35.09
Reasoning/Contact and Occlusion 25/41=60.98
Reasoning/Spatial Containment 20/29=68.97
Reasoning/Comparison 20/44=45.45
==> Overall 200/405=49.38
==> Mean IoU: 43.3
text
Perception/Attributes 18/29=62.07
Perception/Material 7/13=53.85
Perception/Physical State 19/23=82.61
Perception/Object Retrieval 10/16=62.5
Perception/OCR 42/68=61.76
Reasoning/Perspective Transform 19/85=22.35
Reasoning/Ordering 20/57=35.09
Reasoning/Contact and Occlusion 25/41=60.98
Reasoning/Spatial Containment 20/29=68.97
Reasoning/Comparison 20/44=45.45
==> Overall 200/405=49.38
==> Mean IoU: 43.3

Deploy This Model

Production-ready deployment in minutes

Together.ai

Instant API access to this model

Fastest API

Production-ready inference API. Start free, scale to millions.

Try Free API

Replicate

One-click model deployment

Easiest Setup

Run models in the cloud with simple API. No DevOps required.

Deploy Now

Disclosure: We may earn a commission from these partners. This helps keep LLMYourWay free.