thuml

16 models • 1 total models in database
Sort by:

sundial-base-128m

This model specializes in time series forecasting, utilizing 128 million parameters. It is trained on diverse datasets, including UTSD and Salesforce's large datasets, to provide accurate predictions for various time-dependent data.

license:apache-2.0
1,188,486
70

timer-base-84m

license:apache-2.0
5,540
59

rt1-frame-tokenizer

license:mit
27
0

rt1-compressive-tokenizer

license:mit
20
0

rt1-world-model-single-step-rlvr

See https://github.com/thuml/RLVR-World for examples for using this model.

NaNK
llama
17
0

rt1-world-model-multi-step-base

llama
5
0

Thoth-30B-A3B

NaNK
license:apache-2.0
2
1

rt1-world-model-multi-step-rlvr

See https://github.com/thuml/RLVR-World for examples for using this model.

NaNK
llama
2
0

rt1-world-model-single-step-base

See https://github.com/thuml/RLVR-World for examples for using this model.

llama
2
0

bytesized32-world-model-rlvr-task-specific-reward

See https://github.com/thuml/RLVR-World for examples for using this model. ``` @article{wu2025rlvr, title={RLVR-World: Training World Models with Reinforcement Learning}, author={Jialong Wu and Shaofeng Yin and Ningya Feng and Mingsheng Long}, journal={arXiv preprint arXiv:2505.13934}, year={2025}, }

NaNK
license:mit
2
0

webarena-world-model-sft

See https://github.com/thuml/RLVR-World for examples for using this model. ``` @article{wu2025rlvr, title={RLVR-World: Training World Models with Reinforcement Learning}, author={Jialong Wu and Shaofeng Yin and Ningya Feng and Mingsheng Long}, journal={arXiv preprint arXiv:2505.13934}, year={2025}, }

NaNK
license:mit
1
0

webarena-world-model-rlvr

See https://github.com/thuml/RLVR-World for examples for using this model. ``` @article{wu2025rlvr, title={RLVR-World: Training World Models with Reinforcement Learning}, author={Jialong Wu and Shaofeng Yin and Ningya Feng and Mingsheng Long}, journal={arXiv preprint arXiv:2505.13934}, year={2025}, }

NaNK
license:mit
1
0

bytesized32-world-model-sft

NaNK
license:mit
1
0

bytesized32-world-model-rlvr-binary-reward

See https://github.com/thuml/RLVR-World for examples for using this model. ``` @article{wu2025rlvr, title={RLVR-World: Training World Models with Reinforcement Learning}, author={Jialong Wu and Shaofeng Yin and Ningya Feng and Mingsheng Long}, journal={arXiv preprint arXiv:2505.13934}, year={2025}, }

NaNK
license:mit
1
0

MiniVeo3-Reasoner-Maze-5B

NaNK
license:mit
0
3

ivideogpt-oxe-64-act-free

license:mit
0
2