Capybara
License: MIT
By xgen-universe
Quick Summary
AI model with specialized capabilities.
Training Data Analysis
Good (6.0/10)
Researched training datasets used by Capybara with quality assessment
Specialized For
general
multilingual
Training Datasets (1)
c4
6/10
general
multilingual
Key Strengths
- Scale and Accessibility: 750GB of publicly available, filtered text
- Systematic Filtering: Documented heuristics enable reproducibility
- Language Diversity: Although English-only, the dataset captures diverse writing styles
Considerations
- English-Only: Limits multilingual applications
- Filtering Limitations: Offensive content and low-quality text remain despite filtering
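To make the "documented heuristics" point concrete, here is a minimal sketch of the kind of line-level and page-level filtering publicly described for C4: keep lines that end in terminal punctuation, drop very short lines, and reject pages that contain placeholder text or curly braces. The function name `clean_page` and both threshold values are assumptions chosen for illustration. This is not the dataset's actual pipeline, and it leaves out steps such as language detection, blocklists, and deduplication.

```python
import re

# Illustrative thresholds: assumptions for this sketch, not confirmed C4 values.
MIN_WORDS_PER_LINE = 5
MIN_SENTENCES_PER_PAGE = 3
TERMINAL_PUNCTUATION = (".", "!", "?", '"')


def clean_page(text, min_words=MIN_WORDS_PER_LINE, min_sentences=MIN_SENTENCES_PER_PAGE):
    """Return the filtered page text, or None if the whole page is rejected."""
    # Page-level rejections: placeholder text and likely source code.
    if "lorem ipsum" in text.lower() or "{" in text:
        return None

    kept = []
    for line in text.splitlines():
        line = line.strip()
        # Keep only lines that look like complete sentences.
        if not line.endswith(TERMINAL_PUNCTUATION):
            continue
        if len(line.split()) < min_words:
            continue
        kept.append(line)

    cleaned = "\n".join(kept)
    # Rough sentence count: each run of terminal punctuation counts once.
    num_sentences = len(re.findall(r"[.!?]+", cleaned))
    if num_sentences < min_sentences:
        return None
    return cleaned
```

Heuristics like these are cheap and reproducible, but they only look at surface form. That is consistent with the consideration above that offensive or low-quality text can still pass through.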
Code Examples
Installation

```bash
# Clone the repository
git clone https://github.com/xgen-universe/Capybara.git
cd Capybara

# Create environment
conda create -n capybara python=3.11 -y
conda activate capybara

# Install pytorch (torch 2.6.0 with CUDA 12.6)
pip install torch==2.6.0 torchvision==0.21.0 torchaudio==2.6.0 --index-url https://download.pytorch.org/whl/cu126

# Install dependencies
pip install -r requirements.txt

# [Optional] Install Flash Attention for faster inference
pip install flash_attn --no-build-isolation
```
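After running the installation steps, it can help to confirm that the pinned packages actually landed in the active environment. The following is a minimal sketch using only the Python standard library. The helper names (`installed_version`, `check_environment`, `cuda_available`) are hypothetical and not part of the Capybara repository. The expected versions are copied from the install command above.

```python
import importlib.util
from importlib import metadata

# Versions pinned by the install command; flash_attn is optional.
EXPECTED = {"torch": "2.6.0", "torchvision": "0.21.0", "torchaudio": "2.6.0"}
OPTIONAL = ["flash_attn"]


def installed_version(dist_name):
    """Return the installed version string, or None if the package is absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


def check_environment():
    """Return (report, problems) for the pinned and optional packages."""
    report = {name: installed_version(name) for name in list(EXPECTED) + OPTIONAL}
    problems = []
    for name, wanted in EXPECTED.items():
        found = report[name]
        if found is None:
            problems.append(f"{name} is not installed")
        # CUDA wheels report versions such as "2.6.0+cu126", so compare the prefix.
        elif not found.startswith(wanted):
            problems.append(f"{name} is {found}, expected {wanted}")
    return report, problems


def cuda_available():
    """Return torch.cuda.is_available(), or None when torch is not installed."""
    if importlib.util.find_spec("torch") is None:
        return None
    import torch
    return torch.cuda.is_available()
```

Calling `check_environment()` inside the `capybara` conda environment returns a dictionary of detected versions and a list of human-readable problems. An empty list suggests the pinned packages match. `cuda_available()` returning `False` usually points to a CPU-only wheel or a driver issue rather than a problem with the repository itself.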