ibm-research

314 models

flowstate

license:apache-2.0
694,898
5

PowerMoE-3b

---
pipeline_tag: text-generation
inference: false
license: apache-2.0
library_name: transformers
model-index:
- name: ibm/PowerMoE-3b
  results:
  - task:
      type: text-generation
    dataset:
      type: lm-eval-harness
      name: ARC
    metrics:
    - name: accuracy-norm
      type: accuracy-norm
      value: 58.1
      verified: false
  - task:
      type: text-generation
    dataset:
      type: lm-eval-harness
      name: BoolQ
    metrics:
    - name: accuracy
      type: accuracy
      value: 65.0
      verified: false
  - task:
      type: text-generation
    dataset:
      type: lm-eval-harness
      nam

license:apache-2.0
277,959
13

MoLFormer-XL-both-10pct

---
license: apache-2.0
library_name: transformers
pipeline_tag: feature-extraction
tags:
- chemistry
---
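MoLFormer is tagged for feature extraction, i.e. it maps a chemical input to an embedding vector. A common downstream use of such embeddings is comparing molecules by cosine similarity; the sketch below uses small placeholder vectors rather than real MoLFormer outputs.

```python
import math

def cosine_similarity(a, b):
    # Cosine similarity between two embedding vectors.
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

# Placeholder embeddings standing in for MoLFormer outputs
# for two molecules (hypothetical values, not real model output).
emb_mol_a = [0.12, -0.40, 0.88, 0.05]
emb_mol_b = [0.10, -0.35, 0.90, 0.02]

print(round(cosine_similarity(emb_mol_a, emb_mol_b), 3))
```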

license:apache-2.0
181,284
29

materials.selfies-ted

selfies-ted is a transformer-based encoder-decoder model for molecular representations using SELFIES. Papers:
- SELFIES-TED: A Robust Transformer Model for Molecular Representation using SELFIES
- SELF-BART: A Transformer-based Molecular Representation Model using SELFIES
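SELFIES strings are sequences of bracketed symbols, which makes tokenization for a transformer straightforward; a minimal tokenizer sketch (independent of the model's actual tokenizer):

```python
import re

def tokenize_selfies(selfies_string):
    # Each SELFIES symbol is enclosed in square brackets,
    # so tokens can be recovered with a simple regex.
    return re.findall(r"\[[^\]]*\]", selfies_string)

# A SELFIES string for acetic acid, CC(=O)O (shown for illustration).
tokens = tokenize_selfies("[C][C][=Branch1][C][=O][O]")
print(tokens)
```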

license:apache-2.0
61,092
9

materials.smi-ted

license:apache-2.0
30,186
31

ttm-r3

license:cc-by-nc-sa-4.0
28,679
1

patchtst-fm-r1

license:cc-by-nc-sa-4.0
26,675
7

PowerLM-3b

license:apache-2.0
24,091
20

regen-disambiguation

6,399
3

moe-7b-1b-active-shared-experts

license:apache-2.0
6,082
4

CTI-BERT

CTI-BERT is a pre-trained language model for the cybersecurity domain. It was trained on a large corpus of security-related text comprising approximately 1.2 billion tokens drawn from diverse sources, including security news articles, vulnerability descriptions, books, academic publications, and security-related Wikipedia pages. For additional technical details and the model's performance metrics, please refer to the paper. The model has a vocabulary of 50,000 tokens and a sequence length of 256. Both the tokenizer and the BERT model were trained from scratch with the `run_mlm` script using the masked language modeling (MLM) objective. You can use the model for masked language modeling or token-embedding generation, but it is primarily intended to be fine-tuned on a downstream task such as sequence classification, text classification, or question answering. The model has shown improved performance on various cybersecurity text-classification tasks; however, it is not designed to be used as the main model for general-domain text. The following hyperparameters were used during training:
- learning_rate: 0.0005
- train_batch_size: 128
- eval_batch_size: 128
- seed: 42
- gradient_accumulation_steps: 16
- total_train_batch_size: 2048
- optimizer: Adam with betas=(0.9, 0.98) and epsilon=1e-06
- lr_scheduler_type: linear
- lr_scheduler_warmup_steps: 10000
- training_steps: 200000

Framework versions:
- Transformers 4.18.0
- PyTorch 1.12.1+cu102
- Datasets 2.4.0
- Tokenizers 0.12.1
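The MLM pretraining objective replaces a fraction of input tokens with a mask token and trains the model to recover them. A minimal sketch of the masking step (illustrative only, not the actual `run_mlm` implementation):

```python
import random

def mask_tokens(tokens, mask_prob=0.15, mask_token="[MASK]", seed=42):
    # Replace ~mask_prob of the tokens with the mask token;
    # labels keep the original token only at masked positions.
    rng = random.Random(seed)
    inputs, labels = [], []
    for tok in tokens:
        if rng.random() < mask_prob:
            inputs.append(mask_token)
            labels.append(tok)     # the model must predict this token
        else:
            inputs.append(tok)
            labels.append(None)    # position ignored in the loss
    return inputs, labels

inputs, labels = mask_tokens("the attacker exploited a buffer overflow".split())
print(inputs)
print(labels)
```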

3,818
6

test-patchtst

license:apache-2.0
1,431
0

test-ttm-v1

license:apache-2.0
1,270
0

ttm-research-r2

license:cc-by-nc-sa-4.0
1,230
3

test-patchtsmixer

license:apache-2.0
1,020
0

granite-3.2-8b-instruct-GGUF

license:apache-2.0
985
8

granite-3.2-2b-instruct-GGUF

license:apache-2.0
888
9

GP-MoLFormer-Uniq

license:apache-2.0
883
3

granite-vision-3.2-2b-GGUF

license:apache-2.0
757
11

smxm

428
0

testing-patchtst_etth1_pretrain

370
0

patchtsmixer-etth1-pretrain

299
2

patchtst-etth1-regression-distribution

295
2

patchtsmixer-etth1-generate

289
1

re2g-reranker-nq

license:apache-2.0
216
16

materials.mhg-ged

This repository provides PyTorch source code associated with our publication, "MHG-GNN: Combination of Molecular Hypergraph Grammar with Graph Neural Network". We present MHG-GNN, an autoencoder architecture with an encoder based on a GNN and a decoder based on a sequential model with MHG. Since the encoder is a GNN variant, MHG-GNN can accept any molecule as input and demonstrates high predictive performance on molecular graph data. In addition, the decoder inherits the theoretical guarantee of MHG of always generating a structurally valid molecule as output.

1. Getting Started
   1. Pretrained Models and Training Logs
   2. Installation
2. Feature Extraction

This code and environment have been tested on Intel E5-2667 CPUs at 3.30 GHz and NVIDIA A100 Tensor Core GPUs. We provide checkpoints of the MHG-GNN model pre-trained on a dataset of ~1.34M molecules curated from PubChem. For model weights: [HuggingFace Link]() Add the MHG-GNN pre-trained `weights.pt` to the `models/` directory according to your needs. We recommend creating a virtual environment. Type the following command once the virtual environment is activated: The example notebook `mhg-gnn_encoder_decoder_example.ipynb` contains code to load checkpoint files and use the pre-trained model for encoder and decoder tasks. For the decoder, you can use the provided function to map embeddings back to SMILES strings.

license:apache-2.0
208
4

materials.pos-egnn

license:apache-2.0
170
8

granite-guardian-3.2-3b-a800m-GGUF

license:apache-2.0
161
1

patchtst-etth1-pretrain

license:apache-2.0
136
3

gpt2-medium-multiexit

license:mit
115
1

ColD-Fusion

license:mit
104
12

biomed.sm.mv-te-84m

license:apache-2.0
100
19

merlinite-7b

license:apache-2.0
96
104

merlinite-7b-GGUF

license:apache-2.0
84
4

materials.selfies-ted2m

license:apache-2.0
78
2

testing-patchtst_etth1_forecast

72
0

qcpg-sentences

license:apache-2.0
57
17

knowgl-large

license:cc-by-nc-sa-4.0
45
84

biomed.omics.bl.sm.ma-ted-458m

license:apache-2.0
41
25

materials.3dgrid_vqgan

license:apache-2.0
35
0

materials.smi_ssed

SMILES-based State-Space Encoder-Decoder (SMI-SSED) - MoLMamba

This repository provides PyTorch source code associated with our publication, "A Mamba-Based Foundation Model for Chemistry". For more information contact: [email protected] or [email protected].

We present a Mamba-based encoder-decoder chemical foundation model, SMILES-based State-Space Encoder-Decoder (SMI-SSED), pre-trained on a curated dataset of 91 million SMILES samples sourced from PubChem, equivalent to 4 billion molecular tokens. SMI-SSED supports various complex tasks, including quantum property prediction, with two main variants ($336M$ and $8 \times 336M$ parameters). Our experiments across multiple benchmark datasets demonstrate state-of-the-art performance on various tasks.

Checkpoints:
- PyTorch (`.pt`): smi_ssed_130.pt
- safetensors (`.bin`): smi_ssed_130.bin

1. Getting Started
   1. Pretrained Models and Training Logs
   2. Replicating Conda Environment
2. Pretraining
3. Finetuning
4. Feature Extraction

This code and environment have been tested on NVIDIA V100s and NVIDIA A100s. We provide checkpoints of the SMI-SSED model pre-trained on a dataset of ~91M molecules curated from PubChem. The pre-trained model shows competitive performance on classification and regression benchmarks from MoleculeNet.

Add the SMI-SSED pre-trained `weights.pt` to the `inference/` or `finetune/` directory according to your needs. The directory structure should look like the following: Follow these steps to replicate our Conda environment and install the necessary libraries.

For pretraining, we use two strategies: the masked-language-model method to train the encoder, and an encoder-decoder strategy to refine SMILES reconstruction and improve the generated latent space. SMI-SSED is pre-trained on canonicalized and curated 91M SMILES from PubChem with the following constraints:
- Compounds are filtered to a maximum length of 202 tokens during preprocessing.
- A 95/5/0 split is used for encoder training, with 5% of the data for decoder pretraining.
- A 100/0/0 split is also used to train the encoder and decoder directly, enhancing model performance.

The pretraining code provides examples of data processing and model training on a smaller dataset, requiring 8 A100 GPUs. Use `train_model_D.py` to train only the decoder or `train_model_ED.py` to train both the encoder and decoder.

The finetuning datasets and environment can be found in the finetune directory. After setting up the environment, you can run a finetuning task; training/checkpointing resources will be available in directories named `checkpoint `. The example notebook `smi_ssed_encoder_decoder_example.ipynb` contains code to load checkpoint files and use the pre-trained model for encoder and decoder tasks, including examples of classification and regression tasks. For model weights: HuggingFace Link. For the decoder, you can use the provided function to map embeddings back to SMILES strings.
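The 95/5 encoder/decoder data split described above can be sketched as a simple deterministic partition (illustrative only; the actual pipeline's split logic may differ):

```python
def split_dataset(samples, encoder_frac=0.95):
    # Deterministically partition SMILES samples: the first 95%
    # for encoder (MLM) training, the remaining 5% for decoder pretraining.
    cut = int(len(samples) * encoder_frac)
    return samples[:cut], samples[cut:]

smiles = ["C" * (i + 1) for i in range(100)]  # toy stand-in for 91M SMILES
encoder_set, decoder_set = split_dataset(smiles)
print(len(encoder_set), len(decoder_set))  # 95 5
```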

license:apache-2.0
33
7

biomed.sm.mv-te-84m-MoleculeNet-ligand_scaffold-BACE-101

license:apache-2.0
32
3

biomed.rna.llama.47m.wced.multitask.v1

LLaMa
31
4

granite-guardian-3.2-5b-GGUF

license:apache-2.0
31
1

biomed.omics.bl.sm.ma-ted-458m.tcr_epitope_bind

license:apache-2.0
28
3

biomed.rna.llama.32m.mlm.multitask.v1

LLaMa
25
2

trajcast.models-arxiv2025

license:apache-2.0
18
2

testing-patchtst_etth1_regression

16
1

biomed.sm.mv-te-84m-MoleculeNet-ligand_scaffold-CLINTOX-101

license:apache-2.0
16
1

biomed.sm.mv-te-84m-MoleculeNet-ligand_scaffold-LIPOPHILICITY-101

license:apache-2.0
15
1

ColD-Fusion-itr12-seed1

license:mit
15
0

roberta-large-vira-intents

14
1

ColD-Fusion-itr19-seed1

license:mit
14
0

ColD-Fusion-bert-base-uncased-itr11-seed0

license:mit
14
0

biomed.sm.mv-te-84m-MoleculeNet-ligand_scaffold-HIV-101

license:apache-2.0
13
1

ColD-Fusion-itr21-seed3

license:mit
13
0

ColD-Fusion-bert-base-uncased-itr9-seed0

license:mit
13
0

ColD-Fusion-bert-base-uncased-itr15-seed0

license:mit
13
0

ColD-Fusion-bert-base-uncased-itr17-seed0

license:mit
13
0

ColD-Fusion-bert-base-uncased-itr27-seed0

license:mit
13
0

biomed.sm.mv-te-84m-MoleculeNet-ligand_scaffold-QM7-101

license:apache-2.0
12
1

ColD-Fusion-itr9-seed0

license:mit
12
0

ColD-Fusion-itr11-seed2

license:mit
12
0

ColD-Fusion-itr15-seed3

license:mit
12
0

ColD-Fusion-bert-base-uncased-itr10-seed0

license:mit
12
0

ColD-Fusion-bert-base-uncased-itr12-seed0

license:mit
12
0

ColD-Fusion-bert-base-uncased-itr13-seed0

license:mit
12
0

ColD-Fusion-bert-base-uncased-itr1-seed0

license:mit
12
0

ColD-Fusion-bert-base-uncased-itr20-seed0

license:mit
12
0

ColD-Fusion-bert-base-uncased-itr23-seed0

license:mit
12
0

ColD-Fusion-bert-base-uncased-itr26-seed0

license:mit
12
0

ColD-Fusion-bert-base-uncased-itr4-seed0

license:mit
12
0

biomed.sm.mv-te-84m-MoleculeNet-ligand_scaffold-FREESOLV-101

license:apache-2.0
11
1

ColD-Fusion-itr11-seed4

license:mit
11
0

ColD-Fusion-itr13-seed0

license:mit
11
0

ColD-Fusion-itr15-seed4

license:mit
11
0

ColD-Fusion-itr16-seed3

license:mit
11
0

ColD-Fusion-itr18-seed2

license:mit
11
0

ColD-Fusion-itr27-seed1

license:mit
11
0

ColD-Fusion-itr28-seed1

license:mit
11
0

ColD-Fusion-bert-base-uncased-itr14-seed0

license:mit
11
0

ColD-Fusion-bert-base-uncased-itr22-seed0

license:mit
11
0

ColD-Fusion-bert-base-uncased-itr25-seed0

license:mit
11
0

ColD-Fusion-bert-base-uncased-itr2-seed0

license:mit
11
0

ColD-Fusion-bert-base-uncased-itr3-seed0

license:mit
11
0

ColD-Fusion-bert-base-uncased-itr6-seed0

license:mit
11
0

ColD-Fusion-bert-base-uncased-itr7-seed0

license:mit
11
0

biomed.sm.mv-te-84m-MoleculeNet-ligand_scaffold-MUV-101

license:apache-2.0
10
2

biomed.sm.mv-te-84m-MoleculeNet-ligand_scaffold-BBBP-101

license:apache-2.0
10
1

ColD-Fusion-bert-base-uncased-itr28-seed0

license:mit
10
0

re2g-reranker-trex

license:apache-2.0
9
7

biomed.sm.mv-te-84m-MoleculeNet-ligand_scaffold-ESOL-101

license:apache-2.0
9
1

ColD-Fusion-bert-base-uncased-itr21-seed0

license:mit
9
0

ColD-Fusion-bert-base-uncased-itr0-seed0

license:mit
9
0

MoLM-700M-8B

license:apache-2.0
8
15

DAC.speech.v1.0

8
14

biomed.omics.bl.sm.ma-ted-458m.dti_bindingdb_pkd

Accurate prediction of drug-target binding affinity is essential in the early stages of drug discovery. This is an example of finetuning `ibm/biomed.omics.bl.sm.ma-ted-458m` for this task: prediction of binding affinity as pKd, the negative logarithm of the dissociation constant, which reflects the strength of the interaction between a small molecule (drug) and a protein (target). The expected inputs to the model are the amino-acid sequence of the target and the SMILES representation of the drug. The benchmark used for fine-tuning is defined at `https://tdcommons.ai/multi_pred_tasks/dti/`. We also harmonize the values using `data.harmonize_affinities(mode='max_affinity')` and transform them to log scale. By default, we use the Drug+Target cold split, as provided by tdcommons.
- Developers: IBM Research
- GitHub Repository: https://github.com/BiomedSciAI/biomed-multi-alignment
- Paper: https://arxiv.org/abs/2410.22367
- Release Date: Oct 28th, 2024
- License: Apache 2.0

Using `ibm/biomed.omics.bl.sm.ma-ted-458m` requires installing https://github.com/BiomedSciAI/biomed-multi-alignment. A simple example for a task already supported by `ibm/biomed.omics.bl.sm.ma-ted-458m` is provided there; for more advanced usage, see our detailed example on `https://github.com/BiomedSciAI/biomed-multi-alignment`. If you found our work useful, please consider giving a star to the repo and citing our paper.
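The pKd label described above is simply the negative base-10 logarithm of the dissociation constant Kd expressed in molar units, as a small worked example shows:

```python
import math

def pkd_from_kd(kd_molar):
    # pKd = -log10(Kd), with Kd expressed in mol/L.
    # Higher pKd means stronger drug-target binding.
    return -math.log10(kd_molar)

# A 1 nM binder (Kd = 1e-9 M) has pKd = 9;
# a weaker 1 uM binder (Kd = 1e-6 M) has pKd = 6.
print(pkd_from_kd(1e-9), pkd_from_kd(1e-6))
```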

license:apache-2.0
8
2

ColD-Fusion-itr24-seed4

license:mit
8
0

ColD-Fusion-itr28-seed2

license:mit
8
0

ColD-Fusion-itr2-seed2

license:mit
8
0

ColD-Fusion-itr5-seed0

license:mit
8
0

ColD-Fusion-bert-base-uncased-itr16-seed0

license:mit
8
0

ColD-Fusion-bert-base-uncased-itr18-seed0

license:mit
8
0

ColD-Fusion-bert-base-uncased-itr8-seed0

license:mit
8
0

re2g-reranker-wow

license:apache-2.0
7
0

biomed.sm.mv-te-84m-CYP-ligand_scaffold_balanced-CYP3A4-101

ibm-research/biomed.sm.mv-te-84m-CYP-ligand_scaffold_balanced-CYP3A4-101

`biomed.sm.mv-te-84m` is a multimodal biomedical foundation model for small molecules created using MMELON (Multi-view Molecular Embedding with Late Fusion), a flexible approach to aggregating multiple views (sequence, image, graph) of molecules in a foundation-model setting. While models based on a single-view representation typically perform well on some downstream tasks and not others, the multi-view model performs robustly across a wide range of property-prediction tasks encompassing ligand-protein binding, molecular solubility, metabolism, and toxicity. It has been applied to screen compounds against a large (> 100 targets) set of G protein-coupled receptors (GPCRs) to identify strong binders for 33 targets related to Alzheimer's disease, which were validated through structure-based modeling and identification of key binding motifs.

- Developers: IBM Research
- GitHub Repository: https://github.com/BiomedSciAI/biomed-multi-view
- Paper: Multi-view biomedical foundation models for molecule-target and property prediction
- Release Date: Oct 28th, 2024
- License: Apache 2.0

Source code for the model and finetuning is made available in this repository.

Image representation: captures the 2D visual depiction of molecular structures, highlighting features like symmetry, bond angles, and functional groups. Molecular images are generated using RDKit and undergo data augmentation during training to enhance robustness.
Graph representation: encodes molecules as undirected graphs where nodes represent atoms and edges represent bonds. Atom-specific properties (e.g., atomic number, chirality) and bond-specific properties (e.g., bond type, stereochemistry) are embedded using categorical embedding techniques.
Text representation: utilizes SMILES strings to represent chemical structures, tokenized with a custom tokenizer. The sequences are embedded using a transformer-based architecture to capture the sequential nature of the chemical information.

The embeddings from these single-view pre-trained encoders are combined using an attention-based aggregator module. This module learns to weight each view appropriately, producing a unified multi-view embedding. This approach leverages the strengths of each representation to improve performance on downstream predictive tasks.

The model is intended for: (1) molecular property prediction — the pre-trained model may be fine-tuned for both regression and classification tasks, including but not limited to binding affinity, solubility, and toxicity; (2) similarity search — pre-trained model embeddings may be used as the basis for similarity measures over a chemical library; (3) combined representations — small-molecule embeddings provided by the model may be combined with protein embeddings to fine-tune on tasks that use both; (4) select task-specific fine-tuned models, given as examples. Through these activities, the model may aid in aspects of molecular discovery such as lead finding or optimization.

The model's domain of applicability is small, drug-like molecules; it is intended for use with molecules of less than 1000 Da molecular weight. The MMELON approach itself may be extended to include proteins and other macromolecules but does not at present provide embeddings for such entities. The model is at present not intended for molecular generation. Molecules must be given as a valid SMILES string that represents a valid chemically bonded graph; invalid inputs will impact performance or lead to errors.

Using the `SmallMoleculeMultiView` API requires the codebase https://github.com/BiomedSciAI/biomed-multi-view

Installation. Follow these steps to set up the `biomed-multi-view` codebase on your system.

Prerequisites: Operating system: Linux or macOS. Python version: Python 3.11. Conda: Anaconda or Miniconda installed. Git: version control to clone the repository.

Step 1: Set up the project directory. Choose a root directory where you want to install `biomed-multi-view`.
Step 3: Clone the repository. Navigate to the project directory and clone the repository. Note: if you prefer using SSH, ensure that your SSH keys are set up with GitHub.
Step 4: Install package dependencies. Install the package in editable mode along with development dependencies.
Step 5: macOS-specific instructions (Apple Silicon). If you are using a Mac with Apple Silicon (M1/M2/M3) and the zsh shell, you may need to disable globbing for the installation command. Install macOS-specific requirements optimized for Apple's Metal Performance Shaders (MPS).
Step 6: Installation verification (optional). Verify that the installation was successful by running the unit tests.

You can generate embeddings for a given molecule using the pretrained model, and you can use the finetuned models to make predictions on new data. For more advanced usage, see our detailed examples at: https://github.com/BiomedSciAI/biomed-multi-view

If you found our work useful, please consider giving a star to the repo and citing our paper.
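The attention-based aggregator described above can be sketched as a softmax-weighted sum of the per-view embeddings. The scores and vectors here are illustrative placeholders; the real MMELON module learns its weights during training.

```python
import math

def fuse_views(view_embeddings, view_scores):
    # Softmax over per-view attention scores, then a weighted
    # sum of the view embeddings into one fused embedding.
    exps = [math.exp(s) for s in view_scores]
    total = sum(exps)
    weights = [e / total for e in exps]
    dim = len(view_embeddings[0])
    fused = [
        sum(w * emb[i] for w, emb in zip(weights, view_embeddings))
        for i in range(dim)
    ]
    return fused, weights

# Hypothetical 4-d embeddings from the image, graph, and text encoders.
views = [[1.0, 0.0, 0.0, 0.0],
         [0.0, 1.0, 0.0, 0.0],
         [0.0, 0.0, 1.0, 0.0]]
fused, weights = fuse_views(views, view_scores=[0.5, 1.0, 0.2])
print(weights, fused)
```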

license:apache-2.0
7
0

mpt-7b-instruct2

license:apache-2.0
6
9

ia-multilingual-transliterated-roberta

6
1

ColD-Fusion-itr5-seed1

license:mit
6
0

biomed.omics.bl.sm.ma-ted-458m.protein_solubility

Protein solubility is a critical factor in both pharmaceutical research and production processes, as it can significantly impact the quality and function of a protein. This is an example of finetuning `ibm/biomed.omics.bl.sm.ma-ted-458m` for protein solubility prediction (binary classification) based solely on the amino-acid sequence. The benchmark is defined in: https://academic.oup.com/bioinformatics/article/34/15/2605/4938490 Data retrieved from: https://zenodo.org/records/1162886
- Developers: IBM Research
- GitHub Repository: https://github.com/BiomedSciAI/biomed-multi-alignment
- Paper: https://arxiv.org/abs/2410.22367
- Release Date: Oct 28th, 2024
- License: Apache 2.0

Using `ibm/biomed.omics.bl.sm.ma-ted-458m` requires installing https://github.com/BiomedSciAI/biomed-multi-alignment. A simple example for a task already supported by `ibm/biomed.omics.bl.sm.ma-ted-458m` is provided there; for more advanced usage, see our detailed example on `https://github.com/BiomedSciAI/biomed-multi-alignment`. If you found our work useful, please consider giving a star to the repo and citing our paper.
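Since the model predicts solubility from the amino-acid sequence alone, a useful point of comparison is the classic baseline featurization for such tasks: the amino-acid composition vector. A sketch (unrelated to the model's actual tokenizer or features):

```python
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids

def aa_composition(sequence):
    # Fraction of each standard amino acid in the sequence.
    sequence = sequence.upper()
    n = len(sequence)
    return {aa: sequence.count(aa) / n for aa in AMINO_ACIDS}

comp = aa_composition("MKTAYIAKQR")  # hypothetical 10-residue sequence
print(comp["K"])  # 0.2 — two lysines out of ten residues
```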

license:apache-2.0
5
5

biomed.sm.mv-te-84m-MoleculeNet-ligand_scaffold-SIDER-101

license:apache-2.0
5
2

re2g-reranker-fever

license:apache-2.0
5
1

biomed.sm.mv-te-84m-MoleculeNet-ligand_scaffold-TOX21-101

license:apache-2.0
5
1

re2g-qry-encoder-wow

license:apache-2.0
5
0

re2g-qry-encoder-fever

license:apache-2.0
5
0

ColD-Fusion-itr26-seed4

license:mit
5
0

ColD-Fusion-itr7-seed3

license:mit
5
0

biomed.sm.mv-te-84m-CYP-ligand_scaffold_balanced-CYP1A2-101

ibm-research/biomed.sm.mv-te-84m-CYP-ligand_scaffold_balanced-CYP1A2-101 — a CYP1A2 finetune of `biomed.sm.mv-te-84m`; the model card is identical to the CYP3A4 variant above.

license:apache-2.0
5
0

labradorite-13b

llama
4
75

re2g-qry-encoder-trex

license:apache-2.0
4
0

re2g-ctx-encoder-trex

license:apache-2.0
4
0

re2g-reranker-triviaqa

license:apache-2.0
4
0

ColD-Fusion-itr15-seed2

license:mit
4
0

biomed.sm.mv-te-84m-CYP-ligand_scaffold_balanced-CYP2C9-101

ibm-research/biomed.sm.mv-te-84m-CYP-ligand_scaffold_balanced-CYP2C9-101 — a CYP2C9 finetune of `biomed.sm.mv-te-84m`; the model card is identical to the CYP3A4 variant above.

license:apache-2.0
4
0

biomed.sm.mv-te-84m-MoleculeNet-ligand_scaffold-TOXCAST-101

license:apache-2.0
3
1

biomed.omics.bl.sm.ma-ted-458m.moleculenet_bbbp

license:apache-2.0
3
1

re2g-qry-encoder-triviaqa

license:apache-2.0
3
0

re2g-generation-wow

license:apache-2.0
3
0

ColD-Fusion-itr11-seed3

license:mit
3
0

ColD-Fusion-itr14-seed4

license:mit
3
0

ColD-Fusion-itr19-seed4

license:mit
3
0

ColD-Fusion-itr21-seed0

license:mit
3
0

ColD-Fusion-itr23-seed0

license:mit
3
0

ColD-Fusion-itr26-seed2

license:mit
3
0

ColD-Fusion-itr4-seed4

license:mit
3
0

ColD-Fusion-itr0-seed3

license:mit
3
0

gpt-neo-125m-multiexit

license:mit
3
0

biomed.omics.bl.sm.ma-ted-458m.dti_bindingdb_pkd_peer

license:apache-2.0
3
0

biomed.sm.mv-te-84m-ComputationalADME-random-RLM-101

ibm-research/biomed.sm.mv-te-84m-ComputationalADME-random-RLM-101

`biomed.sm.mv-te-84m` is a multimodal biomedical foundation model for small molecules created using MMELON (Multi-view Molecular Embedding with Late Fusion), a flexible approach to aggregating multiple views (sequence, image, graph) of molecules in a foundation-model setting. While models based on a single-view representation typically perform well on some downstream tasks but not others, the multi-view model performs robustly across a wide range of property-prediction tasks encompassing ligand-protein binding, molecular solubility, metabolism, and toxicity. It has been applied to screen compounds against a large (>100 targets) set of G protein-coupled receptors (GPCRs) to identify strong binders for 33 targets related to Alzheimer's disease, which were validated through structure-based modeling and identification of key binding motifs.

- Developers: IBM Research
- GitHub Repository: https://github.com/BiomedSciAI/biomed-multi-view
- Paper: Multi-view biomedical foundation models for molecule-target and property prediction
- Release Date: Oct 28th, 2024
- License: Apache 2.0

Source code for the model and fine-tuning is made available in this repository.

- Image representation: captures the 2D visual depiction of molecular structures, highlighting features like symmetry, bond angles, and functional groups. Molecular images are generated using RDKit and undergo data augmentation during training to enhance robustness.
- Graph representation: encodes molecules as undirected graphs where nodes represent atoms and edges represent bonds. Atom-specific properties (e.g., atomic number, chirality) and bond-specific properties (e.g., bond type, stereochemistry) are embedded using categorical embedding techniques.
- Text representation: uses SMILES strings to represent chemical structures, tokenized with a custom tokenizer.
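The applicability domain noted in the model cards (small, drug-like molecules under 1000 Da) can be enforced with a trivial pre-filter. In practice one would compute the molecular weight with RDKit directly from the SMILES string; the sketch below instead takes an explicit atom-count formula, and the helper names are hypothetical.

```python
# Standard atomic weights for a few common elements in drug-like molecules.
ATOMIC_WEIGHTS = {"H": 1.008, "C": 12.011, "N": 14.007, "O": 15.999,
                  "S": 32.06, "F": 18.998, "Cl": 35.45, "Br": 79.904}

def molecular_weight(formula):
    """formula: dict of element symbol -> atom count, e.g. {"C": 9, "H": 8, "O": 4}."""
    return sum(ATOMIC_WEIGHTS[el] * n for el, n in formula.items())

def in_applicability_domain(formula, max_da=1000.0):
    """True if the molecule falls under the ~1000 Da limit stated above."""
    return molecular_weight(formula) < max_da

aspirin = {"C": 9, "H": 8, "O": 4}  # acetylsalicylic acid, roughly 180 Da
mw = molecular_weight(aspirin)
```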

license:apache-2.0
3
0

biomed.sm.mv-te-84m-ComputationalADME-random-SOLUBILITY-101


license:apache-2.0
3
0

MoLM-350M-4B

license:apache-2.0
2
10

MoLM-700M-4B

license:apache-2.0
2
6

re2g-generation-nq

license:apache-2.0
2
1

ia-multilingual-original-script-roberta

2
1

re2g-qry-encoder-nq

license:apache-2.0
2
0

re2g-generation-fever

license:apache-2.0
2
0

re2g-ctx-encoder-fever

license:apache-2.0
2
0

roberta-large-vira-dialog-acts-live

2
0

ColD-Fusion-itr9-seed1

license:mit
2
0

ColD-Fusion-itr9-seed3

license:mit
2
0

ColD-Fusion-itr10-seed1

license:mit
2
0

ColD-Fusion-itr10-seed2

license:mit
2
0

ColD-Fusion-itr10-seed0

license:mit
2
0

ColD-Fusion-itr10-seed3

license:mit
2
0

ColD-Fusion-itr12-seed2

license:mit
2
0

ColD-Fusion-itr12-seed3

license:mit
2
0

ColD-Fusion-itr12-seed0

license:mit
2
0

ColD-Fusion-itr13-seed1

license:mit
2
0

ColD-Fusion-itr13-seed2

license:mit
2
0

ColD-Fusion-itr13-seed4

license:mit
2
0

ColD-Fusion-itr14-seed1

license:mit
2
0

ColD-Fusion-itr16-seed4

license:mit
2
0

ColD-Fusion-itr17-seed3

license:mit
2
0

ColD-Fusion-itr17-seed4

license:mit
2
0

ColD-Fusion-itr17-seed0

license:mit
2
0

ColD-Fusion-itr18-seed1

license:mit
2
0

ColD-Fusion-itr1-seed1

license:mit
2
0

ColD-Fusion-itr1-seed2

license:mit
2
0

ColD-Fusion-itr19-seed0

license:mit
2
0

ColD-Fusion-itr20-seed3

license:mit
2
0

ColD-Fusion-itr21-seed1

license:mit
2
0

ColD-Fusion-itr22-seed1

license:mit
2
0

ColD-Fusion-itr22-seed2

license:mit
2
0

ColD-Fusion-itr22-seed3

license:mit
2
0

ColD-Fusion-itr23-seed1

license:mit
2
0

ColD-Fusion-itr23-seed2

license:mit
2
0

ColD-Fusion-itr24-seed1

license:mit
2
0

ColD-Fusion-itr24-seed3

license:mit
2
0

ColD-Fusion-itr24-seed0

license:mit
2
0

ColD-Fusion-itr25-seed0

license:mit
2
0

ColD-Fusion-itr25-seed1

license:mit
2
0

ColD-Fusion-itr26-seed3

license:mit
2
0

ColD-Fusion-itr27-seed3

license:mit
2
0

ColD-Fusion-itr28-seed4

license:mit
2
0

ColD-Fusion-itr29-seed4

license:mit
2
0

ColD-Fusion-itr5-seed4

license:mit
2
0

ColD-Fusion-itr8-seed2

license:mit
2
0

ColD-Fusion-itr8-seed4

license:mit
2
0

ColD-Fusion-itr0-seed0

license:mit
2
0

ColD-Fusion-itr0-seed1

license:mit
2
0

ColD-Fusion-bert-base-uncased-itr19-seed0

license:mit
2
0

ColD-Fusion-bert-base-uncased-itr29-seed0

license:mit
2
0

ColD-Fusion-bert-base-uncased-itr5-seed0

license:mit
2
0

biomed.omics.bl.sm.ma-ted-458m.moleculenet_clintox_tox

license:apache-2.0
2
0

biomed.omics.bl.sm.ma-ted-458m.moleculenet_clintox_fda

license:apache-2.0
2
0

biomed.sm.mv-te-84m-ComputationalADME-random-HLM-101


license:apache-2.0
2
0

biomed.sm.mv-te-84m-ComputationalADME-random-HPPB-101


license:apache-2.0
2
0

granite-7b-lab-accelerator

license:llama2
1
3

re2g-ctx-encoder-nq

license:apache-2.0
1
1

qp-sentences

1
1

roberta-base-vira-dialog-acts-live

1
1

qcpg-captions

1
0

re2g-generation-trex

license:apache-2.0
1
0

re2g-generation-triviaqa

license:apache-2.0
1
0

re2g-ctx-encoder-wow

license:apache-2.0
1
0

qp-questions

1
0

qp-captions

1
0

roberta-base-vira-intents-live

1
0

roberta-large-vira-intents-live

1
0

ColD-Fusion-itr9-seed2

license:mit
1
0

ColD-Fusion-itr9-seed4

license:mit
1
0

ColD-Fusion-itr10-seed4

license:mit
1
0

ColD-Fusion-itr11-seed1

license:mit
1
0

ColD-Fusion-itr11-seed0

license:mit
1
0

ColD-Fusion-itr12-seed4

license:mit
1
0

ColD-Fusion-itr13-seed3

license:mit
1
0

ColD-Fusion-itr14-seed2

license:mit
1
0

ColD-Fusion-itr14-seed3

license:mit
1
0

ColD-Fusion-itr14-seed0

license:mit
1
0

ColD-Fusion-itr15-seed1

license:mit
1
0

ColD-Fusion-itr15-seed0

license:mit
1
0

ColD-Fusion-itr16-seed1

license:mit
1
0

ColD-Fusion-itr16-seed2

license:mit
1
0

ColD-Fusion-itr16-seed0

license:mit
1
0

ColD-Fusion-itr17-seed1

license:mit
1
0

ColD-Fusion-itr17-seed2

license:mit
1
0

ColD-Fusion-itr18-seed3

license:mit
1
0

ColD-Fusion-itr18-seed4

license:mit
1
0

ColD-Fusion-itr18-seed0

license:mit
1
0

ColD-Fusion-itr1-seed3

license:mit
1
0

ColD-Fusion-itr1-seed0

license:mit
1
0

ColD-Fusion-itr1-seed4

license:mit
1
0

ColD-Fusion-itr19-seed2

license:mit
1
0

ColD-Fusion-itr19-seed3

license:mit
1
0

ColD-Fusion-itr20-seed1

license:mit
1
0

ColD-Fusion-itr20-seed2

license:mit
1
0

ColD-Fusion-itr20-seed4

license:mit
1
0

ColD-Fusion-itr20-seed0

license:mit
1
0

ColD-Fusion-itr21-seed2

license:mit
1
0

ColD-Fusion-itr21-seed4

license:mit
1
0

ColD-Fusion-itr22-seed4

license:mit
1
0

ColD-Fusion-itr22-seed0

license:mit
1
0

ColD-Fusion-itr23-seed3

license:mit
1
0

ColD-Fusion-itr23-seed4

license:mit
1
0

ColD-Fusion-itr24-seed2

license:mit
1
0

ColD-Fusion-itr25-seed2

license:mit
1
0

ColD-Fusion-itr25-seed3

license:mit
1
0

ColD-Fusion-itr25-seed4

license:mit
1
0

ColD-Fusion-itr26-seed0

license:mit
1
0

ColD-Fusion-itr26-seed1

license:mit
1
0

ColD-Fusion-itr27-seed0

license:mit
1
0

ColD-Fusion-itr27-seed2

license:mit
1
0

ColD-Fusion-itr27-seed4

license:mit
1
0

ColD-Fusion-itr28-seed0

license:mit
1
0

ColD-Fusion-itr28-seed3

license:mit
1
0

ColD-Fusion-itr2-seed1

license:mit
1
0

ColD-Fusion-itr2-seed3

license:mit
1
0

ColD-Fusion-itr2-seed4

license:mit
1
0

ColD-Fusion-itr2-seed0

license:mit
1
0

ColD-Fusion-itr29-seed0

license:mit
1
0

ColD-Fusion-itr29-seed1

license:mit
1
0

ColD-Fusion-itr29-seed2

license:mit
1
0

ColD-Fusion-itr29-seed3

license:mit
1
0

ColD-Fusion-itr3-seed0

license:mit
1
0

ColD-Fusion-itr3-seed1

license:mit
1
0

ColD-Fusion-itr3-seed2

license:mit
1
0

ColD-Fusion-itr3-seed3

license:mit
1
0

ColD-Fusion-itr3-seed4

license:mit
1
0

ColD-Fusion-itr4-seed1

license:mit
1
0

ColD-Fusion-itr4-seed0

license:mit
1
0

ColD-Fusion-itr4-seed2

license:mit
1
0

ColD-Fusion-itr4-seed3

license:mit
1
0

ColD-Fusion-itr5-seed2

license:mit
1
0

ColD-Fusion-itr5-seed3

license:mit
1
0

ColD-Fusion-itr6-seed1

license:mit
1
0

ColD-Fusion-itr6-seed0

license:mit
1
0

ColD-Fusion-itr6-seed2

license:mit
1
0

ColD-Fusion-itr6-seed3

license:mit
1
0

ColD-Fusion-itr6-seed4

license:mit
1
0

ColD-Fusion-itr7-seed1

license:mit
1
0

ColD-Fusion-itr7-seed2

license:mit
1
0

ColD-Fusion-itr7-seed0

license:mit
1
0

ColD-Fusion-itr8-seed1

license:mit
1
0

ColD-Fusion-itr7-seed4

license:mit
1
0

ColD-Fusion-itr8-seed0

license:mit
1
0

ColD-Fusion-itr8-seed3

license:mit
1
0

ColD-Fusion-itr0-seed2

license:mit
1
0

ColD-Fusion-itr0-seed4

license:mit
1
0

ColD-Fusion-bert-base-uncased-itr24-seed0

license:mit
1
0

biomed.sm.mv-te-84m-ComputationalADME-random-MDR1-MDCK-ER-101


license:apache-2.0
1
0

biomed.sm.mv-te-84m-ComputationalADME-random-RPPB-101

ibm-research/biomed.sm.mv-te-84m-ComputationalADME-random-RPPB-101 `biomed.sm.mv-te-84m` is a multimodal biomedical foundation model for small molecules created using MMELON (Multi-view Molecular Embedding with Late Fusion), a flexible approach to aggregating multiple views (sequence, image, graph) of molecules in a foundation-model setting. While models based on a single-view representation typically perform well on some downstream tasks and not others, the multi-view model performs robustly across a wide range of property-prediction tasks encompassing ligand-protein binding, molecular solubility, metabolism, and toxicity. It has been applied to screen compounds against a large (> 100 targets) set of G protein-coupled receptors (GPCRs) to identify strong binders for 33 targets related to Alzheimer's disease, which were validated through structure-based modeling and identification of key binding motifs; see the paper Multi-view biomedical foundation models for molecule-target and property prediction.
- Developers: IBM Research
- GitHub Repository: https://github.com/BiomedSciAI/biomed-multi-view
- Paper: Multi-view biomedical foundation models for molecule-target and property prediction
- Release Date: Oct 28th, 2024
- License: Apache 2.0

Source code for the model and fine-tuning is made available in this repository.
- Image representation: captures the 2D visual depiction of molecular structures, highlighting features like symmetry, bond angles, and functional groups. Molecular images are generated using RDKit and undergo data augmentation during training to enhance robustness.
- Graph representation: encodes molecules as undirected graphs where nodes represent atoms and edges represent bonds. Atom-specific properties (e.g., atomic number, chirality) and bond-specific properties (e.g., bond type, stereochemistry) are embedded using categorical embedding techniques.
- Text representation: uses SMILES strings to represent chemical structures, tokenized with a custom tokenizer.
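As a rough illustration of the text view's input handling, a regex-based SMILES tokenizer can look like the sketch below. This is illustrative only: the model's actual custom tokenizer may use a different vocabulary and rules.

```python
import re

# Simplified SMILES token pattern: bracketed atoms, two-letter elements,
# organic-subset atoms (upper/lowercase), bond and branch symbols, digits.
SMILES_TOKEN = re.compile(
    r"\[[^\]]+\]|Br|Cl|Si|Se|@@|[BCNOPSFIbcnops]|[=#()+\-./\\%@]|\d"
)

def tokenize_smiles(smiles: str):
    tokens = SMILES_TOKEN.findall(smiles)
    # Reject inputs the pattern cannot fully cover, since invalid SMILES
    # degrade model performance or cause errors downstream.
    if "".join(tokens) != smiles:
        raise ValueError(f"Unrecognized characters in SMILES: {smiles!r}")
    return tokens

print(tokenize_smiles("CC(=O)Oc1ccccc1C(=O)O"))  # aspirin
```

Note the alternation order: two-letter elements like `Br` and `Cl` must be tried before the single-letter atom class, or `Br` would be split into boron and an unrecognized `r`.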

license:apache-2.0
1
0

biomed.sm.mv-te-84m-CYP-ligand_scaffold_balanced-CYP2C19-101

ibm-research/biomed.sm.mv-te-84m-CYP-ligand_scaffold_balanced-CYP2C19-101 is a task-specific fine-tuned variant of `biomed.sm.mv-te-84m`, the multimodal biomedical foundation model for small molecules created using MMELON (Multi-view Molecular Embedding with Late Fusion). Its model card is identical to that of `ibm-research/biomed.sm.mv-te-84m-ComputationalADME-random-RPPB-101` above.

license:apache-2.0
1
0

biomed.sm.mv-te-84m-CYP-ligand_scaffold_balanced-CYP2D6-101

ibm-research/biomed.sm.mv-te-84m-CYP-ligand_scaffold_balanced-CYP2D6-101 is a task-specific fine-tuned variant of `biomed.sm.mv-te-84m`, the multimodal biomedical foundation model for small molecules created using MMELON (Multi-view Molecular Embedding with Late Fusion). Its model card is identical to that of `ibm-research/biomed.sm.mv-te-84m-ComputationalADME-random-RPPB-101` above.

license:apache-2.0
1
0

otter_ubc_transe

license:mit
0
4

biomed.rna.bert.110m.wced.multitask.v1

license:apache-2.0
0
3

biomed.rna.bert.110m.wced.v1

license:apache-2.0
0
3

biomed.rna.bert.110m.mlm.multitask.v1

license:apache-2.0
0
3

biomed.rna.bert.110m.mlm.rda.v1

license:apache-2.0
0
3

otter_primekg_distmult

license:mit
0
3

otter_stitch_distmult

license:mit
0
3

biomed.dna.snp.modernbert.113m.v1

license:apache-2.0
0
2

otter_ubc_classifier

license:mit
0
2

otter_ubc_distmult

license:mit
0
2

otter_dude_distmult

license:mit
0
2

otter_dude_transe

license:mit
0
2

otter_dude_classifier

license:mit
0
2

otter_primekg_classifier

license:mit
0
2

otter_primekg_transe

license:mit
0
2

otter_stitch_classifier

license:mit
0
2

otter_stitch_transe

license:mit
0
2

tslm-discourse-markers

0
1

dromedary-65b-lora-delta-v0

license:gpl
0
1

grounded-preference-model

license:apache-2.0
0
1

otter_ub_cb

license:mit
0
1