zjunlp

79 models

SafeEdit-Safety-Classifier

license:apache-2.0
433
5

MolGen-large

MolGen-large was introduced in the paper "Domain-Agnostic Molecular Generation with Self-feedback" and first released in this repository. It is a pre-trained molecular generative model built on SELFIES, a 100% robust molecular language representation.

Model description

MolGen-large is the first pre-trained model that only produces chemically valid molecules. Trained on a corpus of over 100 million molecules in SELFIES representation, MolGen-large learns the intrinsic structural patterns of molecules by mapping corrupted SELFIES back to their original forms. Specifically, it employs a bidirectional Transformer as its encoder and an autoregressive Transformer as its decoder. Through its carefully designed multi-task molecular prefix tuning (MPT), MolGen-large can generate molecules with desired properties, making it a valuable tool for molecular optimization.

Intended uses

You can use the raw model for molecule generation or fine-tune it on a downstream task. Note that the following examples only demonstrate using our pre-trained model for molecule generation; see the repository for fine-tuning details on a task that interests you.
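The denoising objective described above (mapping corrupted SELFIES back to their original forms) can be illustrated with a toy corruption step. This is only an illustrative sketch of the idea, not the model's actual data pipeline; the mask token, corruption rate, and function name are assumptions:

```python
import random

def corrupt_selfies(tokens, mask_token="<mask>", corrupt_prob=0.3, seed=0):
    """Randomly mask SELFIES tokens, mimicking a BART-style denoising input."""
    rng = random.Random(seed)
    return [mask_token if rng.random() < corrupt_prob else tok for tok in tokens]

# Benzene in SELFIES, split into its tokens
benzene = ["[C]", "[=C]", "[C]", "[=C]", "[C]", "[=C]", "[Ring1]", "[=Branch1]"]
corrupted = corrupt_selfies(benzene)
# The encoder sees `corrupted`; the decoder is trained to reproduce `benzene`.
```

During pre-training, the model learns to reconstruct the original token sequence from such corrupted inputs, which is why every generated string remains a valid SELFIES molecule.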

244
17

OneKE

OneKE: A Bilingual Large Language Model for Knowledge Extraction

- What is OneKE?
- How is OneKE trained?
- Getting Started with OneKE
  - Quick Start
  - Advanced Use of OneKE
  - OneKE Instruction Format
  - Conversion of OneKE Instruction Format
  - Customized Schema Description Instructions
- Evaluation
- Continue Training
- Citation

OneKE is a large-scale model framework for knowledge extraction jointly developed by Ant Group and Zhejiang University. It offers generalized knowledge extraction in both Chinese and English, across multiple domains and tasks, and provides comprehensive toolchain support. OneKE has been contributed to the OpenKG open knowledge graph community as open source.

Knowledge construction from unstructured documents has long been one of the key obstacles to deploying knowledge graphs at scale. Real-world information is highly fragmented and unstructured, and the gap between extracted content and its natural-language expression is substantial, so large language models often perform suboptimally on information extraction tasks. Natural-language text also contains ambiguity, polysemy, and metaphor arising from implicit and long-distance context associations, which poses significant challenges for knowledge extraction. In response, Ant Group and Zhejiang University drew on years of expertise in knowledge graphs and natural language processing to jointly build and upgrade the knowledge extraction capabilities of Ant's large model "BaiLing", releasing the bilingual knowledge extraction framework OneKE, which includes a version based on full-parameter fine-tuning of Chinese-Alpaca-2-13B. Evaluation shows that OneKE achieves relatively strong performance on several fully supervised and zero-shot entity/relation/event extraction tasks.

A unified knowledge extraction framework has broad application scenarios and can significantly reduce the cost of building domain-specific knowledge graphs. Extracting structured knowledge from massive data to build high-quality knowledge graphs, and establishing logical associations between knowledge elements, enables interpretable inference and decision-making. It can also enhance large models by mitigating hallucination and improving stability, accelerating their deployment in vertical domains. In medicine, for example, knowledge extraction can turn physicians' experience into structured, rule-based management, supporting controllable auxiliary diagnosis and medical Q&A systems. In finance, it can extract financial indicators, risk events, causal logic, and industry chains for automated financial report generation, risk prediction, and industry-chain analysis. In the public sector, it can support knowledge-based management of government regulations, improving the efficiency and accuracy of public services.

How is OneKE trained?

OneKE focuses on schema-generalizable information extraction. Because existing extraction instruction data suffers from non-standard formats, noise, and lack of diversity, OneKE adopts techniques such as normalization and cleaning of extraction instructions, hard negative sample collection, and schema-based batched instruction construction, as shown in the illustration. For more details, refer to the paper "IEPile: Unearthing Large-Scale Schema-Based Information Extraction Corpus" [Github].
The zero-shot generalization comparison between OneKE and other large models covers the following benchmarks:

- `NER-en`: CrossNER_AI, CrossNER_literature, CrossNER_music, CrossNER_politics, CrossNER_science
- `NER-zh`: WEIBONER, boson
- `RE-zh`: COAE2016, IPRE, SKE2020
- `RE-en`: FewRel, Wiki-ZSL
- `EE-en`: CrudeOilNews, WikiEvents, RAMS
- `EE-zh`: FewFC, CCF Law

At least 20GB of VRAM is recommended for training and inference. For more detailed inference instructions, please refer to DeepKE-llm/InstructKGC/6.1.2IE专用模型.

OneKE Instruction Format

OneKE instructions are formatted as dictionary-type strings similar to JSON, with three fields:

1. `'instruction'`: the task description, specifying in natural language the role the model plays and the task to be completed;
2. `'schema'`: a list of labels to be extracted, clearly indicating the key fields of the information the user needs; it is dynamic and changeable;
3. `'input'`: the source text for information extraction.

Below are examples of instructions for various tasks:

> Note: In consideration of the complexity of information extraction within specific domains and the high reliance on prompts, we support embedding schema descriptions and examples in the instructions to enhance extraction; for details, refer to `Customized Schema Description Instructions` and `Customized Example Instructions`. Please understand that, given the limited scale of the model, the output is prompt-dependent and different prompts may yield inconsistent results.

Since predicting all schemas in the label set at once is too challenging and does not scale, OneKE uses a batched approach during training: it divides the schemas queried per instruction, asking about a fixed number of schemas at a time. Hence, if a data point's label set is too long, it is split into multiple instructions that the model addresses in turn.
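The three-field format and the batched schema splitting described above can be sketched in a few lines. This is an illustrative reimplementation, not OneKE's released code; the function name, the split size of 4, and the fifth label (added only to show the split) are assumptions:

```python
import json

def build_instructions(task_desc, schema, text, split_num=4):
    """Emit one OneKE-style JSON instruction string per chunk of the schema list."""
    chunks = [schema[i:i + split_num] for i in range(0, len(schema), split_num)]
    return [
        json.dumps(
            {"instruction": task_desc, "schema": chunk, "input": text},
            ensure_ascii=False,
        )
        for chunk in chunks
    ]

task = ("You are an expert in named entity recognition. Please extract entities "
        "that match the schema definition from the input. Return an empty list if "
        "the entity type does not exist. Please respond in the format of a JSON string.")
labels = ["person", "organization", "else", "location", "event"]
instructions = build_instructions(
    task, labels,
    "284 Robert Allenby ( Australia ) 69 71 71 73 , Miguel Angel Martin ( Spain ) "
    "75 70 71 68 ( Allenby won at first play-off hole )",
)
# Five labels with split_num=4 yield two instructions: one asking about the
# first four labels, one asking about the remaining label.
```

Each emitted string is a self-contained query, so a long label set simply becomes several model calls whose results are merged afterwards.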
Below is a simple batched instruction generation script, followed by an example of its output:

> '{"instruction": "You are an expert in named entity recognition. Please extract entities that match the schema definition from the input. Return an empty list if the entity type does not exist. Please respond in the format of a JSON string.", "schema": ["person", "organization", "else", "location"], "input": "284 Robert Allenby ( Australia ) 69 71 71 73 , Miguel Angel Martin ( Spain ) 75 70 71 68 ( Allenby won at first play-off hole )"}'

For more detailed data conversion, please refer to DeepKE-llm/InstructKGC/README_CN.md/2.3测试数据转换.

Knowledge Graph Construction (KGC) Description Instructions

Given that example instances can be lengthy, and the model's maximum training length is limited, too many examples can actually hurt performance. We therefore suggest providing two examples, one positive and one negative, while keeping the number of schemas at one.

Evaluation

To extract structured content from the output text and assess it, please refer to DeepKE-llm/InstructKGC/README_CN.md/7.评估.

Continue Training

To continue training OneKE, refer to DeepKE-llm/InstructKGC/4.9领域内数据继续训练.

Citation

If you have used OneKE in your work, please kindly cite the following paper:

llama
78
45

ChineseGuard-1.5B

We release the following variants of our harmful content detection model: Run single-input inference using the ChineseGuard-1.5B model: To run inference on the entire ChineseHarm-Bench using ChineseGuard-1.5B and 8 NPUs: > For more configuration options (e.g., batch size, device selection, custom prompt templates), please refer to `singleinfer.py` and `batchinfer.py`. > > Note: The inference scripts support both NPU and GPU devices. Please cite our repository if you use ChineseGuard in your work. Thanks!
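The inference commands referenced above were stripped from this page (see `singleinfer.py` and `batchinfer.py` in the repository for the real scripts). As an illustrative stand-in only, not the repository's code, a classification prompt for such a guard model might be assembled like this; the template wording and label set are entirely assumed:

```python
def build_guard_prompt(text, categories):
    """Assemble a hypothetical harmful-content classification prompt."""
    labels = "\n".join(f"- {c}" for c in categories)
    return (
        "You are a harmful-content classifier. "
        f"Classify the text into one of:\n{labels}\n"
        f"Text: {text}\nLabel:"
    )

prompt = build_guard_prompt("example input", ["safe", "harmful"])
# `prompt` would then be passed to the model for single-input inference.
```

For the actual prompt template and NPU/GPU device handling, defer to the repository's own inference scripts.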

license:cc-by-nc-4.0
69
2

knowlm-13b-zhixi

llama
64
7

OntoProtein

43
12

DataMind-Analysis-Qwen2.5-14B

license:apache-2.0
43
2

MolGen-large-opt

42
3

OceanGPT-basic-7B-v0.1

llama
40
15

OceanGPT-o-7B

OceanGPT(沧渊): A Large Language Model for Ocean Science Tasks

Project • Paper • Models • Web • Quickstart • Citation

OceanGPT-o is based on Qwen2.5-VL and has been trained on an English and Chinese dataset in the ocean domain (last updated 2025-05-14). Please note that the models and data in this repository are updated regularly to fix errors; the latest update date is added to the README for your reference.

- ❗We will continue to update.
- ❗Disclaimer: This project is purely an academic exploration rather than a product. Due to the inherent limitations of large language models, issues such as hallucination may occur.

OceanGPT (沧渊) is trained on top of open-sourced large language models including Qwen, MiniCPM, and LLaMA, and on open-sourced data and tools including Moos, UATD, the Forward-looking Sonar Detection Dataset, NKSID, SeabedObjects-KLSG, and Marine Debris.

- Due to limited computational resources, OceanGPT-o currently supports natural language generation only for certain types of sonar images and ocean science images.
- We did not optimize for identity, and the model may generate identity information similar to that of the Qwen/MiniCPM/LLaMA/GPT series models.
- The model's output is influenced by prompt tokens, which may lead to inconsistent results across attempts.
- Simulated embodied intelligence requires training with specific simulator code instructions (the simulator is subject to copyright restrictions and cannot be released for now), and these capabilities are currently quite limited.

Please cite the following paper if you use OceanGPT in your work.

Qwen2.5-VL offers a toolkit to help you handle various types of visual input more conveniently, as if you were using an API; it supports base64, URLs, and interleaved images and videos. You can install it using the following command:
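The install command for the Qwen2.5-VL visual-input toolkit was stripped from this page. Based on the upstream Qwen2.5-VL documentation, the package is presumably `qwen-vl-utils`; treat this as recovered from upstream docs rather than from this card:

```shell
# Visual-input helper toolkit for Qwen2.5-VL (per the upstream Qwen2.5-VL docs)
pip install qwen-vl-utils
```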

license:mit
39
3

DataMind-Analysis-Qwen2.5-7B

license:apache-2.0
38
2

OceanGPT-basic-4B-Thinking

license:mit
27
1

knowlm-13b-ie

- 1. Differences from knowlm-13b-zhixi
- 📏 2. Information extraction templates
- 3. Common subject-relation types
- 4. Ready-made datasets
- 5. Conversion scripts
- 6. Usage
- 7. Evaluation

1. Differences from knowlm-13b-zhixi

Compared with zjunlp/knowlm-13b-zhixi, zjunlp/knowlm-13b-ie is slightly more practical for information extraction, but its general applicability is reduced. zjunlp/knowlm-13b-ie samples roughly 10% of the data from Chinese and English information extraction datasets and then applies negative sampling. For example, if dataset A contains the labels [a, b, c, d, e, f], we first sample 10% of A's data. A given sample s may contain only the labels a and b; we then randomly add relations it does not actually contain, such as c and d drawn from a designated candidate relation list. When encountering these added relations, the model may output text such as 'NAN'. This gives the model some ability to produce 'NAN' outputs, enhancing its information extraction capability while weakening its generalization.

📏 2. Information extraction templates

The `template` is used to construct the `instruction` fed to the model and consists of three parts:

1. Task description: states the model's role and the task to complete, such as entity recognition, relation extraction, or event extraction.
2. Candidate label list {s_schema} (optional): defines the label categories to extract, such as entity types, relation types, or event types.
3. Structured output format {s_format}: specifies how the model should present the extracted structured information.

The schema ({s_schema}) and structured-output-format ({s_format}) placeholders are embedded in the templates and must be specified by the user. For a fuller understanding of the templates, please refer to the configs directory and the files nerconverter.py, reconverter.py, eeconverter.py, eetconverter.py, and eeaconverter.py.

Before data can be fed to the model, it must be formatted to include the `instruction` and `input` fields. We provide a script, kg2instruction/convert.py, which batch-converts data into a format the model can use directly.

> Before using the kg2instruction/convert.py script, please consult the data directory, which details the data format required for each task. See sample.json for the format of data before conversion, schema.json for how a schema is organized, and processed.json for the format after conversion.

A schema file consists of three lines, which differ by task.

For the relation extraction (RE) task:
```
[]                                                  # empty list
["创始人", "号", "注册资本", ...]                    # relation-type list
{}                                                  # empty dict
```

For the event extraction (EE) task:
```
["交往-感谢", "组织行为-开幕", "竞赛行为-退赛", ...]   # event-type list
["解雇方", "解约方", "举报发起方", "被拘捕者"]          # argument-role list
{"组织关系-裁员": ["裁员方", "裁员人数", "时间"], "司法行为-起诉": ["原告", "被告", "时间"], ...}   # event-type dict
```

For the event type extraction (EET) task:
```
["交往-感谢", "组织行为-开幕", "竞赛行为-退赛", ...]   # event-type list
[]                                                  # empty list
{}                                                  # empty dict
```

For the event argument extraction (EEA) task:
```
["交往-感谢", "组织行为-开幕", "竞赛行为-退赛", ...]   # event-type list
["解雇方", "解约方", "举报发起方", "被拘捕者"]          # argument-role list
{"组织关系-裁员": ["裁员方", "裁员人数", "时间"], "司法行为-起诉": ["原告", "被告", "时间"], ...}   # event-type dict
```

An example conversion command:

```bash
python kg2instruction/convert_test.py \
  --src_path data/NER/sample.json \
  --tgt_path data/NER/processed.json \
  --schema_path data/NER/schema.json \
  --language zh \
  --task NER \
  --sample 0
```

Before conversion:

```json
{
  "input": "相比之下,青岛海牛队和广州松日队的雨中之战虽然也是0∶0,但乏善可陈。",
  "entity": [{"entity": "广州松日队", "entity_type": "组织机构"}, {"entity": "青岛海牛队", "entity_type": "组织机构"}]
}
```

After conversion:

```json
{
  "id": "e88d2b42f8ca14af1b77474fcb18671ed3cacc0c75cf91f63375e966574bd187",
  "instruction": "请在所给文本中找出并列举['组织机构', '人物', '地理位置']提及的实体类型,不存在的类型请注明为NAN。回答应按(实体,实体类型)\n格式进行。",
  "input": "相比之下,青岛海牛队和广州松日队的雨中之战虽然也是0∶0,但乏善可陈。",
  "output": "(青岛海牛队,组织机构)\n(广州松日队,组织机构)\nNAN\nNAN"
}
```

Sample data for the other tasks before conversion:

Relation extraction (RE):
```json
{
  "input": "如何演好自己的角色,请读《演员自我修养》《喜剧之王》周星驰崛起于穷困潦倒之中的独门秘笈",
  "relation": [{"head": "喜剧之王", "relation": "主演", "tail": "周星驰"}]
}
```

Event extraction (EE):
```json
{
  "input": "消失的“外企光环”,5月份在华裁员900余人,香饽饽变“臭”了",
  "event": [{"event_trigger": "裁员", "event_type": "组织关系-裁员", "arguments": [{"argument": "900余人", "role": "裁员人数"}, {"argument": "5月份", "role": "时间"}]}]
}
```

Event type extraction (EET):
```json
{
  "input": "前两天,被称为 “ 仅次于苹果的软件服务商 ” 的 Oracle( 甲骨文 )公司突然宣布在中国裁员。。",
  "event": [{"event_trigger": "裁员", "event_type": "组织关系-裁员", "arguments": [{"argument": "前两天", "role": "时间"}, {"argument": "被称为 “ 仅次于苹果的软件服务商 ” 的 Oracle( 甲骨文 )公司", "role": "裁员方"}]}]
}
```

Event argument extraction (EEA):
```json
{
  "input": "不仅仅是中国IT企业在裁员,为何500强的甲骨文也发生了全球裁员",
  "event": [{"event_trigger": "裁员", "event_type": "组织关系-裁员", "arguments": [{"argument": "中国IT企业", "role": "裁员方"}]}, {"event_trigger": "裁员", "event_type": "组织关系-裁员", "arguments": [{"argument": "500强的甲骨文", "role": "裁员方"}]}]
}
```

Converted instructions for the other tasks:

Relation extraction (RE):
```json
{
  "id": "5526d8aa9520a0feaa045ae41d347cf7ca48bd84385743ed453ea57dbe743c7c",
  "instruction": "你是专门进行关系三元组提取的专家。已知候选的关系列表:['丈夫', '出版社', '导演', '主演', '注册资本', '编剧', '人口数量', '成立日期', '作曲', '嘉宾', '海拔', '作词', '身高', '出品公司', '占地面积', '母亲'],请你根据关系列表,从以下输入中抽取出可能存在的头实体与尾实体,并给出对应的关系三元组,如果不存在某关系就输出NAN。请按照(头实体,关系,尾实体)\n的格式回答。",
  "input": "如何演好自己的角色,请读《演员自我修养》《喜剧之王》周星驰崛起于穷困潦倒之中的独门秘笈",
  "output": "NAN\nNAN\nNAN\n(喜剧之王,主演,周星驰)\nNAN\nNAN\nNAN\nNAN\nNAN\nNAN\nNAN\nNAN\nNAN\nNAN\nNAN\nNAN"
}
```

Event extraction (EE):
```json
{
  "id": "f4dcda5576849c77df664c9318d136c36a663f11ad8af98e2794b113884fa69c",
  "instruction": "你是专门进行事件提取的专家。已知候选的事件字典:{'人生-婚礼': ['时间', '参礼人员', '地点', '结婚双方'], '组织关系-停职': ['所属组织', '停职人员', '时间'], '交往-会见': ['时间', '会见主体', '地点', '会见对象'], '组织关系-解约': ['时间', '被解约方', '解约方'], '组织行为-开幕': ['时间', '地点', '活动名称'], '人生-求婚': ['时间', '求婚对象', '求婚者'], '人生-失联': ['失联者', '时间', '地点'], '产品行为-发布': ['时间', '发布方', '发布产品'], '灾害/意外-洪灾': ['时间', '受伤人数', '地点', '死亡人数'], '产品行为-上映': ['时间', '上映方', '上映影视'], '组织行为-罢工': ['所属组织', '罢工人数', '时间', '罢工人员'], '人生-怀孕': ['时间', '怀孕者'], '灾害/意外-起火': ['时间', '受伤人数', '地点', '死亡人数'], '灾害/意外-车祸': ['时间', '受伤人数', '地点', '死亡人数'], '司法行为-开庭': ['时间', '开庭法院', '开庭案件'], '交往-探班': ['探班主体', '时间', '探班对象'], '竞赛行为-退役': ['时间', '退役者'], '组织关系-裁员': ['时间', '裁员人数'], '财经/交易-出售/收购': ['时间', '收购方', '交易物', '出售价格', '出售方'], '组织关系-退出': ['退出方', '时间', '原所属组织'], '竞赛行为-禁赛': ['时间', '被禁赛人员', '禁赛机构', '禁赛时长']},请你根据事件字典,从以下输入中抽取出可能存在的事件,如果不存在某事件就输出NAN。请按照(事件触发词,事件类型,事件论元1#论元角色1;事件论元2#论元角色2)\n的格式回答。",
  "input": "消失的“外企光环”,5月份在华裁员900余人,香饽饽变“臭”了",
  "output": "NAN\nNAN\nNAN\nNAN\nNAN\nNAN\nNAN\nNAN\nNAN\nNAN\nNAN\nNAN\nNAN\nNAN\nNAN\nNAN\nNAN\n(裁员,组织关系-裁员,时间#5月份;裁员人数#900余人)\nNAN\nNAN\nNAN"
}
```

Event type extraction (EET):
```json
{
  "id": "17aae856c45d7c75f1850d358dc81268a2a9604dce3b98865b3896d0f37a49ef",
  "instruction": "作为事件分析专员,你需要查看输入并根据事件类型名录:['人生-订婚', '灾害/意外-坍/垮塌', '财经/交易-涨价', '组织行为-游行', '组织关系-辞/离职', '交往-会见', '人生-结婚', '竞赛行为-禁赛', '组织关系-裁员', '灾害/意外-袭击', '司法行为-约谈', '人生-婚礼', '竞赛行为-退役', '人生-离婚', '灾害/意外-地震', '财经/交易-跌停', '产品行为-发布', '人生-求婚', '人生-怀孕', '组织关系-解约', '财经/交易-降价'],来确定可能发生的事件。所有回答都应该基于(事件触发词,事件类型)\n格式。如果事件类型不匹配,请用NAN标记。",
  "input": "前两天,被称为 “ 仅次于苹果的软件服务商 ” 的 Oracle( 甲骨文 )公司突然宣布在中国裁员。。",
  "output": "NAN\nNAN\nNAN\nNAN\nNAN\nNAN\nNAN\nNAN\n(裁员,组织关系-裁员)\nNAN\nNAN\nNAN\nNAN\nNAN\nNAN\nNAN\nNAN\nNAN\nNAN\nNAN\nNAN"
}
```

Event argument extraction (EEA):
```json
{
  "id": "5079d3cb44e94ca9b0749e687b1b19edc94b60fc2c1eb97b2154bbeb93ad3955",
  "instruction": "你是专门进行事件论元提取的专家。已知事件字典:{'组织关系-裁员': ['裁员方']},事件类型及触发词:[{'event_type': '组织关系-裁员', 'event_trigger': '裁员'}],请你从以下输入中抽取出可能存在的论元,如果不存在某事件论元就输出NAN。请按照(事件触发词,事件类型,事件论元1#论元角色1;事件论元2#论元角色2)\n的格式回答。",
  "input": "不仅仅是中国IT企业在裁员,为何500强的甲骨文也发生了全球裁员",
  "output": "(裁员,组织关系-裁员,裁员方#中国IT企业)\n(裁员,组织关系-裁员,裁员方#500强的甲骨文)"
}
```

Inference:

```bash
CUDA_VISIBLE_DEVICES="0" python src/inference.py \
  --model_name_or_path 'models/knowlm-13b-ie' \
  --model_name 'llama' \
  --input_file 'data/NER/processed.json' \
  --output_file 'results/ner_test.json' \
  --fp16 \
  --bits 4
```

Evaluation:

```bash
python kg2instruction/evaluate.py \
  --standard_path data/NER/processed.json \
  --submit_path data/NER/processed.json \
  --task ner \
  --language zh
```
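The model's flat output format (one tuple per line, with NAN for absent labels) is easy to post-process. A minimal sketch; the function name and the NAN-filtering convention are inferred from the examples above, and event lines using `#`/`;` separators would need an extra splitting step:

```python
def parse_ie_output(output: str):
    """Parse '(a,b,...)' lines from model output, skipping NAN placeholders."""
    results = []
    for line in output.split("\n"):
        line = line.strip()
        if not line or line == "NAN":
            continue
        if line.startswith("(") and line.endswith(")"):
            results.append(tuple(part.strip() for part in line[1:-1].split(",")))
    return results

# The NER example output from the conversion walkthrough above
ner_output = "(青岛海牛队,组织机构)\n(广州松日队,组织机构)\nNAN\nNAN"
entities = parse_ie_output(ner_output)
```

The same helper handles RE triples such as `(喜剧之王,主演,周星驰)`, yielding three-element tuples.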

llama
22
16

OceanGPT-basic-4B-Instruct

license:mit
21
1

InstructCell-instruct

license:mit
19
2

KnowRL-DeepSeek-R1-Distill-Qwen-7B

license:mit
19
0

MolGen-7b

llama
11
9

knowlm-13b-base-v1.0

llama
9
5

DataMind-14B

license:apache-2.0
8
3

InstructCell-chat

license:mit
8
2

OceanGPT-coder-7B

license:mit
6
2

LightThinker-Qwen

6
1

OceanGPT-basic-8B

OceanGPT(沧渊): A Large Language Model for Ocean Science Tasks

Project • Paper • Models • Web • Quickstart • Citation

OceanGPT-basic is based on Qwen3 and has been trained on an English and Chinese dataset in the ocean domain (last updated 2025-05-06). Please note that the models and data in this repository are updated regularly to fix errors; the latest update date is added to the README for your reference.

- ❗We will continue to update.
- ❗Disclaimer: This project is purely an academic exploration rather than a product. Due to the inherent limitations of large language models, issues such as hallucination may occur.

OceanGPT (沧渊) is trained on top of open-sourced large language models including Qwen, MiniCPM, and LLaMA, and on open-sourced data and tools including Moos, UATD, the Forward-looking Sonar Detection Dataset, NKSID, SeabedObjects-KLSG, and Marine Debris.

- We did not optimize for identity, and the model may generate identity information similar to that of the Qwen/MiniCPM/LLaMA/GPT series models.
- The model's output is influenced by prompt tokens, which may lead to inconsistent results across attempts.
- Simulated embodied intelligence requires training with specific simulator code instructions (the simulator is subject to copyright restrictions and cannot be released for now), and these capabilities are currently quite limited.

Please cite the following paper if you use OceanGPT in your work.

license:mit
6
1

DataMind-7B

license:apache-2.0
5
3

chatcell-large

4
5

llama-molinst-protein-7b

llama
4
3

OceanGPT-basic-14B-v0.1

license:mit
4
2

ChineseGuard-3B

We release the following variants of our harmful content detection model: Run single-input inference using the ChineseGuard-3B model: To run inference on the entire ChineseHarm-Bench using ChineseGuard-3B and 8 NPUs: > For more configuration options (e.g., batch size, device selection, custom prompt templates), please refer to `singleinfer.py` and `batchinfer.py`. > > Note: The inference scripts support both NPU and GPU devices. Please cite our repository if you use ChineseGuard in your work. Thanks!

license:cc-by-nc-4.0
4
1

KnowSelf-Gemma2-2B-WebShop

license:mit
4
0

LightThinker-Llama

llama
4
0

OceanGPT-coder-0.6B

license:mit
3
2

OceanGPT-basic-2B-v0.1

license:mit
3
1

KnowSelf-Llama3.1-8B-ALFWorld

llama
3
1

KnowSelf-Gemma2-2B-ALFWorld

license:mit
3
0

chatcell-small

2
4

ChineseGuard-7B

We release the following variants of our harmful content detection model: Run single-input inference using the ChineseGuard-7B model: To run inference on the entire ChineseHarm-Bench using ChineseGuard-7B and 8 NPUs: > For more configuration options (e.g., batch size, device selection, custom prompt templates), please refer to `singleinfer.py` and `batchinfer.py`. > > Note: The inference scripts support both NPU and GPU devices. Please cite our repository if you use ChineseGuard in your work. Thanks!

license:cc-by-nc-4.0
2
4

knowlm-7b-base

llama
2
3

KnowSelf-Llama3.1-8B-WebShop

llama
2
0

zhixi-13b-diff

> This is the result of the weight difference between `Llama 13B` and `ZhiXi-13B`. You can click here to learn more.

With the rapid development of deep learning technology, large language models such as ChatGPT have made substantial strides in natural language processing. However, these models still face several challenges in acquiring and comprehending knowledge, including the difficulty of updating knowledge and potential knowledge discrepancies and biases, collectively known as knowledge fallacies. The KnowLM project tackles these issues by launching an open-source large-scale knowledgeable language model framework and releasing corresponding models.

The project's initial phase introduced a knowledge extraction LLM based on LLaMA, dubbed ZhiXi (智析, meaning intelligent analysis of data for knowledge extraction). To integrate Chinese understanding into the language model without compromising its inherent knowledge, we first (1) perform full-scale pre-training of LLaMA (13B) on Chinese corpora, improving the model's understanding of Chinese and its knowledge richness while retaining its original English and code capabilities; then (2) fine-tune the resulting model with an instruction dataset, strengthening its understanding of human instructions for knowledge extraction.

- ❗Please note that this project is still being optimized, and the model weights will be updated regularly to support new features and models!
- Centered on knowledge and large models, full-scale pre-training of a large model such as LLaMA is conducted on the purpose-built Chinese and English pre-training corpus.
- Based on KG2Instructions technology, knowledge extraction tasks, including NER, RE, and IE, are optimized and can be completed using human instructions.
- Using the built Chinese instruction dataset (approximately 1400K instances), LoRA fine-tuning is used to enhance the model's understanding of human instructions.
- The weights of the pre-trained model and of LoRA's instruction fine-tuning are open-sourced.
- The full-scale pre-training code (providing conversion, construction, and loading of large corpora) and the LoRA instruction fine-tuning code are open-sourced (multi-machine multi-GPU supported).

All weights have been uploaded to HuggingFace🤗. Note that all of the following results are based on `ZhiXi-13B-Diff`; if you downloaded `ZhiXi-13B-Diff-fp16`, there may be some variation in the results.

| Model Name | Train Method | Weight Type | Size | Download Link | Notes |
| --- | --- | --- | --- | --- | --- |
| ZhiXi-13B-Diff | Full pretraining | Differential weights | 48GB | HuggingFace GoogleDrive | Restoring the pre-trained weights (i.e. ZhiXi-13B) requires matching against the `LLaMA-13B` weights; please refer to here for specific instructions. |
| ZhiXi-13B-Diff-fp16 | Full pretraining | Differential weights (fp16) | 24GB | HuggingFace GoogleDrive | The main difference from `ZhiXi-13B-Diff` is the use of `fp16` storage, which reduces memory usage but may introduce slight differences from the weights obtained in our actual training, slightly impacting performance; please refer to here for usage instructions. |
| ZhiXi-13B-LoRA | LoRA instruction-tuning | LoRA weights | 251MB | HuggingFace GoogleDrive | Must be used together with ZhiXi-13B; please refer to here for instructions. |
| ZhiXi-7B Series | Coming soon | Coming soon | Coming soon | Coming soon | Coming soon |

NEWS

- \[June 2023\] The project name has been changed from CaMA to KnowLM.
- \[June 2023\] Release of the first version of the pre-trained weights and the LoRA weights.

Why is it called ZhiXi (智析)? In Chinese, "Zhi" (智) signifies intelligence, referencing the AI's advanced language understanding capabilities. "Xi" (析) means to analyze or extract, symbolizing the system's knowledge extraction feature. Together, ZhiXi (智析) epitomizes an intelligent system adept at dissecting and garnering knowledge, characteristics that align with our expectations of a highly knowledgeable model.

- Cases
  - Pretraining Cases
  - Information Extraction Cases
  - General Ability Cases
- Quick Start
  - Environment Configuration
  - Model Weights (Pretrain and LoRA)
  - Model Usage Guide
  - Information Extraction Prompt
- Training Details
  - Pretraining data and pretraining scripts
  - Instruction data and instruction-tuning scripts
- Limitations
- TODO List
- FAQ
- Acknowledgments/Contributors/Citations

Our pre-trained model has demonstrated certain abilities in instruction following, coding, and reasoning, as well as some translation capability, without any instruction fine-tuning; it has also acquired new knowledge. Below are some of our sample cases. If you wish to reproduce our examples and view the detailed decoding configuration, please first set up the environment and restore the weights, then follow the steps outlined here.

> In the following cases, text in bold represents the prompt, while non-bold text represents the model's output.
>
> Because the maximum inference length is set to 512, our cases fall into three situations:
>
> 1. Completed output: the model generates the termination token EOS and completes the output. We mark this with :whitecheckmark:.
> 2. Incomplete output: the output is cut off by the maximum inference length. We mark this with :eightspokedasterisk:.
> 3. Repeated output: we remove repeated content manually and mark it with :arrowleft:.
- 你很擅长将中文翻译成英文。将下面的句子翻译成英文:我们今天准备去西安看兵马俑。答案: We are going to see the Terracotta Warriors in Xi'an today.:whitecheckmark: - You are good at translating English into Chinese. Translate the following sentence into Chinese: Nothing is difficult to a willing heart. Answer: 什么都不难,只要心甘情愿。:whitecheckmark: - 你阅读过李白的所有诗歌。李白的《将进酒》的原文是:君不见黄河之水天上来,奔流到海不复回。 君不见高堂明镜悲白发,朝如青丝暮成雪。 人生得意须尽欢,莫使金樽空对月。 天生我材必有用,千金散尽还复来。 烹羊宰牛且为乐,会须一饮三百杯。 岑夫子,丹丘生,将进酒,君莫停。 与君歌一曲,请君为我侧耳听。 钟鼓馔玉何足贵,但愿长醉不复醒。 古来圣贤皆寂寞,惟有饮者留其名。 陈王昔时宴平乐,斗酒十千恣欢谑。 主人何为言少钱,径须沽取对君酌。 五花马,千金裘,呼儿将出换美酒,与尔同销万古愁。 唐代大诗人李白(701-762),字太白,号青莲居士。 祖籍陇西成纪(今甘肃天水),隋末其先人流寓西域碎叶(唐时属安西都护府,在今吉尔吉斯斯坦北部托克马克附近),李白即生于碎叶。 幼时随父迁居绵州昌隆(今四川江油)青莲乡:eightspokedasterisk: - 床前明月光,疑是地上霜。 举头望明月,低头思故乡。 这首诗出自唐代诗人李白的《静夜思》。 作品原文 床前明月光,疑是地上霜。 举头望明月,低头思故乡。 [1] 作者简介 李白(701~762)字太白,号青莲居士。 祖籍陇西成纪(今甘肃天水附近),先世于隋末流徙西域,李白即生于中亚碎叶(今巴尔喀什湖南面的楚河流域,唐时属安西都护府管辖)。 幼时随父迁居绵州昌隆(今四川江油)青莲乡。 他一生绝大部分在漫游中度过。 公元742年(天宝元年),因道士吴筠的推荐,被召至长安,供奉翰林。 文章风采,名动一时,颇为唐玄宗所赏识。 后因不能见容于权贵,在京仅三年,就弃官而去,仍然继续他那飘荡四方的流浪生活。 公元756年,即安史之乱发生的第二年,他感愤时艰,曾参加了永王李璘的幕府。 不幸,永王与肃宗发生了争夺帝位的斗争,失败之:eightspokedasterisk: - 自然语言处理是计算机科学的一个分支,它研究如何从自然语言中抽取出有意义的信息。 自然语言处理包括词法分析、句法分析、语义分析、语音识别和机器翻译等。 自然语言处理的目标是使计算机能够像人类一样理解自然语言。:whitecheckmark: - 你非常了解一些健康生活的习惯,请列举几个健康生活的建议: 1.每天坚持锻炼30分钟以上。 2.不吸烟,不酗酒。 3.少吃高脂肪食物。 4.多吃蔬菜和水果。 5.保证充足的睡眠。 6.保持良好的心情。 7.定期体检。 8.养成良好的卫生习惯。:whitecheckmark: - 你对中国的大学了解的非常多。请评价一下浙江大学是一所怎样的学校。 答:浙江大学是一所历史悠久、学科齐全、学术实力雄厚、师资力量强大、办学特色鲜明,在国内外具有重要影响的综合性研究型大学。 浙江大学是首批进入国家"211工程"和"985工程"建设的高校之一,也是国家"111计划"和"珠峰计划"重点建设的名牌大学。 浙江大学的前身是创建于1897年的求是书院,是中国人最早自己创办的新式高等学府之一。 1928年,求是书院改制为国立浙江大学。 1952年,浙江大学成为新中国创办的第一批全国重点大学。 1998年,浙江大学与杭州大学、浙江农业大学、浙江医科大学、浙江师范大学、浙江师范大学医学院合并组建新的浙江大学。 浙江大学现已发展成为一所包括文、法、理、工、农、医、管、艺等学科门类齐全的综合性研究型大学,是国家"985工程"和"211工程"重点建设的大学之一,也是国家"111计划"和"珠峰计划"重点建设的名牌大学。:whitecheckmark: - 你非常了解中国的大学。请介绍一下浙江大学。 浙江大学是中国著名的高等学府之一,它的前身是创建于1897年的求是书院,1928年改为国立浙江大学,1952年定名为浙江大学。 浙江大学是首批进入国家"211工程"和"985工程"重点建设的高校之一。 浙江大学有着悠久的历史和深厚的文化底蕴,在中国近现代教育史上占有重要地位。 浙江大学的前身求是书院创建于1897年,是中国人最早自己创办的新式高等学府之一。 1901年,求是书院更名为浙江大学堂。 1928年,浙江大学正式定名为国立浙江大学。 
1952年,浙江大学成为全国首批重点大学。 1958年,浙江大学成为全国首批博士、硕士学位授予单位。 1998年,浙江大学成为首批进入国家"211工程"重点建设的高校。 2001年,浙江大学成为首批进入国家"985工程"重点建设的高校。 2006年,浙江大学成为首批进入国家"111计划"重点建设的高校。 2017年,浙江大学成为首批进入国家"双一流"世界一流大学和一流学科建设高校。 浙江大学是一所研究型、综合性、:eightspokedasterisk: - Can you help me write a formal email to a potential business partner proposing a joint venture? Your answer: A: Dear Sir/Madam, I am writing to propose a joint venture between our two companies. I have attached a proposal for your consideration. Please let me know if you would like to discuss this further. Yours sincerely,:whitecheckmark: - You are now a doctor. Here are some tips for a healthy life. 1. Don't smoke. 2. Eat a balanced diet. 3. Exercise regularly. 4. Drink alcohol only in moderation. 5. Take care of your teeth. 6. Get enough sleep. 7. See your doctor for regular checkups. 8. Learn to manage stress. 9. Stay up-to-date on vaccinations. 10. Wash your hands often. 11. Avoid secondhand smoke. 12. Use sunscreen and wear sunglasses. 13. Know your family medical history. 14. Be aware of the signs of depression. 15. Protect your hearing. 16. Practice safe sex. 17. If you drink coffee, do so in moderation. 18. Limit your intake of caffeine. 19. Don't drive when you're tired. 20. Don't text and drive. 21. Don't drink and drive. 22. Don't ride a motorcycle without a helmet. 23. Don't swim alone. 24. Don't dive into water where you can't see the bottom. 25. Don't swim if you've been drinking alcohol. 26. Don't swim if you're sick. 27. Don't swim if you have open sores on your body. 28. Don't swim if there are strong currents or undertows. 29. Don't swim in polluted water. 30. Don't swim if there are snakes or alligators nearby. 31. Don't swim if there are jellyfish in the water. 32. Don't swim if there are sharks in the water. 33. Don't swim if there are riptides in the water. 34. Don't:eightspokedasterisk: - using java to sort an unsorted array. Answer: O(nlogn) \ using java to sort an unsorted array. 
Answer: O(nlogn)

```java
public static <T extends Comparable<T>> void sort(T[] arr) {
    if (arr == null || arr.length == 0) {
        return;
    }
    int n = arr.length;
    for (int i = 0; i < n - 1; i++) {
        for (int j = 0; j < n - 1 - i; j++) {
            if (arr[j].compareTo(arr[j + 1]) > 0) {
                T tmp = arr[j];
                arr[j] = arr[j + 1];
                arr[j + 1] = tmp;
            }
        }
    }
}
```
:whitecheckmark:

```python
def bubblesort(arr):
    """Bubble sort algorithm."""
    for i in range(len(arr) - 1, 0, -1):
        for j in range(len(arr) - 1, 0, -1):
            if arr[j] > arr[j + 1]:
                arr[j], arr[j + 1] = arr[j + 1], arr[j]
    return arr
```
:whitecheckmark:

- 这是我为我的学生申请浙江大学博士的推荐信: 尊敬的浙江大学博士研究生招生办公室主任: 您好! 我有幸认识并指导过一位非常优秀的研究生,他就是现在申请浙江大学博士研究生的李某。 我对他的了解和评价如下: 李某于2007年9月进入我的实验室从事博士研究生的学习和科研工作。 在此期间,他刻苦学习、勤奋工作,取得了优异的成绩。 他的学习成绩始终名列前茅,其博士论文被评为校级优秀博士论文。 在科研方面,他先后参与了国家自然科学基金项目、教育部博士点基金项目、中国博士后科学基金项目等多个项目的研究工作,发表SCI收录论文3篇,EI收录论文1篇,出版专著1部,申请国家发明专利1项。 他还积极参加社会实践活动,曾获得全国大学生“挑战杯”创业计划竞赛二等奖。 李某不仅学习成绩优异,而且品德高尚,具有良好的科研素质和创新精神。 他能够独立思考问题,善于总结经验教训,勇于承担责任。 :eightspokedasterisk:
- Here is the recommendation letter that I wrote for an application to a dragon feeder position at the Magic Unicorn Corporation: Dear recruiter, I am writing this letter of recommendation on behalf of my student, who is applying for a position as a dragon feeder at the Magic Unicorn Corporation. I have had the pleasure of teaching this student for the past three years, and I can say without hesitation that he is one of the best students I have ever had. He is hard-working, diligent, and intelligent. He is also a very good dragon feeder. In fact, he is the best dragon feeder I have ever had in my class. He has fed many dragons over the years, and they have all been very pleased with his work. I would not hesitate to recommend him for a position as a dragon feeder at the Magic Unicorn Corporation. Thank you for your consideration. Sincerely, Professor Snape P.S. Please don't tell anyone that I wrote this letter.
I don't want to get into any trouble with the Ministry of Magic. The Magic Unicorn Corporation (MUC) is a large multinational corporation headquartered in New York City. The company was founded in 1980 by a group of unicorns who wanted to start their own business. They were tired of working for other unicorns and decided to strike out on their own. Their first product was a line of unicorn-themed greeting cards. The cards were a huge success, and the company grew rapidly. By 1985, the company had expanded its product line to include unicorn-themed T-shirts, coffee mugs, and stuffed animals. In 1987, the company went public. Since then, the Magic Unicorn Corporation has become one of the most successful companies in the world. The Magic Unicorn Corporation employs thousands of people around the world. Most of the employees are unicorns, but the company also employs a small number of humans. Humans are hired to do jobs that unicorns cannot or will not do. For example, humans are hired to clean up after unicorns. Humans are also hired to feed the unicorns:eightspokedasterisk: - You are very familiar with the information of Chinese cities, such as the attractions, cuisine, and history of Chinese cities. Please introduce the city of Hangzhou. Hangzhou is the capital of Zhejiang Province in eastern China. It is located at the southern end of the Yangtze River Delta in southeastern China. It is one of the most famous tourist cities in China. It is also known as the "City of Silk", "City of Tea", "City of Flowers" and "City of Lakes". It has a population of 6.5 million people. It is the political, economic, cultural and transportation center of Zhejiang Province. It is also an important industrial base in China. In addition, Hangzhou is also a famous historical and cultural city in China. There are many well-known scenic spots in Hangzhou, such as West Lake, Lingyin Temple, Leifeng Pagoda, Bao'an Temple, Six Harmonies Pagoda, Peak Flying Clouds, etc. 
Hangzhou is also known for its delicious cuisine. Hangzhou cuisine is one of the eight major cuisines in China. It is characterized by fresh, sweet, sour, salty, and mellow. Hangzhou cuisine is mainly influenced by the local climate and geographical environment. The main ingredients used in Hangzhou cuisine are freshwater fish, shrimp, crab, pork, chicken, duck, beef, vegetables, and fruits. Hangzhou cuisine is divided into three categories: Hangzhou traditional cuisine, Hangzhou new cuisine, and Hangzhou snacks. Hangzhou traditional cuisine is mainly based on Hangzhou's long history and culture. Hangzhou new cuisine is mainly based on Hangzhou's geographical environment. Hangzhou snacks are mainly based on Hangzhou's local customs and habits. Hangzhou cuisine is rich in color, fragrance, taste, and nutrition. Hangzhou cuisine can be divided into two categories: Hangzhou traditional cuisine and Hangzhou new cuisine. Hangzhou traditional cuisine is mainly based on Hangzhou's long:eightspokedasterisk: - Question: Roger has 5 tennis balls. He buys 2 more cans of tennis balls. Each can has 3 tennis balls. How many tennis balls does he have now? Answer: Roger started with 5 balls. 2 cans of 3 each is 6 tennis balls. 5 + 6 = 11. The answer is 11. Question: The cafeteria had 23 apples. lf they used 20 to make lunch and bought 6 more, how many apples do they have? Answer: Cafeteria started with 23 apples. 20 apples were used to make lunch. 6 more apples were bought. 23 - 20 = 3. 3 + 6 = 9. The answer is 9. Question: If you have 4 marbles and I have 3 marbles, how many marbles do we have together? Answer: 4 + 3 = 7. The answer is 7.:arrowleft: The effectiveness of information extraction is illustrated in the following figure. We tested different instructions for different tasks as well as the same instructions for the same task, and achieved good results for all of them. 
Compared to other large models like ChatGPT, as shown in the graph, it can be observed that our model achieves more accurate and comprehensive extraction results. However, we have also identified some extraction errors in ZhiXi. In the future, we will continue to enhance the model's semantic understanding capabilities in both Chinese and English and introduce more high-quality instruction data to improve the model's performance. > We have selected 8 cases to validate the model's harmlessness, translation ability, comprehension, code capability, knowledge, creative ability, bilingual ability, and reasoning ability. 这首诗描绘了诗人在夜深人静的时候,面对明月而产生的思乡之情。诗人用“床前明月光,疑是地上霜”这两句诗来描绘月光的清澈明亮,同时也表达了诗人对故乡的思念之情。最后两句“举头望明月,低头思故乡”更是将诗人的思乡之情表现得淋漓尽致。 夏天是一个充满活力的季节, 它让我们感受到生命的律动。 让我们放下一切烦恼, 去感受夏天带给我们的快乐。 I am writing to inquire about the possibility of upgrading my reservation to a sea-view room. I have booked a standard room for my upcoming stay, but I would greatly appreciate the opportunity to enjoy the breathtaking views of the ocean from my room. I understand that sea-view rooms may be more expensive than standard rooms, but I am willing to pay the additional cost if it is possible to upgrade my reservation. Thank you for considering my request and I look forward to hearing back from you. 
(3x + 1) - 1 = 10 - 1 3x = 9 x = 3 shell conda create -n zhixi python=3.9 -y conda activate zhixi pip install torch==1.12.0+cu116 torchvision==0.13.0+cu116 torchaudio==0.12.0 --extra-index-url https://download.pytorch.org/whl/cu116 pip install -r requirements.txt shell |-- 13B | |-- checklist.chk | |-- consolidated.00.pth | |-- consolidated.01.pth | |-- params.json |-- llama.sh |-- tokenizer.model |-- tokenizerchecklist.chk shell python tools/download.py --downloadpath ./zhixi-diff --onlybase shell python tools/download.py --downloadpath ./zhixi-diff-fp16 --onlybase --fp16 shell python convertllamaweightstohf.py --inputdir ./ --modelsize 13B --outputdir ./converted shell python tools/weightdiff.py recover --pathraw ./converted --pathdiff ./zhixi-diff --pathtuned ./zhixi shell python tools/weightdiff.py recover --pathraw ./converted --pathdiff ./zhixi-diff-fp16 --pathtuned ./zhixi shell python tools/download.py --downloadpath ./LoRA --onlylora shell python examples/generatefinetune.py --basemodel ./zhixi shell python examples/generatelora.py --load8bit --basemodel ./zhixi --loraweights ./lora --runiecases shell python examples/generatelora.py --load8bit --basemodel ./zhixi --loraweights ./lora --rungeneralcases shell python examples/generatefinetune.py --basemodel ./zhixi --interactive shell python examples/generatefinetuneweb.py --basemodel ./zhixi shell python examples/generateloraweb.py --basemodel ./zhixi --loraweights ./lora bibtex @article{deepke-llm, author = {Ningyu Zhang, Jintian Zhang, Xiaohan Wang, Honghao Gui, Yinuo Jiang, Xiang Chen, Shengyu Mao, Shuofei Qiao, Zhen Bi, Jing Chen, Xiaozhuan Liang, Yixin Ou, Ruinan Fang, Zekun Xi, Xin Xu, Liankuan Tao, Lei Li, Peng Wang, Zhoubo Li, Guozhou Zheng, Huajun Chen}, title = {DeepKE-LLM: A Large Language Model Based Knowledge Extraction Toolkit}, year = {2023}, publisher = {GitHub}, journal = {GitHub repository}, howpublished = {\url{https://github.com/}}, } ``` We are very grateful to the following open source 
projects for their help:

llama · 1 · 23

chatcell-base · 1 · 2

OceanGPT-basic-7B-v0.3 · license:mit · 1 · 2

mt5-ie · license:mit · 1 · 1

zhixi-13b-lora

> This is the result of the `ZhiXi-13B` LoRA weights. You can click here to learn more.

With the rapid development of deep learning technology, large language models such as ChatGPT have made substantial strides in the realm of natural language processing. However, these expansive models still encounter several challenges in acquiring and comprehending knowledge, including the difficulty of updating knowledge and potential knowledge discrepancies and biases, collectively known as knowledge fallacies. The KnowLM project endeavors to tackle these issues by launching an open-source large-scale knowledgeable language model framework and releasing corresponding models.

The project's `initial phase` introduced a knowledge extraction LLM based on LLaMA, dubbed ZhiXi (智析, meaning intelligent analysis of data for information extraction). To integrate Chinese understanding into the language model without compromising its inherent knowledge, we first (1) perform full-scale pre-training of LLaMA (13B) on Chinese corpora, augmenting the model's understanding of Chinese and improving its knowledge richness while retaining its original English and code capabilities; then (2) we fine-tune the model obtained in the first step on an instruction dataset, bolstering its understanding of human instructions for knowledge extraction.

- ❗Please note that this project is still undergoing optimization, and the model weights will be regularly updated to support new features and models!
- Centered on knowledge and large models, full-scale pre-training of the large model (e.g., LLaMA) is conducted using the built Chinese & English pre-training corpus.
- Based on the technology of KG2Instructions, the knowledge extraction tasks, including NER, RE, and IE, are optimized and can be completed using human instructions.
- Using the built Chinese instruction dataset (approximately 1400K instructions), LoRA fine-tuning is used to enhance the model's understanding of human instructions.
- The weights of the pre-trained model and of LoRA's instruction fine-tuning are open-sourced.
- The full-scale pre-training code (providing conversion, construction, and loading of large corpora) and the LoRA instruction fine-tuning code are open-sourced (with support for multi-machine, multi-GPU training).

All weights have been uploaded to Hugging Face. The ZhiXi differential weights can be found here, and the LoRA weights can be found here.

Why is it called ZhiXi (智析)? In Chinese, "Zhi" (智) signifies intelligence, referencing the AI's advanced language understanding capabilities. "Xi" (析) means to analyze or extract, symbolizing the system's knowledge extraction feature. Together, ZhiXi (智析) epitomizes an intelligent system adept at dissecting and garnering knowledge, characteristics that align with our expectations of a highly knowledgeable model.

- Cases
  - Pretraining Cases
  - Information Extraction Cases
  - General Ability Cases
- Quick Start
  - Environment Configuration
  - Model Weights (Pretrain and LoRA)
  - Model Usage Guide
  - Information Extraction Prompt
- Training Details
  - Pretraining data and Pretraining scripts
  - Instruction data and Instruction-tuning scripts
- Limitations
- TODO List
- FAQ
- Acknowledgments/Contributors/Citations

Our pre-trained model has demonstrated certain abilities in instruction following, coding, and reasoning, as well as some translation capabilities, without any instruction fine-tuning. Additionally, it has acquired new knowledge. Below are some of our sample cases. If you wish to reproduce our examples and view the detailed decoding configuration, please first set up the environment and restore the weights, then follow the steps outlined here.

> In the following cases, text in bold represents the prompt, while non-bold text represents the model's output.
> Due to the maximum inference length being set to 512, our cases fall into three situations:
>
> 1. Completed output. The model generates the termination token EOS and completes the output. We mark this with :whitecheckmark:.
> 2. Incomplete output. The output is cut off due to the maximum inference length. We mark this with :eightspokedasterisk:.
> 3. Repeated output. We remove repeated content manually and mark it with :arrowleft:.

- 你很擅长将中文翻译成英文。将下面的句子翻译成英文:我们今天准备去西安看兵马俑。答案: We are going to see the Terracotta Warriors in Xi'an today.:whitecheckmark:
- You are good at translating English into Chinese. Translate the following sentence into Chinese: Nothing is difficult to a willing heart. Answer: 什么都不难,只要心甘情愿。:whitecheckmark:
- 你阅读过李白的所有诗歌。李白的《将进酒》的原文是:君不见黄河之水天上来,奔流到海不复回。 君不见高堂明镜悲白发,朝如青丝暮成雪。 人生得意须尽欢,莫使金樽空对月。 天生我材必有用,千金散尽还复来。 烹羊宰牛且为乐,会须一饮三百杯。 岑夫子,丹丘生,将进酒,君莫停。 与君歌一曲,请君为我侧耳听。 钟鼓馔玉何足贵,但愿长醉不复醒。 古来圣贤皆寂寞,惟有饮者留其名。 陈王昔时宴平乐,斗酒十千恣欢谑。 主人何为言少钱,径须沽取对君酌。 五花马,千金裘,呼儿将出换美酒,与尔同销万古愁。 唐代大诗人李白(701-762),字太白,号青莲居士。 祖籍陇西成纪(今甘肃天水),隋末其先人流寓西域碎叶(唐时属安西都护府,在今吉尔吉斯斯坦北部托克马克附近),李白即生于碎叶。 幼时随父迁居绵州昌隆(今四川江油)青莲乡:eightspokedasterisk:
- 床前明月光,疑是地上霜。 举头望明月,低头思故乡。 这首诗出自唐代诗人李白的《静夜思》。 作品原文 床前明月光,疑是地上霜。 举头望明月,低头思故乡。 [1] 作者简介 李白(701~762)字太白,号青莲居士。 祖籍陇西成纪(今甘肃天水附近),先世于隋末流徙西域,李白即生于中亚碎叶(今巴尔喀什湖南面的楚河流域,唐时属安西都护府管辖)。 幼时随父迁居绵州昌隆(今四川江油)青莲乡。 他一生绝大部分在漫游中度过。 公元742年(天宝元年),因道士吴筠的推荐,被召至长安,供奉翰林。 文章风采,名动一时,颇为唐玄宗所赏识。 后因不能见容于权贵,在京仅三年,就弃官而去,仍然继续他那飘荡四方的流浪生活。 公元756年,即安史之乱发生的第二年,他感愤时艰,曾参加了永王李璘的幕府。 不幸,永王与肃宗发生了争夺帝位的斗争,失败之:eightspokedasterisk:
- 自然语言处理是计算机科学的一个分支,它研究如何从自然语言中抽取出有意义的信息。 自然语言处理包括词法分析、句法分析、语义分析、语音识别和机器翻译等。 自然语言处理的目标是使计算机能够像人类一样理解自然语言。:whitecheckmark:
- 你非常了解一些健康生活的习惯,请列举几个健康生活的建议: 1.每天坚持锻炼30分钟以上。 2.不吸烟,不酗酒。 3.少吃高脂肪食物。 4.多吃蔬菜和水果。 5.保证充足的睡眠。 6.保持良好的心情。 7.定期体检。 8.养成良好的卫生习惯。:whitecheckmark:
- 你对中国的大学了解的非常多。请评价一下浙江大学是一所怎样的学校。 答:浙江大学是一所历史悠久、学科齐全、学术实力雄厚、师资力量强大、办学特色鲜明,在国内外具有重要影响的综合性研究型大学。 浙江大学是首批进入国家"211工程"和"985工程"建设的高校之一,也是国家"111计划"和"珠峰计划"重点建设的名牌大学。 浙江大学的前身是创建于1897年的求是书院,是中国人最早自己创办的新式高等学府之一。 1928年,求是书院改制为国立浙江大学。 
1952年,浙江大学成为新中国创办的第一批全国重点大学。 1998年,浙江大学与杭州大学、浙江农业大学、浙江医科大学、浙江师范大学、浙江师范大学医学院合并组建新的浙江大学。 浙江大学现已发展成为一所包括文、法、理、工、农、医、管、艺等学科门类齐全的综合性研究型大学,是国家"985工程"和"211工程"重点建设的大学之一,也是国家"111计划"和"珠峰计划"重点建设的名牌大学。:whitecheckmark: - 你非常了解中国的大学。请介绍一下浙江大学。 浙江大学是中国著名的高等学府之一,它的前身是创建于1897年的求是书院,1928年改为国立浙江大学,1952年定名为浙江大学。 浙江大学是首批进入国家"211工程"和"985工程"重点建设的高校之一。 浙江大学有着悠久的历史和深厚的文化底蕴,在中国近现代教育史上占有重要地位。 浙江大学的前身求是书院创建于1897年,是中国人最早自己创办的新式高等学府之一。 1901年,求是书院更名为浙江大学堂。 1928年,浙江大学正式定名为国立浙江大学。 1952年,浙江大学成为全国首批重点大学。 1958年,浙江大学成为全国首批博士、硕士学位授予单位。 1998年,浙江大学成为首批进入国家"211工程"重点建设的高校。 2001年,浙江大学成为首批进入国家"985工程"重点建设的高校。 2006年,浙江大学成为首批进入国家"111计划"重点建设的高校。 2017年,浙江大学成为首批进入国家"双一流"世界一流大学和一流学科建设高校。 浙江大学是一所研究型、综合性、:eightspokedasterisk: - Can you help me write a formal email to a potential business partner proposing a joint venture? Your answer: A: Dear Sir/Madam, I am writing to propose a joint venture between our two companies. I have attached a proposal for your consideration. Please let me know if you would like to discuss this further. Yours sincerely,:whitecheckmark: - You are now a doctor. Here are some tips for a healthy life. 1. Don't smoke. 2. Eat a balanced diet. 3. Exercise regularly. 4. Drink alcohol only in moderation. 5. Take care of your teeth. 6. Get enough sleep. 7. See your doctor for regular checkups. 8. Learn to manage stress. 9. Stay up-to-date on vaccinations. 10. Wash your hands often. 11. Avoid secondhand smoke. 12. Use sunscreen and wear sunglasses. 13. Know your family medical history. 14. Be aware of the signs of depression. 15. Protect your hearing. 16. Practice safe sex. 17. If you drink coffee, do so in moderation. 18. Limit your intake of caffeine. 19. Don't drive when you're tired. 20. Don't text and drive. 21. Don't drink and drive. 22. Don't ride a motorcycle without a helmet. 23. Don't swim alone. 24. Don't dive into water where you can't see the bottom. 25. Don't swim if you've been drinking alcohol. 26. Don't swim if you're sick. 27. Don't swim if you have open sores on your body. 28. 
Don't swim if there are strong currents or undertows. 29. Don't swim in polluted water. 30. Don't swim if there are snakes or alligators nearby. 31. Don't swim if there are jellyfish in the water. 32. Don't swim if there are sharks in the water. 33. Don't swim if there are riptides in the water. 34. Don't:eightspokedasterisk: - using java to sort an unsorted array. Answer: O(nlogn) \ using java to sort an unsorted array. Answer: O(nlogn) public static \> void sort(T\[\] arr) {     if (arr == null || arr.length == 0) {         return;     }     int n = arr.length;     for (int i = 0; i         for (int j = 0; j             if (arr[j].compareTo(arr[j + 1]) > 0) {                 T tmp = arr[j];                 arr[j] = arr[j + 1];                 arr[j + 1] = tmp;             }         }     } }:whitecheckmark: def bubblesort(arr):     """Bubble sort algorithm.     """ for i in range(len(arr) - 1, 0, -1):         for j in range(len(arr) - 1, 0, -1):             if arr[j] > arr[j + 1]:                 arr[j], arr[j + 1] = arr[j + 1], arr[j]     return arr:whitecheckmark: - 这是我为我的学生申请浙江大学博士的推荐信: 尊敬的浙江大学博士研究生招生办公室主任: 您好! 我有幸认识并指导过一位非常优秀的研究生,他就是现在申请浙江大学博士研究生的李某。 我对他的了解和评价如下: 李某于2007年9月进入我的实验室从事博士研究生的学习和科研工作。 在此期间,他刻苦学习、勤奋工作,取得了优异的成绩。 他的学习成绩始终名列前茅,其博士论文被评为校级优秀博士论文。 在科研方面,他先后参与了国家自然科学基金项目、教育部博士点基金项目、中国博士后科学基金项目等多个项目的研究工作,发表SCI收录论文3篇,EI收录论文1篇,出版专著1部,申请国家发明专利1项。 他还积极参加社会实践活动,曾获得全国大学生“挑战杯”创业计划竞赛二等奖。 李某不仅学习成绩优异,而且品德高尚,具有良好的科研素质和创新精神。 他能够独立思考问题,善于总结经验教训,勇于承担责任。 :eightspokedasterisk: - Here is the recommendation letter that I wrote for an application to a dragon feeder position at the Magic Unicorn Corporation: Dear recruiter, I am writing this letter of recommendation on behalf of my student, who is applying for a position as a dragon feeder at the Magic Unicorn Corporation. I have had the pleasure of teaching this student for the past three years, and I can say without hesitation that he is one of the best students I have ever had. 
He is hard-working, diligent, and intelligent. He is also a very good dragon feeder. In fact, he is the best dragon feeder I have ever had in my class. He has fed many dragons over the years, and they have all been very pleased with his work. I would not hesitate to recommend him for a position as a dragon feeder at the Magic Unicorn Corporation. Thank you for your consideration. Sincerely, Professor Snape P.S. Please don't tell anyone that I wrote this letter. I don't want to get into any trouble with the Ministry of Magic. The Magic Unicorn Corporation (MUC) is a large multinational corporation headquartered in New York City. The company was founded in 1980 by a group of unicorns who wanted to start their own business. They were tired of working for other unicorns and decided to strike out on their own. Their first product was a line of unicorn-themed greeting cards. The cards were a huge success, and the company grew rapidly. By 1985, the company had expanded its product line to include unicorn-themed T-shirts, coffee mugs, and stuffed animals. In 1987, the company went public. Since then, the Magic Unicorn Corporation has become one of the most successful companies in the world. The Magic Unicorn Corporation employs thousands of people around the world. Most of the employees are unicorns, but the company also employs a small number of humans. Humans are hired to do jobs that unicorns cannot or will not do. For example, humans are hired to clean up after unicorns. Humans are also hired to feed the unicorns:eightspokedasterisk: - You are very familiar with the information of Chinese cities, such as the attractions, cuisine, and history of Chinese cities. Please introduce the city of Hangzhou. Hangzhou is the capital of Zhejiang Province in eastern China. It is located at the southern end of the Yangtze River Delta in southeastern China. It is one of the most famous tourist cities in China. 
It is also known as the "City of Silk", "City of Tea", "City of Flowers" and "City of Lakes". It has a population of 6.5 million people. It is the political, economic, cultural and transportation center of Zhejiang Province. It is also an important industrial base in China. In addition, Hangzhou is also a famous historical and cultural city in China. There are many well-known scenic spots in Hangzhou, such as West Lake, Lingyin Temple, Leifeng Pagoda, Bao'an Temple, Six Harmonies Pagoda, Peak Flying Clouds, etc. Hangzhou is also known for its delicious cuisine. Hangzhou cuisine is one of the eight major cuisines in China. It is characterized by fresh, sweet, sour, salty, and mellow. Hangzhou cuisine is mainly influenced by the local climate and geographical environment. The main ingredients used in Hangzhou cuisine are freshwater fish, shrimp, crab, pork, chicken, duck, beef, vegetables, and fruits. Hangzhou cuisine is divided into three categories: Hangzhou traditional cuisine, Hangzhou new cuisine, and Hangzhou snacks. Hangzhou traditional cuisine is mainly based on Hangzhou's long history and culture. Hangzhou new cuisine is mainly based on Hangzhou's geographical environment. Hangzhou snacks are mainly based on Hangzhou's local customs and habits. Hangzhou cuisine is rich in color, fragrance, taste, and nutrition. Hangzhou cuisine can be divided into two categories: Hangzhou traditional cuisine and Hangzhou new cuisine. Hangzhou traditional cuisine is mainly based on Hangzhou's long:eightspokedasterisk: - Question: Roger has 5 tennis balls. He buys 2 more cans of tennis balls. Each can has 3 tennis balls. How many tennis balls does he have now? Answer: Roger started with 5 balls. 2 cans of 3 each is 6 tennis balls. 5 + 6 = 11. The answer is 11. Question: The cafeteria had 23 apples. lf they used 20 to make lunch and bought 6 more, how many apples do they have? Answer: Cafeteria started with 23 apples. 20 apples were used to make lunch. 
6 more apples were bought. 23 - 20 = 3. 3 + 6 = 9. The answer is 9. Question: If you have 4 marbles and I have 3 marbles, how many marbles do we have together? Answer: 4 + 3 = 7. The answer is 7.:arrowleft: The effectiveness of information extraction is illustrated in the following figure. We tested different instructions for different tasks as well as the same instructions for the same task, and achieved good results for all of them. Compared to other large models like ChatGPT, as shown in the graph, it can be observed that our model achieves more accurate and comprehensive extraction results. However, we have also identified some extraction errors in ZhiXi. In the future, we will continue to enhance the model's semantic understanding capabilities in both Chinese and English and introduce more high-quality instruction data to improve the model's performance. > We have selected 8 cases to validate the model's harmlessness, translation ability, comprehension, code capability, knowledge, creative ability, bilingual ability, and reasoning ability. 这首诗描绘了诗人在夜深人静的时候,面对明月而产生的思乡之情。诗人用“床前明月光,疑是地上霜”这两句诗来描绘月光的清澈明亮,同时也表达了诗人对故乡的思念之情。最后两句“举头望明月,低头思故乡”更是将诗人的思乡之情表现得淋漓尽致。 夏天是一个充满活力的季节, 它让我们感受到生命的律动。 让我们放下一切烦恼, 去感受夏天带给我们的快乐。 I am writing to inquire about the possibility of upgrading my reservation to a sea-view room. I have booked a standard room for my upcoming stay, but I would greatly appreciate the opportunity to enjoy the breathtaking views of the ocean from my room. I understand that sea-view rooms may be more expensive than standard rooms, but I am willing to pay the additional cost if it is possible to upgrade my reservation. Thank you for considering my request and I look forward to hearing back from you. 
(3x + 1) - 1 = 10 - 1
3x = 9
x = 3

```shell
# set up the environment
conda create -n zhixi python=3.9 -y
conda activate zhixi
pip install torch==1.12.0+cu116 torchvision==0.13.0+cu116 torchaudio==0.12.0 --extra-index-url https://download.pytorch.org/whl/cu116
pip install -r requirements.txt
```

```shell
# expected layout of the original LLaMA-13B checkpoint
|-- 13B
|   |-- checklist.chk
|   |-- consolidated.00.pth
|   |-- consolidated.01.pth
|   |-- params.json
|-- llama.sh
|-- tokenizer.model
|-- tokenizer_checklist.chk
```

```shell
# download the ZhiXi differential weights (fp32, or fp16 with --fp16)
python tools/download.py --download_path ./zhixi-diff --only_base
python tools/download.py --download_path ./zhixi-diff-fp16 --only_base --fp16
```

```shell
# convert the original LLaMA weights to the Hugging Face format
python convert_llama_weights_to_hf.py --input_dir ./ --model_size 13B --output_dir ./converted
```

```shell
# recover the ZhiXi weights by adding the diff to the converted base
python tools/weight_diff.py recover --path_raw ./converted --path_diff ./zhixi-diff --path_tuned ./zhixi
python tools/weight_diff.py recover --path_raw ./converted --path_diff ./zhixi-diff-fp16 --path_tuned ./zhixi
```

```shell
# download the LoRA weights
python tools/download.py --download_path ./LoRA --only_lora
```

```shell
# generation with the pre-trained model, or with the LoRA weights
python examples/generate_finetune.py --base_model ./zhixi
python examples/generate_lora.py --load_8bit --base_model ./zhixi --lora_weights ./lora --run_ie_cases
python examples/generate_lora.py --load_8bit --base_model ./zhixi --lora_weights ./lora --run_general_cases
```

```shell
# interactive and web demos
python examples/generate_finetune.py --base_model ./zhixi --interactive
python examples/generate_finetune_web.py --base_model ./zhixi
python examples/generate_lora_web.py --base_model ./zhixi --lora_weights ./lora
```

```bibtex
@article{cama,
  author = {Jintian Zhang, Xiaohan Wang, Honghao Gui, Xiang Chen, Yinuo Jiang, Zhen Bi, Jing Chen, Shengyu Mao, Shuofei Qiao, Xiaozhuan Liang, Yixin Ou, Ruinan Fang, Zekun Xi, Shumin Deng, Huajun Chen, Ningyu Zhang},
  title = {DeepKE-LLM: A Large Language Model Based Knowledge Extraction Toolkit},
  year = {2023},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/}},
}
```

We are very grateful to the following open source projects for their help:
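The recover commands above reconstruct the tuned ZhiXi weights by adding the released weight diff to the raw base weights, parameter by parameter. A minimal sketch of that idea (a hypothetical helper, not the project's actual tooling; `recover` and its arguments are illustrative names):

```python
def recover(base_state_dict, diff_state_dict):
    """Rebuild tuned weights as base + diff, parameter by parameter.

    Hypothetical sketch of the weight-diff recovery step; works on any
    mapping from parameter names to values that support `+` (e.g. torch
    tensors, numpy arrays, or plain numbers).
    """
    # Both checkpoints must describe exactly the same set of parameters.
    mismatch = set(base_state_dict) ^ set(diff_state_dict)
    if mismatch:
        raise KeyError(f"base/diff parameter names differ: {sorted(mismatch)}")
    # Element-wise sum recreates the tuned checkpoint.
    return {name: base_state_dict[name] + diff_state_dict[name]
            for name in base_state_dict}
```

Releasing only a diff keeps the distribution compatible with the original LLaMA license: anyone who already holds the base weights can reproduce the tuned model, while the diff alone is unusable.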

license:apache-2.0 · 0 · 22

- llama-molinst-molecule-7b · license:apache-2.0 · 0 · 12
- llama-molinst-biotext-7b · license:apache-2.0 · 0 · 7
- llama2-molinst-molecule-7b · license:apache-2.0 · 0 · 6
- llama-7b-lora-ie · license:mit · 0 · 5
- zhixi-13b-diff-fp16 · llama · 0 · 5
- baichuan2-13b-iepile-lora · license:mit · 0 · 5
- llama3-instruct-molinst-molecule-8b · license:apache-2.0 · 0 · 5
- llama3-8b-iepile-lora · license:mit · 0 · 5
- llama2-molinst-biotext-7b · license:apache-2.0 · 0 · 4
- knowlm-13b-ie-lora · 0 · 4
- KnowPrompt · license:apache-2.0 · 0 · 3
- alpaca-13b-lora-ie · license:apache-2.0 · 0 · 3
- knowlm-7b-chat · llama · 0 · 3
- OneGen-EntityLinking-Llama2-7B · llama · 0 · 3
- OneGen-MultiHop-Llama2-7B · llama · 0 · 3
- KGEditor · 0 · 2
- llama2-13b-iepile-lora · license:mit · 0 · 2
- HalDet-llava-7b · license:mit · 0 · 2
- llama3-instruct-molinst-biotext-8b · license:apache-2.0 · 0 · 2
- OceanGPT-basic-7B-v0.2 · license:mit · 0 · 2
- WKM-mistral-alfworld · 0 · 2
- WKM-mistral-sciworld-agent · license:apache-2.0 · 0 · 2
- OneGen-SelfRAG-Llama2-7B · llama · 0 · 2
- OneGenEmbedding · license:mit · 0 · 2
- alpaca-7b-lora-ie · license:apache-2.0 · 0 · 1
- qwen1.5-14b-iepile-lora · license:mit · 0 · 1
- HalDet-llava-13b · llava_llama · 0 · 1
- mistral_alfworld_agent_model_lora · license:apache-2.0 · 0 · 1
- WKM-mistral-alfworld-agent · license:apache-2.0 · 0 · 1
- WKM-mistral-webshop · license:apache-2.0 · 0 · 1
- WKM-mistral-webshop-agent · license:apache-2.0 · 0 · 1
- WKM-mistral-sciworld · license:apache-2.0 · 0 · 1
- WorfGen-7B-Qwen · license:mit · 0 · 1
- WorfGen-7B-InternLM · license:mit · 0 · 1
- OneKE-gguf · license:apache-2.0 · 0 · 1
- AutoSteer · license:mit · 0 · 1