VOOZH about

URL: https://huggingface.co/divyeshkamalanaban/nucleotide-transformer-2.5b-multi-species-NF4-Q4

⇱ divyeshkamalanaban/nucleotide-transformer-2.5b-multi-species-NF4-Q4 · Hugging Face


Nucleotide Transformer 2.5B NF4

This model is an NF4 (Normal Float 4-bit) quantized version of the base model InstaDeepAI/nucleotide-transformer-2.5b-multi-species. The checkpoint was quantized using the BitsAndBytes library with double quantization enabled and BF16 computation.

The goal of this release is to significantly reduce memory and storage requirements while preserving the representation quality of the original model, making large-scale genomic foundation models more accessible on consumer and research hardware.

Base Model

This checkpoint is derived from:

  • InstaDeepAI/nucleotide-transformer-2.5b-multi-species

Please refer to the original model card for details regarding training data, architecture, intended use cases, limitations, and downstream biological applications.

Quantization Details

  • Quantization Method: BitsAndBytes NF4
  • Precision: 4-bit
  • Quantization Type: NF4 (Normal Float 4)
  • Double Quantization: Enabled
  • Compute Dtype: BF16

Benchmark Summary

The quantized model was evaluated against the original BF16 model using hidden-state similarity and embedding preservation metrics.

Metric Value
Embedding Cosine Similarity 0.9868
Hidden State MAE 0.1435
Hidden State L2 Distance 119.70

These results indicate that the quantized model closely preserves the representations produced by the original checkpoint while substantially reducing memory requirements.

Intended Use

This model is suitable for:

  • Genomic sequence embeddings
  • DNA representation learning
  • Biological sequence analysis
  • Taxonomic and biodiversity research
  • Downstream bioinformatics tasks
  • Resource-constrained inference environments

Usage

from transformers import AutoModelForMaskedLM, AutoTokenizer

model_id = "divyeshkamalanaban/nucleotide-transformer-2.5b-nf4"

tokenizer = AutoTokenizer.from_pretrained(model_id)

model = AutoModelForMaskedLM.from_pretrained(
 model_id,
 device_map="auto",
)

sequence = "ACGT" * 250

inputs = tokenizer(
 sequence,
 return_tensors="pt",
)

outputs = model(**inputs)

Acknowledgements

This work builds upon the Nucleotide Transformer project developed by InstaDeep and collaborators. All credit for the original model architecture, training pipeline, datasets, and research contributions belongs to the original authors.

This repository provides a community-maintained NF4 quantized checkpoint intended to improve deployment efficiency and accessibility.

Downloads last month
41
Safetensors
Model size
3B params
Tensor type
F32
·
U8
·

Model tree for divyeshkamalanaban/nucleotide-transformer-2.5b-multi-species-NF4-Q4

Quantized
(1)
this model

Datasets used to train divyeshkamalanaban/nucleotide-transformer-2.5b-multi-species-NF4-Q4