Image Classification • 92.9M • Updated • 19
Search is not available for this dataset
image imagewidth (px) 611 1.47k | label class label 6
classes |
|---|---|
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD | |
0AD |
End of preview. Expand in Data Studio
Document-Type-Detection
Dataset Summary
The Document-Type-Detection dataset is a large-scale image classification dataset consisting of scanned or photographed document images. Each image is categorized into one of nine document types. This dataset is ideal for training document classification models in finance, administration, OCR, and automation workflows.
Supported Tasks
Multiclass Document Classification Classify an input document image into one of the predefined document types.
Dataset Type: Image Classification
Task: Document Type Detection (Multiclass)
License: Apache 2.0
Each document class is stored in its corresponding folder.
Dataset Structure
| Feature | Type | Description |
|---|---|---|
| image | Image | Document image (variable resolution) |
| label | Category | Integer from 0 to 8 representing class |
Split:
train: Full dataset in a single training split.
Example Usage
from datasets import load_dataset
dataset = load_dataset("prithivMLmods/Document-Type-Detection")
sample = dataset["train"][0]
image = sample["image"]
label = sample["label"]
License
This dataset is distributed under the Apache 2.0 License.
- Downloads last month
- 38
