URL: https://github.com/The-Data-Dilemma

The Data Dilemma 🚀

Open Source AI Solutions • Real-World Impact
Open to collaboration: let's build something impactful together! 🤝

Hugging Face
LinkedIn
Email

✨ About

Building open-source AI tools that solve real-world problems across multiple domains. We specialize in multilingual AI, privacy-preserving solutions, and deployable models with a focus on underserved communities and languages.

🎯 Core Focus: Open Source β€’ Multilingual AI β€’ Privacy-First β€’ Real-World Deployment


🚀 Featured Projects

🎙️ Medibeng-Orpheus TTS

Fine-tuned TTS for Bengali-English code-switching. Built on Orpheus + LLaMA-3b.

MediBeng Whisper Tiny

Lightweight ASR for Bengali-English conversations and translation.

🛡️ MediRag Guard

Privacy-focused RAG system with comprehensive data insights.

GroqStreamChain

Real-time AI chat with FastAPI + WebSocket + Groq.
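To make the streaming idea behind the real-time chat project concrete, here is a minimal standard-library sketch. It is not the project's code: `fake_model_stream`, `handle_message`, and the in-memory `sessions` store are hypothetical names, and the fake stream stands in for a call to a hosted model. In a web app, the `send` callback would typically be a WebSocket send.

```python
import asyncio
from collections import defaultdict
from typing import AsyncIterator, Awaitable, Callable

# Hypothetical in-memory session store: session_id -> list of (role, text).
sessions: dict[str, list[tuple[str, str]]] = defaultdict(list)


async def fake_model_stream(prompt: str) -> AsyncIterator[str]:
    """Stand-in for a streaming model call; yields the reply word by word."""
    for word in f"You said: {prompt}".split():
        await asyncio.sleep(0)  # hand control back to the loop, as a network read would
        yield word + " "


async def handle_message(
    session_id: str,
    prompt: str,
    send: Callable[[str], Awaitable[None]],
) -> str:
    """Forward each chunk through `send` as it arrives, then record the turn."""
    sessions[session_id].append(("user", prompt))
    chunks: list[str] = []
    async for chunk in fake_model_stream(prompt):
        await send(chunk)  # assumption: a WebSocket send in a real deployment
        chunks.append(chunk)
    reply = "".join(chunks).strip()
    sessions[session_id].append(("assistant", reply))
    return reply


async def demo() -> list[str]:
    """Collect the streamed chunks in a list instead of sending them anywhere."""
    received: list[str] = []

    async def send(chunk: str) -> None:
        received.append(chunk)

    await handle_message("demo-session", "hello there", send)
    return received


if __name__ == "__main__":
    print(asyncio.run(demo()))
```

Sending chunks as they arrive, rather than waiting for the full reply, is what keeps perceived latency low. Keeping the history per session ID lets later turns reuse earlier context.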


📄 Research


🀝 Contributing

Code: Fork β†’ Branch β†’ PR
Research: Test models, share datasets, collaborate
Community: Documentation, tutorials, discussions

git clone https://github.com/The-Data-Dilemma/[repo-name]
# Make changes, commit, push, PR

🛠️ Tech Stack

AI: HuggingFace • Whisper • Orpheus • PyTorch • Unsloth
Web: FastAPI • WebSocket • Docker
Focus: Multilingual • Code-switching • Privacy • Deployment


📊 Impact

🚀 Open Source Projects: 4+
Research Papers: 3+
🌍 Languages: Bengali + English (expanding)
⭐ Community Stars: 50+

Contact

Email: me@promila.info
Location: Khulna, Bangladesh 🇧🇩
LinkedIn: The Data Dilemma


🚀 Explore Projects • 💬 Join Community • 🌐 Visit Website

Open-source AI for everyone, one commit at a time ❤️

Pinned

  1. GroqStreamChain is a real-time AI-powered chat app using FastAPI, WebSocket, and Groq. It streams AI responses for interactive, low-latency communication with session management and a clean, respon…

    Python · 32 stars · 10 forks

  2. MediBeng Whisper Tiny improves doctor-patient transcription by training the Whisper Tiny model to translate mixed Bengali-English speech into English, making it easier for analysis, record-keeping,…

    Python · 29 stars · 3 forks

  3. MediRag-Guard Public

    A RAG Proof of Concept that delivers comprehensive, context-aware insights on healthcare data privacy through a novel knowledge tree.

    Python · 14 stars

  4. ParquetToHuggingFace processes raw audio data, converts it into Parquet files, and uploads them to Hugging Face. The README explains how to set up the environment, configure paths, and run the scri…

    Python · 9 stars · 4 forks

  5. Medibeng-Orpheus-3b-0.1-ft: A TTS model for bilingual Bengali-English code-switching in healthcare, fine-tuned for seamless patient-doctor interactions.

    Python · 6 stars · 1 fork

Repositories

Showing 6 of 6 repositories