VOOZH

URL: https://github.com/topics/docx-parser

⇱ docx-parser · GitHub Topics · GitHub

#

docx-parser

Here are 24 public repositories matching this topic...

ispras / dedoc

Dedoc is a library (service) for automate documents parsing and bringing to a uniform format. It automatically extracts content, logical structure, tables, and meta information from textual electronic documents. (Parse document; Document content extraction; Logical structure extraction; PDF parser; Scanned document parser; DOCX parser; HTML parser

html pdf ocr table-of-contents excel html-parser docx documents doc scanned-documents txt document-analysis odt pdf-parser table-recognition docx-parser document-content-extraction logical-structure-extraction

Updated
Python

erdos / stencil

templating engine for DOCX and PPTX files

template-engine docx ooxml pptx ooxml-parser docx-generator docx-template docx-parser

Updated
Clojure

has-abi / docparser

Extract text from your DOCX documents.

text-parser document-parser doc-parser docx-parser

Updated
Python

PranavMishra17 / Resume-Craft-Pro

A full-stack AI web app for Resume customization with intelligent agentic architecture

react nlp mcp nextjs job-search resume-builder ai-agents tailwindcss gemini-api docx-parser resume-optimization keyword-extractio

Updated
TypeScript

sarabjit1003 / resume-tracker

A smart resume screening tool that matches resumes to job descriptions using Streamlit and Python.

python data-analysis pdf-parser job-matching ai-project streamlit docx-parser career-tools resume-tracker portfolio-proj

Updated
Python

lukethacoder / docx-to-html

📃 A GUI based docx to html parser. Useful for ripping out inline styles of docx files.

docx rich-text docx-parser

Updated
HTML

omar2535 / BioLife-AU-01-attendance-parser

Biolife-AU-01 打卡鐘解析程序

parser html-parser docx docx-parser

Updated
HTML

👁 xsukax-Word-Document-Comparison-Tool

xsukax / xsukax-Word-Document-Comparison-Tool

A powerful, privacy-focused web application for side-by-side comparison of Word documents with intelligent diff highlighting, comprehensive analytics, and multilingual support including Arabic and RTL languages.

word-diff visual-diff document-analysis document-processing single-page-application diff-viewer change-tracking comparison-tool word-processor document-comparison docx-parser office-documents rtl-support content-comparison word-document-diff docx-comparison side-by-side-diff text-difference-viewer file-comparison-tool similarity-checker

Updated
Python

toe-dot-tech / CBT-Quiz-Windows

Enterprise-grade Windows desktop application for Computer-Based Testing with real-time monitoring, automated grading, and offline capabilities.

dart windows-desktop flutter csv-parser shelf cbt school-management realtime-monitoring education-technology examination-system assessment-tool docx-parser riverpod offiline-first testing-platform assesment-tool

Updated
JavaScript

Talabov / Resume-Parser-API

Extract key details from resumes (PDF or DOCX) via a fast Flask API. Returns name, contact info, skills, experience, and education in clean JSON.

resume-parser flask-api pdf-processing docx-parser hr-tools

Updated

EDeev / api_processor

Django REST API для транскрипции аудио и извлечения данных из документов с gRPC интеграцией

python nlp machine-learning django ffmpeg rest-api grpc speech-recognition pdf-parser document-processing backend-api file-processing audio-transcription vosk docx-parser

Updated
Python

FayazK / Document-Metadata-Extractor

A Python tool that uses Google's Gemini AI to automatically extract structured metadata from PDF and DOCX documents, saving results to Excel for easy analysis and organizing raw responses as JSON files.

nlp text-analysis data-extraction document-management metadata-extraction pdf-parser document-processing excel-export json-output python-automation docx-parser generative-ai gemini-ai-project content-indexing

Updated
Python

simons-hub / rust-word-analyzer

Rust word frequency analyzer using custom linked list data structures in Rust — an educational project exploring Rust ownership, unsafe, and pointer semantics

rust linked-list educational merge-sort data-structures-and-algorithms word-frequency unsafe-rust docx-parser

Updated
Rust

gawankarsanket / n8n-universal-document-parser

Universal document parser for n8n that converts multiple document formats (PDF, DOCX, DOC, Google Docs, TXT) into plain text using the Google Drive API. This workflow eliminates the need for separate parsing nodes by using Google’s conversion engine to extract text from any supported document type

automation text-extraction workflow-automation pdf-parser google-drive-api document-parser google-docs-api n8n docx-parser n8n-workflow ai-automation word-parser

Updated

coffeemesh / compareFootnotes

Small script for comparing footnotes on .docx files. Resulting in a .csv

python script compare-text docx-parser docx2python

Updated
Python

cuiyuheng / docling

🥚 Transform PDF to JSON or Markdown with ease and speed 🐣

ai markdown-parser html-parser pdf-parser image-parser docx-parser

Updated
Python

kchernokozinsky / paper-sage

AI-powered student assignment evaluator written in Rust. Supports code, PDF, and DOCX files. Uses local or remote LLMs to grade submissions based on configurable criteria, and exports results to Excel.

rust education ai grading openai code-review gpt student-assignments cli-tool excel-export pdf-processing docx-parser llm ollama automated-evaluation

Updated
Rust

MVjimboUniversity / parsers

python pdf-parser beautifulsoup4 docx-parser

Updated
Python

Imtiazsalaf-01 / Automated-Resume-Parser

Automated Resume Parser – Built at Codec Technologies during internship. Designed an intelligent parser that extracts candidate details (name, contact, skills, experience, education) from PDF/DOCX resumes and converts them into structured data, enhancing recruitment efficiency.

python nlp information-retrieval automation regex text-extraction file-handling data-extraction structured-data data-preprocessing resume-parser pdf-parser docx-parser recruitment-technology codec-technologies

Updated
Python

sbecker11 / resume-flyer

nodejs javascript typescript 3d pdf-parser claude single-page-application vue3 vite motion-parallax docx-parser resume-viewer resume-editor anthropic color-palette-selector

Updated
JavaScript

Improve this page

Add a description, image, and links to the docx-parser topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the docx-parser topic, visit your repo's landing page and select "manage topics."

You can’t perform that action at this time.