VOOZH about

URL: https://apify.com/parseforge/etymonline-scraper

⇱ Etymonline Word Etymology Scraper Β· Apify


πŸ‘ Etymonline Word Etymology Scraper avatar

Etymonline Word Etymology Scraper

Pricing

from $13.00 / 1,000 result items

Go to Apify Store

Etymonline Word Etymology Scraper

Pull word etymologies from the Online Etymology Dictionary. Returns headword, part of speech, etymology essay, related cross-references, century of origin, and direct URL. Search by keyword or look up specific words. Useful for linguists, writers, dictionary apps.

Pricing

from $13.00 / 1,000 result items

Rating

0.0

(0)

Developer

πŸ‘ ParseForge

ParseForge

Maintained by Community

Actor stats

0

Bookmarked

3

Total users

1

Monthly active users

a month ago

Last modified

Share

πŸ‘ ParseForge Banner

πŸ“œ Etymonline Word Etymology Scraper

πŸš€ Pull word etymologies from the Online Etymology Dictionary: headword, etymology essay, century of origin, related words.

πŸ•’ Last updated: 2026-05-07 Β· πŸ“Š 11 fields per record Β· 50,000+ word etymologies Β· headword, part of speech, etymology essay, related cross-references, century of origin

The Etymonline Word Etymology Scraper pulls word histories from the Online Etymology Dictionary, the most cited free source for English word origins. Output includes the headword, part of speech, etymology essay (HTML and plain text), summary, century or period of origin, related cross-reference words, and direct URL to the source page.

The dictionary covers 50,000+ English words and phrases with etymologies tracing roots through Old English, Middle English, French, Latin, Greek, Norse, and beyond. The Actor has two modes: search by keyword to discover related words, or look up a specific list of words directly.

🎯 Target AudienceπŸ’‘ Primary Use Cases
Linguists, writers, content marketers, NLP/ML pipelines, vocabulary apps, language learners, journalistsLinguistic research, word-of-the-day newsletters, vocabulary apps, NLP training corpora, content writing on word origins

πŸ“‹ What the Etymonline Word Etymology Scraper does

Five filtering workflows in a single run:

  • πŸ” Search mode. Search a keyword and return ranked word results.
  • πŸ“š Lookup mode. Pass a list of words and pull each one's etymology directly.
  • πŸ“œ Full essay text. HTML and plain-text etymology with cross-references resolved.
  • πŸ“… Century detection. Heuristic extraction of the period a word was first attested.
  • πŸ”— Related words. Cross-references parsed from the body.

πŸ’‘ Why it matters: clean, server-side filtering and fresh data on every run.


🎬 Full Demo

🚧 Coming soon: a 3-minute walkthrough showing how to go from sign-up to a downloaded dataset.


βš™οΈ Input

InputTypeDefaultBehavior
maxItemsinteger10Records to return. Free plan caps at 10, paid plan up to 1,000,000.
modestring"search"search or lookup.
querystring"hello"Keyword to search across the dictionary.
wordsstringnewline listDirect word lookup list.

Example: search words related to language.

{
"maxItems":50,
"mode":"search",
"query":"language"
}

Example: look up specific words.

{
"maxItems":10,
"mode":"lookup",
"words":"hello\nworld\nlanguage\netymology"
}

πŸ“Š Output

Each record contains 11 fields. Download as CSV, Excel, JSON, or XML.

🧾 Schema

FieldTypeExample
πŸ”€ wordstring"hello"
πŸ“› headwordDisplaystring"hello (interj.)"
🏷️ partOfSpeechstring"interj."
πŸ“œ etymologyTextstring"greeting between strangers, especially through telephone..."
πŸ“œ summarystring"greeting between strangers..."
πŸ“… centuryFirstAttestedstringnull
πŸ”— relatedWordsarray["hallo","holla","ahoy"]
πŸ”’ relatedCountnumber3
πŸ”— etymonlineUrlstring"https://www.etymonline.com/word/hello"

πŸ“¦ Sample records


✨ Why choose this Actor

Capability
πŸ“š50,000+ words. Most cited free etymology source online.
πŸ“…Century detection. Heuristic period extraction for time-of-first-attestation analysis.
πŸ”—Cross-references. Related words parsed automatically.
⚑Fast. 100 lookups in under a minute.
βš–οΈPublic source. Free public reference dictionary.

πŸ“ˆ How it compares to alternatives

ApproachCostCoverageRefreshFiltersSetup
⭐ This Actor$5 free credit50,000+ wordsLive per runsearch or direct list lookup⚑ 2 min
Manual etymonline browseFreeManualLiveWeb onlyπŸ•’ Manual
OED API$$Larger but paywalledLiveYes🐒 Subscription
Wiktionary scrapingFreeMixed qualityLiveDIY🐒 Days

πŸš€ How to use

  1. πŸ“ Sign up. Create a free account with $5 credit (takes 2 minutes).
  2. 🌐 Open the Actor. Find the Etymonline Word Etymology Scraper on the Apify Store.
  3. 🎯 Set input. Pick filters and maxItems.
  4. πŸš€ Run it. Click Start.
  5. πŸ“₯ Download. Grab results in the Dataset tab as CSV, Excel, JSON, or XML.

⏱️ Total time from signup to dataset: 3-5 minutes. No coding required.


πŸ’Ό Business use cases

πŸ“° Content + Newsletters

  • "Word of the day" content
  • Vocabulary newsletter content
  • Etymology-driven blog posts
  • Trivia and curiosity articles

πŸŽ“ Education + Apps

  • Vocabulary builder apps
  • Language-learning supplements
  • Etymology games
  • Student reference tools

πŸ€– NLP + ML

  • Etymological feature engineering
  • Train word-history classifiers
  • Build linguistic-history embeddings
  • Corpus enrichment

πŸ”¬ Linguistics Research

  • First-attestation studies
  • Loanword analyses
  • Period-of-origin distributions
  • Cross-language tracing

πŸ”Œ Automating Etymonline Word Etymology Scraper

Control the scraper programmatically:

  • 🟒 Node.js. Install the apify-client NPM package.
  • 🐍 Python. Use the apify-client PyPI package.
  • πŸ“š See the Apify API documentation for full details.

The Apify Schedules feature lets you trigger this Actor on any cron interval.


🌟 Beyond business use cases

Data like this powers more than commercial workflows.

πŸŽ“ Research and academia

  • Computational linguistics
  • Reproducible word-history snapshots
  • Course materials
  • Cross-period corpora

🎨 Personal and creative

  • Personal vocabulary databases
  • Etymology blogs
  • Side projects
  • Newsletter content

🀝 Non-profit and civic

  • Cultural literacy outreach
  • Educational accessibility
  • Free reference compilation
  • Heritage-language preservation

πŸ§ͺ Experimentation

  • Train word-history classifiers
  • Prototype etymology chat agents
  • Build linguistic visualizations
  • Test text-mining pipelines

πŸ€– Ask an AI assistant about this scraper

Open a ready-to-send prompt in the AI of your choice:


❓ Frequently Asked Questions

🧩 How does it work?

Search mode finds words related to a keyword via the dictionary's search index. Lookup mode fetches each word in your list directly. Each match returns the parsed etymology page.

πŸ“š How many words are in the dictionary?

50,000+ English words and phrases, with new entries added regularly by the maintainer.

πŸ“Š How many fields per record?

11, including word, part of speech, etymology essay, summary, century of origin, related words, and source URL.

πŸ“… How accurate is century detection?

Heuristic. The Actor extracts the first matching \d{2,4}c. pattern from the etymology text. Always verify against the source for citations.

πŸ”— Are related words bidirectional?

No. Relations are extracted from the current page's body. Reverse relations may not appear.

πŸ” Can I schedule runs?

Yes. New entries are added regularly; weekly schedules capture additions.

βš–οΈ Is this data public?

Yes. Etymonline is a free public reference dictionary. The Actor reads only public pages.

πŸ’³ Do I need a paid Apify plan?

No. The free plan covers preview runs.

πŸ†˜ What if a word isn't in the dictionary?

The lookup is skipped silently with a debug log. Etymonline covers most common English words but isn't exhaustive for rare/specialized vocabulary.

🌐 Does it support languages other than English?

No. The dictionary tracks English words; entries reference foreign-language roots but only English headwords are indexed.


πŸ”Œ Integrate with any app

Etymonline Word Etymology Scraper connects to any cloud service via Apify integrations:

  • Make - Automate multi-step workflows
  • Zapier - Connect with 5,000+ apps
  • Slack - Get run notifications
  • Airbyte - Pipe data into your warehouse
  • GitHub - Trigger runs from commits
  • Google Drive - Export datasets to Sheets

πŸ”— Recommended Actors

πŸ’‘ Pro Tip: browse the complete ParseForge collection for more reference-data scrapers.


πŸ†˜ Need Help? Open our contact form to request a new scraper, propose a custom data project, or report an issue.


⚠️ Disclaimer: this Actor is an independent tool and is not affiliated with, endorsed by, or sponsored by Online Etymology Dictionary, its maintainers, or any cited reference work. All trademarks mentioned are the property of their respective owners. Only publicly available open data is collected.

You might also like

Dictionary & Thesaurus API

hanamira/dictionary-thesaurus

Look up any English word and get definitions, pronunciations with audio, synonyms, antonyms, etymology, and example sentences. Bulk word lookup supported. Perfect for writing apps, language learning tools, chatbots, and educational content.

Dictionary Word Definitions Scraper

parseforge/dictionary-api-scraper

Pull English word definitions, phonetics, audio pronunciations, parts of speech, examples, synonyms, and antonyms. Look up a word list or paste a paragraph and the Actor breaks it into per-word records. Useful for language apps, NLP, vocabulary builders, content tools.

Oxford English Dictionary

pokeball/oxford-english-dictionary

Scrape words, definitions, part of speech, percentage_popularity from the Oxford English Dictionary

Word & Character Counter

moving_beacon-owner1/word-character-counter

A simple tool to count words in text or from a webpage URL. Provides total word count, unique words, and word frequency analysis. Perfect for content analysis, SEO, or text analytics.

2

Free Dictionary Scraper

gio21/free-dictionary-scraper

Look up word definitions, phonetics, audio pronunciation, synonyms, antonyms, examples via the Free Dictionary API. No API key. Multi-language. For dictionaries, language learning, NLP datasets.

Wiktionary Definitions Scraper

parseforge/wiktionary-definitions-scraper

Fetch dictionary definitions from Wiktionary in 9 source languages. Returns part of speech, definitions, examples, and cross-language meanings per word. Plain-text and HTML output for one-shot or bulk word lists.

Cambridge Dictionary Scraper

alvaraaz/cambridge-dictionary-actor

Search words in the Cambridge Dictionary with this actor. Get definitions, examples, phonetics and CEFR levels.

πŸ‘ User avatar

Jose Fernando Álvarez Romero

4

Related articles

Python dictionaries: a comprehensive guide for devs
Read more
Sentiment analysis in Python (Complete guide for 2025)
Read more