VOOZH about

URL: https://apify.com/extremescrapes/wordpress-articles-scraper

โ‡ฑ WordPress Articles Scraper ยท Apify


Pricing

$80.00 / 1,000 results

Go to Apify Store

WordPress Articles Scraper

The WordPress Articles Scraper is an Apify actor that extracts posts and metadata from any WordPress website using the WordPress REST API. It automatically handles pagination and fetches additional information like author details, categories, tags, and featured images.

Pricing

$80.00 / 1,000 results

Rating

0.0

(0)

Developer

๐Ÿ‘ Extreme Scrapes

Extreme Scrapes

Maintained by Community

Actor stats

2

Bookmarked

136

Total users

4

Monthly active users

4 months ago

Last modified

Share

Overview

The WordPress Articles Scraper is an Apify actor that extracts posts and metadata from any WordPress website using the WordPress REST API. It automatically handles pagination and fetches additional information like author details, categories, tags, and featured images.

This actor is perfect for researchers, content aggregators, and developers who need structured data from WordPress sites.

How It Works

  1. You provide the WordPress URL.
  2. The actor fetches posts, handling pagination automatically.
  3. If a search keyword is provided, it filters results accordingly.
  4. It extracts metadata such as author name, categories, tags, and featured images.
  5. The final structured JSON output includes all relevant post details.

Features

โœ… Fetches posts from any WordPress site โœ… Supports pagination until all posts are retrieved โœ… Filters posts based on search terms โœ… Extracts metadata like author, categories, tags, and featured images โœ… Provides clean and structured JSON output

Getting Started

1. Input Parameters

To use the scraper, provide the following inputs:

ParameterTypeRequiredDescription
startUrlsArrayโœ…List of URLs to start crawling from (e.g., [{"url": "https://example.com", "method": "GET"}]).

2. Running the Actor

You can run the actor directly on Apify or via API:

Using Apify Interface

  1. Navigate to the actor's Apify page.
  2. Enter the required parameters.
  3. Click Run and wait for the data to be scraped.

Using Apify API

curl-X POST -H"Content-Type: application/json"\
-d'{"maxRequestsPerCrawl": 1, "perPage": 10, "startUrls": [{"url": "https://example.com", "method": "GET"}]}'\
"https://api.apify.com/v2/acts/YOUR_ACTOR_ID/runs?token=YOUR_API_TOKEN"

Output Format

The output is a JSON dataset containing structured post details:

[
{
"id":123,
"date":"2025-03-28T12:00:00",
"modified":"2025-03-28T14:00:00",
"slug":"example-post",
"link":"https://example.com/example-post",
"title":"Example Post Title",
"content":"<p>This is an example post content...</p>",
"excerpt":"This is a short summary...",
"author":"John Doe",
"categories":["Technology","News"],
"tags":["AI","Programming"],
"featured_image":"https://example.com/wp-content/uploads/featured-image.jpg",
"extra_metadata":{
"author_bio":"John Doe is a technology journalist...",
"category_description":"Latest news in tech industry..."
}
}
]

Use Cases

  • Content Aggregation โ€“ Collect and analyze posts from different WordPress sites.
  • SEO Research โ€“ Extract content and metadata for SEO analysis.
  • Data Science โ€“ Gather datasets for NLP or sentiment analysis.
  • Backup and Archiving โ€“ Store blog content for future reference.

Support & Contributions

If you encounter any issues or have feature requests, feel free to open an issue or contribute to the project. Happy scraping! ๐Ÿš€

You might also like

WordPress Scraper

jupri/wordpress

๐Ÿ’ซ Scrape WordPress and Woocommerce websites

โœจ WordPress Content Extractor

ramman/wordpress-content-extractor

๐Ÿ”Easily scrape and export posts, pages, metadata, images, and comments from any WordPress site. โœจ WordPress content to JSON, CSV, or TXT โ€” instantly.

WordPress Post Scraper

hgservices/wordpress-post-scraper

Extract every blog post from any WordPress site โ€” title, content, date, author, image, categories and tags.

Website Tech Stack Detector โ€” 100+ Technologies

ryanclinton/website-tech-stack-detector

Identify the technologies, frameworks, and services running on any website. Website Tech Stack Detector crawls one or more URLs, inspects HTTP headers, HTML meta tags, script sources, and body content, then matches them against a fingerprint database of 106 web technologies across 17 categories.

32

๐ŸŽฏ Facebook Ad Copywriter Creator

powerai/facebook-ad-copywriter-creator

Transform your Facebook ad campaigns with AI-powered copywriting! Creates engaging, conversion-focused ad copy tailored to your target audience, complete with compelling headlines, persuasive body text, and effective calls-to-action - all delivered in multiple variations for A/B testing.

WordPress Integration

new-world-scripts/wordpress-integration

Manage WordPress content from Apify. Pull WordPress posts and pages, upload draft or published posts from JSON input, and delete WordPress posts by ID using the WordPress REST API.

๐Ÿ‘ User avatar

New World Scripts

1

5.0

WordPress Posts Scraper - Extract Articles & Metadata

devnaz/wordpress-posts-scraper

Extract posts, articles, and metadata from any WordPress site using REST API. 20+ filters: date ranges, categories, tags, 0authors, search keywords. Get title, content, author bio, featured images & more. No WordPress account needed. Fast, reliable data extraction for content aggregation & research.

Wordpress Email Scraper - Advanced, Fast & Cheapest

contacts-api/wordpress-email-scraper-fast-advanced-and-cheapest

๐ŸŒ WordPress Email Scraper finds emails from WordPress websites, blogs, and author pages fast โšก Ideal for outreach, partnerships, and SEO campaigns ๐Ÿ“ง

Google Keyword Suggestions Scraper

powerai/google-keywords-suggest-scraper

Get Google keyword suggestions and insights including search volume, competition level, and bid estimates for any keyword.

Wordpress Email Scraper

scraper-mind/wordpress-email-scraper-fast

WordPress email scraper to extract emails from WordPress websites, blogs, and contact pages ๐Ÿ“ง๐ŸŒ Perfect for B2B lead generation, outreach campaigns, and building targeted website owner contact lists. Fast, accurate, and reliable.