VOOZH about

URL: https://apify.com/vkuprin/universal-ai-page-monitor

⇱ AI Page Monitor β€” Amazon, LinkedIn, GitHub Change Detection Β· Apify


Pricing

Pay per event

Go to Apify Store

Universal AI Page Monitor

Monitor any URL for changes β€” Amazon products, LinkedIn profiles, job boards, pricing pages, Hacker News, GitHub. AI generates CSS selectors + regex filters in plain English. Works in any language. No API key needed. MCP-ready for Claude, Cursor, Codex, Gemini, Cline.

Pricing

Pay per event

Rating

0.0

(0)

Developer

πŸ‘ Vitaly Kuprin

Vitaly Kuprin

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

0

Monthly active users

2 months ago

Last modified

Share

Monitor Amazon products, LinkedIn profiles, job boards, pricing pages, Hacker News, GitHub releases, Shopify stores, or any URL for changes. Plain-English intent in β†’ CSS selectors + regex filters out. Ready to plug into any change-detection system or call as a tool from any MCP client.

  • βœ… No OpenAI/Anthropic API key needed. AI cost is included in the per-call event price.
  • βœ… Works on any URL in any language (English, Russian, Spanish, Chinese, Arabic, etc.).
  • βœ… MCP-ready for Claude Desktop, Cursor, Codex, Gemini Code Assist, Cline, Continue, and every MCP-compatible client.
  • βœ… Typical run cost: ~$0.13 (init + page fetch + 1 AI call + result). 100 pages/month β‰ˆ $13.

What it does

Given a URL, the Actor:

  1. Fetches the page (Playwright by default; HTTP for static pages).
  2. Extracts main content with Defuddle β€” picks the right contentSelector for monitoring.
  3. Detects schema.org page type (Product, Article, JobPosting, …) from JSON-LD.
  4. Strips ads & cookie banners via EasyList + Fanboy's Cookie List (pre-baked at build time).
  5. Generates noise filters (regex ignore_text + CSS subtractive_selectors) for ephemeral content like timestamps and counters.
  6. Generates trigger patterns when you provide a plain-English intent β€” e.g. "notify when sold out" β†’ regex that fires only on real change.

Multilingual by design β€” the prompts include explicit examples in Russian, Spanish, Chinese, and Arabic. When the page is in a non-English language, the engine emits ignore_text regex patterns in that language verbatim (e.g. /Π½Π°ΠΉΠ΄Π΅Π½ΠΎ\s+[\d\s]+\s*вакансии/i for a Russian SERP).

Inputs

FieldTypeRequiredDescription
urlstringyesAny HTTP(S) URL
intentstringnoPlain-English description of the change you want detected
fetchModehttp | playwrightno (default playwright)http is cheap and fast for static pages; playwright renders JS-heavy SPAs
screenshotboolno (default false)Save full-page PNG to KV store (Playwright mode only)
useEasyListboolno (default true)Apply EasyList + Fanboy's Cookie List
userAgentstringnoOverride the User-Agent for the page fetch

No API keys to paste. AI cost is included in the ai-call event price below.

Output

One row per analyzed URL, written to the run's default dataset:

{
"url": "https://news.ycombinator.com",
"fetchedAt": "2026-05-09T13:30:00.000Z",
"fetchMode": "playwright",
"pageType": "website",
"contentSelector": "#hnmain",
"includeFilters": ["#hnmain"],
"subtractiveSelectors": [".pagetop", "td.title:has(.titleline)"],
"triggerText": ["/(?:show hn|ask hn):/i"],
"ignoreText": ["/\\d+\\s*points?/i", "/posted\\s+\\d+/i"],
"explanation": "Configured content selector and noise filters for a list page.",
"warning": null,
"screenshotKey": null,
"aiTokens": { "input": 920, "output": 180 },
}

The output schema matches SmartFilterResult exactly β€” feed it directly into any change-detection backend that accepts CSS include/subtract filters, regex ignore_text, and regex trigger_text.

Pricing

Pay-per-event. Charges are configured in the Apify Console:

  • init β€” once per run
  • page-fetched β€” once per successfully fetched page
  • ai-call β€” once per LLM call (only when intent is set; AI cost is included)
  • screenshot β€” once if screenshot: true
  • result-pushed β€” once per dataset row

A typical run with intent set, Playwright fetch, no screenshot: 1Γ— init + 1Γ— page-fetched + 1Γ— ai-call + 1Γ— result-pushed. No external accounts or API keys required β€” the AI cost is rolled into the ai-call event price.

MCP usage (works with every MCP client)

Apify exposes Store Actors as MCP tools natively via the Apify MCP server. The same configuration works in every MCP-compatible client:

  • Claude Desktop β€” ~/Library/Application Support/Claude/claude_desktop_config.json (macOS) / %APPDATA%\Claude\claude_desktop_config.json (Windows)
  • Cursor β€” ~/.cursor/mcp.json
  • OpenAI Codex (the desktop client and the API's tools surface)
  • Google Gemini Code Assist (when MCP is enabled)
  • Cline (VS Code extension) β€” cline_mcp_settings.json
  • Continue.dev β€” ~/.continue/config.json
  • OpenClaw and other community MCP clients
  • Custom MCP clients built with the official MCP SDK (Python, TypeScript, Go)

Add this block to your client's MCP config:

{
"mcpServers": {
"apify": {
"command": "npx",
"args": ["-y", "@apify/actors-mcp-server"],
"env": { "APIFY_TOKEN": "<your-apify-token>" }
}
}
}

Then ask your agent something like:

"Monitor https://news.ycombinator.com and tell me which CSS selectors and regex patterns I should use to detect when a new top story appears."

The agent calls this Actor with url + intent, gets the returned filters, and uses them to wire up monitoring elsewhere β€” no manual API key handling, no platform-specific glue code.

Examples

// Track price drops on a product page (AI-driven trigger patterns)
{
"url": "https://www.amazon.com/dp/B08N5WRWNW",
"intent": "Notify me when the price drops below $200",
"fetchMode": "playwright"
}
// Article changelog monitoring (no AI β€” noise filters only, lower cost)
{
"url": "https://nextjs.org/docs/app/api-reference/components",
"fetchMode": "http",
"useEasyList": false
}
// Job board monitoring with screenshot
{
"url": "https://news.ycombinator.com/jobs",
"intent": "Track new senior backend engineering jobs in Berlin",
"screenshot": true
}

Limits

  • Single URL per run. For multi-URL workflows, run the Actor once per URL or use Apify's Task scheduler / Schedule integration.
  • Defuddle returns title/markdown separately β€” contentSelector may be null for unstructured pages where Defuddle falls back to Turndown.
  • LLM call uses gpt-4.1-mini under the hood (cheap, fast, deterministic-leaning). AI cost is included in the ai-call event price β€” no API key for you to manage. The aiTokens field in the output is for transparency only.

Source & methodology

Engine: @site-spy/smart-config-core β€” the same module used in production by the Site Spy change-detection service. Tested on 1,066 real production pages with 99.4% configure-success β€” works on sites that break generic scrapers (A/B-tested layouts, hashed CSS classes, JS-heavy SPAs, non-English content).

Source is bundled into the Actor image at publish time. Hybrid architecture: deterministic extractors (Defuddle, JSON-LD, EasyList, regex pattern library) do most of the work; the LLM (gpt-4.1-mini) is only invoked for noise + trigger generation when the user provides intent.

You might also like

MCP tools – Turn Any Website into an AI Tool in 60 Seconds

clever_fashion/mcp-website-tool

Automatically extract buttons, inputs & forms from any site and get ready-to-use MCP (Model Context Protocol) tools for Cursor, Claude, Claude Desktop, Windsurf, Cline, and any MCP-compatible AI.

πŸ‘ User avatar

Data Farming Team

8

AI Web Scraper

crawlworks/ai-web-scraper

Scrape any webpage with a URL and a plain-English prompt. Get structured JSON output powered by AI β€” no coding, no selectors, no configuration.

Reddit MCP Server β€” Claude, ChatGPT, Cursor, Codex

makework36/reddit-mcp-server

Native Reddit MCP server for AI agents. 7 Reddit tools (search, subreddits, posts+comments, users, trending) over Streamable HTTP. Works with Claude Desktop, Cursor, ChatGPT, OpenAI Codex, Agents SDK, Windsurf. No Reddit API key. Pay per tool call.

πŸ‘ User avatar

deusex machine

31

Best AI Web Scraper

hgservices/Best-AI-Web-Scraper

Extract any data from any website by simply describing what you want in plain English. AI-powered web scraping with no code, no selectors, and no per-site setup.

Web Page Change Monitor

dramatic_jonquil/web-page-change-monitor

Monitor any web page for meaningful changes. Watch full pages or specific CSS selectors, filter noisy timestamps and tokens, and get diffs plus optional webhook alerts when content actually changes.

Claude AI Web Automation

dtrungtin/claude-ai-web-automation

A real browser with Anthropic's Claude models to navigate any website and extract structured data β€” no CSS selectors or page-specific scraping code required.

AI Web Scraper β€” Structured Data From Any URL

muhammadafzal/ai-web-extractor

Extract structured data from any website using an LLM and your own field schema β€” no CSS selectors. Give it URLs and the fields you want; get clean JSON rows back. Works on blogs, job boards, product pages, listings, and more.

πŸ‘ User avatar

Muhammad Afzal

-