VOOZH about

URL: https://apify.com/shahidirfan/openverse-image-scraper

⇱ OpenVerse Image Scraper Β· Apify


Pricing

Pay per usage

Go to Apify Store

OpenVerse Image Scraper

Bulk-download Creative Commons images from OpenVerse instantly. Extract photo URLs, metadata, licenses & attribution automatically. Ideal for content creation, web design, research projects & AI training datasets. Fully structured, legally compliant sourcing.

Pricing

Pay per usage

Rating

0.0

(0)

Developer

πŸ‘ Shahid Irfan

Shahid Irfan

Maintained by Community

Actor stats

0

Bookmarked

2

Total users

1

Monthly active users

13 days ago

Last modified

Share

Extract comprehensive data from the OpenVerse database with ease. Collect openly licensed and public domain images at scale, perfectly suited for building AI training datasets, sourcing stock photos, and comprehensive research.

Features

  • Keyword Search β€” Find specific images matching your query.
  • License Filtering β€” Narrow down results to specific Creative Commons licenses or public domain status.
  • Source Selection β€” Target specific providers like Flickr, Wikimedia, Rawpixel, and more.
  • High Performance β€” Extract data swiftly without limitations using direct API access.
  • Clean Data β€” Automatically removes empty or null values from the extracted datasets to keep your data structured and pristine.

Use Cases

Machine Learning and AI Training

Build comprehensive datasets of openly licensed images for training computer vision models without worrying about copyright infringement.

Content Creation & Curation

Source public domain and CC-licensed stock photos for websites, blogs, and marketing materials directly from an enormous catalog.

Academic Research

Gather metadata about image attribution, creator statistics, and license distribution across various providers for large-scale analysis.


Input Parameters

ParameterTypeRequiredDefaultDescription
keywordStringYesβ€”Image search keyword(s)
license_typeStringNoβ€”Comma-separated list of licenses (e.g., by, cc0)
sourceStringNoβ€”Comma-separated list of sources (e.g., flickr)
sortStringNoβ€”Sort order (e.g., relevance or newest)
results_wantedIntegerNo20Maximum results to collect

Output Data

Each extracted item in the dataset contains comprehensive image metadata:

FieldTypeDescription
idStringUnique OpenVerse identifier
titleStringImage title
urlStringDirect URL to the high-resolution image
foreign_landing_urlStringURL to the image on the original provider's website
creatorStringName of the image creator
licenseStringCreative Commons license type
providerStringName of the provider (e.g., flickr)
attributionStringComplete attribution string for easy use

Usage Examples

Basic Image Search

Extract 50 nature images:

{
"keyword":"nature",
"results_wanted":50
}

Specific License Extraction

Extract public domain (CC0) technology images from Flickr:

{
"keyword":"technology",
"license_type":"cc0",
"source":"flickr",
"results_wanted":100
}

Sample Output

{
"id":"260c0ca9-35a4-41a4-b1e0-e13339a2b31d",
"title":"Computer Test",
"foreign_landing_url":"https://www.flickr.com/photos/74362028@N00/2099250160",
"url":"https://live.staticflickr.com/2287/2099250160_e1ceb65c97_b.jpg",
"creator":"flatiron32",
"creator_url":"https://www.flickr.com/photos/74362028@N00",
"license":"by-nc",
"license_version":"2.0",
"provider":"flickr",
"source":"flickr",
"attribution":"\"Computer Test\" by flatiron32 is licensed under CC BY-NC 2.0. To view a copy of this license, visit https://creativecommons.org/licenses/by-nc/2.0/.",
"mature":false,
"height":768,
"width":1024
}

Tips for Best Results

Optimize Collection Size

  • Start with a small results_wanted (like 20) to preview the output data structure.
  • Increase the limit once you confirm the parameters accurately target your needs.

Precise Filtering

  • Use specific license codes like pdm (Public Domain Mark), cc0, by, by-sa.
  • Providing a specific source (like wikimedia or rawpixel) yields more homogeneous data.

Integrations

Connect your extracted image data with:

  • Google Sheets β€” Export metadata for quick review
  • Airtable β€” Build searchable image databases
  • Make β€” Create automated data enrichment workflows

Export Formats

Download your extracted data in multiple formats:

  • JSON β€” For developers and APIs
  • CSV β€” For spreadsheet analysis

Frequently Asked Questions

Can I scrape multiple pages?

Yes, the actor automatically handles pagination to reach your desired result count.

Are these images free to use?

OpenVerse aggregates openly licensed and public domain works, but you must respect the provided license and attribution requirements for each image.

What if data is missing?

Some fields (like category or filesize) might be missing if the original provider doesn't supply them. The actor automatically removes null fields to keep your dataset clean.


Support

For issues or feature requests, contact support through the Apify Console.

Resources


Legal Notice

This actor is designed for legitimate data collection purposes. Users are responsible for ensuring compliance with website terms of service, honoring the specified image licenses, and adhering to applicable laws. Use data responsibly.

You might also like

Creative Commons Search Scraper (Openverse)

gio21/creative-commons-scraper

Search Creative Commons-licensed content via Openverse API. Get free-to-use images, audio with attribution.

OpenVerse Image Scraper

crawlerbros/pixabay-scraper

Search millions of Creative Commons licensed images from Flickr, Wikimedia, and museums via OpenVerse (api.openverse.org). Free, no API key required.

Bing Images Scraper - Low-costπŸ’²πŸ”₯πŸŒπŸ–ΌοΈ

delectable_incubator/bing-images-scraper---low-cost

Scrape Bing Images search results easily πŸ”πŸ–ΌοΈ with a powerful image scraper. Extract image URLs, descriptions, dimensions, and metadata for any keyword. Ideal for visual research, brand monitoring, content sourcing, creative inspiration, and image SERP analysis with structured datasets πŸ“ŠπŸš€

Pexels Stock Image Scraper

shahidirfan/Pexels-Stock-Image-Scraper

Bulk download high-resolution royalty-free images from Pexels. Capture image URLs, titles, photographer info, dimensions & metadata. Ideal for blog automation, design assets, content creation, AI training datasets & stock image libraries. Zero licensing restrictions.

Google Images Scraper

scrapepilotapi/google-images-scraper

Google Images Scraper πŸ”πŸ–ΌοΈ extracts image URLs, titles, sources, and metadata from Google Images at scale. Ideal for research, AI datasets, SEO analysis, and content sourcing. Fast, reliable, and customizable for efficient large-scale image data collection. βš‘πŸ“Š

Google Images Scraper

scrapapi/google-images-scraper

Extract image results from Google Images using the Google Images Scraper. Collect image URLs, titles, source websites, thumbnails, and search result data automatically. Ideal for research, dataset creation, SEO analysis, and visual content discovery.

Adobe Stock Scraper - Low-costπŸ’²πŸ”₯🎨

delectable_incubator/adobe-stock-scraper---low-cost

Scrape Adobe Stock media data by keyword 🎨. Extract asset titles, content types, thumbnails, download URLs, license details, and metadata. Ideal for stock content analysis, creative research, and building structured datasets for design, marketing, and media projects πŸ“ŠπŸš€

Bing Images Api

scraper-engine/bing-images-api

Bing Images API searches and retrieves image results from Bing. Extract image URLs, titles, sources, thumbnails, and metadata for any keyword. Ideal for image datasets, visual research, AI training data collection, and content discovery.

πŸ‘ User avatar

Scraper Engine

2

DuckDuckGo Images Scraper - Cheap πŸ–ΌοΈπŸ¦†βœ¨

scrapestorm/duckduckgo-images-scraper---cheap

πŸ–ΌοΈ Easily collect image search data from DuckDuckGo Search and extract structured image results including image URLs, thumbnails, titles, source pages, domains, sizes, positions & more🌍 Perfect for image research, visual SEO analysis, content creation, brand monitoring & creative inspiration 🎨

6

Related articles

Top 5 Google Image Search APIs to extract web image data
Read more