Pricing
$9.99/month + usage
Jobspy - Linkedin, Glassdoor Scraper
JobSpy is a job scraping library with the goal of aggregating all the jobs from popular job boards with one tool. Scrapes job postings from LinkedIn, Indeed, Glassdoor, Google, ZipRecruiter, & other job boards concurrently Aggregates the job postings in a dataframe Proxies support to bypass blocking
Pricing
$9.99/month + usage
Rating
0.0
(0)
Developer
Actor stats
0
Bookmarked
25
Total users
4
Monthly active users
6 months ago
Last modified
Categories
Share
JobSpy is a job scraping library with the goal of aggregating all the jobs from popular job boards with one tool.
Features
- Scrapes job postings from LinkedIn, Indeed, Glassdoor, Google, ZipRecruiter, & other job boards concurrently
- Aggregates the job postings in a dataframe
- Proxies support to bypass blocking
Installation
pip install -U python-jobspy
Python version >= 3.10 required
Usage
import csvfrom jobspy import scrape_jobsjobs = scrape_jobs(site_name=["indeed","linkedin","zip_recruiter","google"],# "glassdoor", "bayt", "naukri", "bdjobs"search_term="software engineer",google_search_term="software engineer jobs near San Francisco, CA since yesterday",location="San Francisco, CA",results_wanted=20,hours_old=72,country_indeed='USA',# linkedin_fetch_description=True # gets more info such as description, direct job url (slower)# proxies=["208.195.175.46:65095", "208.195.175.45:65095", "localhost"],)print(f"Found {len(jobs)} jobs")print(jobs.head())jobs.to_csv("jobs.csv", quoting=csv.QUOTE_NONNUMERIC, escapechar="\\", index=False)# to_excel
Output
SITE TITLE COMPANY CITY STATE JOB_TYPE INTERVAL MIN_AMOUNT MAX_AMOUNT JOB_URL DESCRIPTIONindeed Software Engineer AMERICAN SYSTEMS Arlington VA None yearly 200000150000 https://www.indeed.com/viewjob?jk=5e409e577046... THIS POSITION COMES WITH A 10K SIGNING BONUS!...indeed Senior Software Engineer TherapyNotes.com Philadelphia PA fulltime yearly 135000110000 https://www.indeed.com/viewjob?jk=da39574a40cb... About Us TherapyNotes is the national leader i...linkedin Software Engineer - Early Career Lockheed Martin Sunnyvale CA fulltime yearly None None https://www.linkedin.com/jobs/view/3693012711 Description:By bringing together people that u...linkedin Full-Stack Software Engineer Rain New York NY fulltime yearly None None https://www.linkedin.com/jobs/view/3696158877 Rainβs mission is to create the fastest and ea...zip_recruiter Software Engineer - New Grad ZipRecruiter Santa Monica CA fulltime yearly 130000150000 https://www.ziprecruiter.com/jobs/ziprecruiter... We offer a hybrid work environment. Most US-ba...zip_recruiter Software Developer TEKsystems Phoenix AZ fulltime hourly 6575 https://www.ziprecruiter.com/jobs/teksystems-0... Top Skills' Detailsβ’ 6 years of Java developme...
Parameters for scrape_jobs()
Optionalβββ site_name (list|str):| linkedin, zip_recruiter, indeed, glassdoor, google, bayt, bdjobs| (default is all)ββββ search_term (str)|βββ google_search_term (str)| search term for google jobs. This is the only param for filtering google jobs.ββββ location (str)ββββ distance (int):| in miles, default 50ββββ job_type (str):| fulltime, parttime, internship, contractββββ proxies (list):| in format ['user:pass@host:port', 'localhost']| each job board scraper will round robin through the proxies|βββ is_remote (bool)ββββ results_wanted (int):| number of job results to retrieve for each site specified in 'site_name'ββββ easy_apply (bool):| filters for jobs that are hosted on the job board site (LinkedIn easy apply filter no longer works)|βββ user_agent (str):| override the default user agent which may be outdatedββββ description_format (str):| markdown, html (Format type of the job descriptions. Default is markdown.)ββββ offset (int):| starts the search from an offset (e.g. 25 will start the search from the 25th result)ββββ hours_old (int):| filters jobs by the number of hours since the job was posted| (ZipRecruiter and Glassdoor round up to next day.)ββββ verbose (int) {0, 1, 2}:| Controls the verbosity of the runtime printouts| (0 prints only errors, 1 is errors+warnings, 2 is all logs. Default is 2.)βββ linkedin_fetch_description (bool):| fetches full description and direct job url for LinkedIn (Increases requests by O(n))ββββ linkedin_company_ids (list[int]):| searches for linkedin jobs with specific company ids|βββ country_indeed (str):| filters the country on Indeed & Glassdoor (see below for correct spelling)|βββ enforce_annual_salary (bool):| converts wages to annual salary|βββ ca_cert (str)| path to CA Certificate file for proxies
βββ Indeed limitations:| Only one from this list can be used in a search:|- hours_old|- job_type & is_remote|- easy_applyββββ LinkedIn limitations:| Only one from this list can be used in a search:|- hours_old|- easy_apply
Supported Countries for Job Searching
LinkedIn searches globally & uses only the location parameter.
ZipRecruiter
ZipRecruiter searches for jobs in US/Canada & uses only the location parameter.
Indeed / Glassdoor
Indeed & Glassdoor supports most countries, but the country_indeed parameter is required. Additionally, use the location
parameter to narrow down the location, e.g. city & state if necessary.
You can specify the following countries when searching on Indeed (use the exact name, * indicates support for Glassdoor):
| Argentina | Australia* | Austria* | Bahrain |
| Belgium* | Brazil* | Canada* | Chile |
| China | Colombia | Costa Rica | Czech Republic |
| Denmark | Ecuador | Egypt | Finland |
| France* | Germany* | Greece | Hong Kong* |
| Hungary | India* | Indonesia | Ireland* |
| Israel | Italy* | Japan | Kuwait |
| Luxembourg | Malaysia | Mexico* | Morocco |
| Netherlands* | New Zealand* | Nigeria | Norway |
| Oman | Pakistan | Panama | Peru |
| Philippines | Poland | Portugal | Qatar |
| Romania | Saudi Arabia | Singapore* | South Africa |
| South Korea | Spain* | Sweden | Switzerland* |
| Taiwan | Thailand | Turkey | Ukraine |
| United Arab Emirates | UK* | USA* | Uruguay |
| Venezuela | Vietnam* |
Bayt
Bayt only uses the search_term parameter currently and searches internationally
Notes
- Indeed is the best scraper currently with no rate limiting.
- All the job board endpoints are capped at around 1000 jobs on a given search.
- LinkedIn is the most restrictive and usually rate limits around the 10th page with one ip. Proxies are a must basically.
Frequently Asked Questions
Q: Why is Indeed giving unrelated roles?
A: Indeed searches the description too.
- use - to remove words
- "" for exact match
Example of a good Indeed query
search_term='"engineering intern" software summer (java OR python OR c++) 2025 -tax -marketing'
This searches the description/title and must include software, summer, 2025, one of the languages, engineering intern exactly, no tax, no marketing.
Q: No results when using "google"?
A: You have to use super specific syntax. Search for google jobs on your browser and then whatever pops up in the google jobs search box after applying some filters is what you need to copy & paste into the google_search_term.
Q: Received a response code 429?
A: This indicates that you have been blocked by the job board site for sending too many requests. All of the job board sites are aggressive with blocking. We recommend:
- Wait some time between scrapes (site-dependent).
- Try using the proxies param to change your IP address.
JobPost Schema
JobPostβββ titleβββ companyβββ company_urlβββ job_urlβββ locationβ βββ countryβ βββ cityβ βββ stateβββ is_remoteβββ descriptionβββ job_type: fulltime, parttime, internship, contractβββ job_functionβ βββ interval: yearly, monthly, weekly, daily, hourlyβ βββ min_amountβ βββ max_amountβ βββ currencyβ βββ salary_source: direct_data, description (parsed from posting)βββ date_postedβββ emailsLinkedin specificβββ job_levelLinkedin & Indeed specificβββ company_industryIndeed specificβββ company_countryβββ company_addressesβββ company_employees_labelβββ company_revenue_labelβββ company_descriptionβββ company_logoNaukri specificβββ skillsβββ experience_rangeβββ company_ratingβββ company_reviews_countβββ vacancy_countβββ work_from_home_type
