VOOZH about

URL: https://data.ontario.ca/about

⇱ About - Ontario Data Catalogue


Skip to content
  1. Home

About

The Ontario government generates and maintains thousands of datasets. Since 2012, we have shared data with Ontarians via a data catalogue.

Open Data

Open data is data that is shared with the public. Click here to learn more about open data and why Ontario releases it.

Ontario’s Digital and Data Directive states that all data must be open, unless there is good reason for it to remain confidential.

Ontario’s Chief Digital and Data Officer also has the authority to make certain datasets available publicly.

Other data

Datasets listed in the catalogue that are not open will have one of the following labels:

  • Under review – The dataset is still being screened. It may become open or it may be restricted and not released.
  • Restricted – The dataset has been screened and will not be released to the public. The record will be listed in the catalogue, but the data will not be shared.

Data licences and use

If you want to use data you find in the catalogue, that data must have a licence – a set of rules that describes how you can use it. A licence:

  • grants you rights to download and use the data
  • outlines your rights
  • tells you what terms and conditions apply

Most of the data available in the catalogue is released under Ontario’s Open Government Licence. However, each dataset may be shared with the public under other kinds of licences or no licence at all.

If a dataset doesn’t have a licence, you don’t have the right to use the data.

If you have questions about how you can use a specific dataset, please contact us.

Developers resources

Data

The Ontario Data Catalogue endeavors to publish open data in a machine readable format. For machine readable datasets, you can simply retrieve the file you need using the file URL.

The Ontario Data Catalogue is built on CKAN, which means the catalogue has the following features you can use when building applications.

APIs (Application programming interfaces) let software applications communicate directly with each other. If you are using the catalogue in a software application, you might want to extract data from the catalogue through the catalogue API.

Note: All Datastore API requests to the Ontario Data Catalogue must be made server-side.

Catalogue API

The catalogue's collection of dataset metadata (and dataset files) is searchable through the CKAN API. The Ontario Data Catalogue has more than just CKAN's documented search fields. You can also search these custom fields. You can also use the CKAN API to retrieve metadata about a particular dataset and check for updated files.

Read the complete documentation for CKAN's API.

Datastore API

Some of the open data in the Ontario Data Catalogue is available through the Datastore API. You can also search and access the machine-readable open data that is available in the catalogue.

How to use the API feature:

  1. Find your dataset.
    👁 A screenshot of the dataset search results page. The user has searched for the word pepper and 3 results are displayed.
    Figure 3 – Dataset search results for “pepper”.
  2. Click Preview to go to the file you want to access through the API.
    👁 A screenshot of the data and resources section of the dataset description page. Each file in the dataset is listed. To the right of each file name is a preview button and a download button. An arrow is overlaid onto the screeshot that points to the preview button.
    Figure 4 – Click the Preview button.
  3. Click the Data API button.
    👁 A screenshot of the file description page. The data api button is found in the top right corner of the page.
    Figure 5 – Click the Data API button and follow the instructions.

Read the complete documentation for CKAN's Datastore API.

Training materials

How to use the catalogue

The Ontario Data Catalogue contains a record for each dataset that the Government of Ontario possesses. Some of these datasets will be available to you as open data. Others will not be available to you. This is because the Government of Ontario is unable to share data that would break the law or put someone's safety at risk.

Search

You can search for a dataset with a word that might describe a dataset or topic. Use words like “taxes” or “hospital locations” to discover what datasets the catalogue contains.

You can search for a dataset from 3 spots on the catalogue:

  1. the homepage,
  2. the dataset search page, or
  3. the menu bar available across the catalogue.

On the dataset search page, you can also filter your search results. You can select filters on the left hand side of the page to limit your search for datasets with your favourite file format, datasets that are updated weekly, datasets released by a particular ministry, or datasets that are released under a specific licence. Go to the dataset search page to see the filters that are available to make your search easier.

You can also do a quick search by selecting one of the catalogue’s categories on the homepage. These categories can help you see the types of data we have on key topic areas.

When you find the dataset you are looking for, click on it to go to the dataset record.

Dataset record

Each dataset record will tell you whether the data is available, and, if so, tell you about the data available.

Open Data

An open dataset might contain several data files. These files might represent different periods of time, different sub-sets of the dataset, different regions, language translations, or other breakdowns. You can select a file and either download it or preview it. Make sure to read the licence agreement to make sure you have permission to use it the way you want.

Non-open data

A non-open dataset may be not available for many reasons.

Read more about non-open data.

Read more about restricted data.

Data that is non-open may still be subject to freedom of information requests.

Visualizing data

The catalogue has tools that enable all users to visualize the data in the catalogue without leaving the catalogue – no additional software needed.

Keeping updated

Get automatic notifications when datasets are updated. You can choose to get notifications for individual datasets, an organization’s datasets or the full catalogue.

You don’t have to provide and personal information – just subscribe to our feeds using any feed reader you like using the corresponding notification web addresses. Copy those addresses and paste them into your reader.

👁 A screenshot of the configuration screen of a RSS reader. The feed url for the data catalogue has been entered into the form field for the feed url.
Figure 1 - Pasting the notification address for the full catalogue into the feed reader.

Your feed reader will let you know when the catalogue has been updated.

👁 A screenshot of an RSS reader displaying recently updated datasets from the data catalogue. The latest four datasets are listed.
Figure 2 – Receiving notifications on the latest updates to the catalogue.

File Types

The catalogue provides open data in several file formats (e.g., spreadsheets, geospatial data, etc). Learn about each format and how you can access and use the data each file contains.

CSV - Comma Separated Values

A file that has a list of items and values separated by commas without formatting (e.g. colours, italics, etc.) or extra visual features. This format provides just the data that you would display in a table. XLSX (Excel) files may be converted to CSV so they can be opened in a text editor.

How to access the data: Open with any spreadsheet software application (e.g., Open Office Calc, Microsoft Excel) or text editor.

Note: This format is considered machine-readable, it can be easily processed and used by a computer. Files that have visual formatting (e.g. bolded headers and colour-coded rows) can be hard for machines to understand, these elements make a file more human-readable and less machine-readable.

Txt – Text file

A file that provides information without formatted text or extra visual features that may not follow a pattern of separated values like a CSV.

How to access the data: Open with any word processor or text editor available on your device (e.g., Microsoft Word, Notepad).

XLS/XLSX – Excel Spreadsheet

A spreadsheet file that may also include charts, graphs, and formatting.

How to access the data: Open with a spreadsheet software application that supports this format (e.g., Open Office Calc, Microsoft Excel). Data can be converted to a CSV for a non-proprietary format of the same data without formatted text or extra visual features.

SHP – Shapefile

A shapefile provides geographic information that can be used to create a map or perform geospatial analysis based on location, points/lines and other data about the shape and features of the area. It includes required files (.shp, .shx, .dbt) and might include corresponding files (e.g., .prj).

How to access the data: Open with a geographic information system (GIS) software program (e.g., QGIS).

Zip – Compressed file

A package of files and folders. The package can contain any number of different file types.

How to access the data: Open with an unzipping software application (e.g., WinZIP, 7Zip).

Note: If a ZIP file contains .shp, .shx, and .dbt file types, it is an ArcGIS ZIP: a package of shapefiles which provide information to create maps or perform geospatial analysis that can be opened with ArcGIS (a geographic information system software program).

GeoJSON – Geographic JavaScript Object Notation

A file that provides information related to a geographic area (e.g., phone number, address, average rainfall, number of owl sightings in 2011 etc.) and its geospatial location (i.e., points/lines).

How to access the data: Open using a GIS software application to create a map or do geospatial analysis. It can also be opened with a text editor to view raw information.

Note: This format is machine-readable, and it can be easily processed and used by a computer. Human-readable data (including visual formatting) is easy for users to read and understand.

JSON - JavaScript Object Notation

A text-based format for sharing data in a machine-readable way that can store data with more unconventional structures such as complex lists.

How to access the data: Open with any text editor (e.g., Notepad) or access through a browser.

Note: This format is machine-readable, and it can be easily processed and used by a computer. Human-readable data (including visual formatting) is easy for users to read and understand.

XML - Extensible Markup Language

A text-based format to store and organize data in a machine-readable way that can store data with more unconventional structures (not just data organized in tables).

How to access the data: Open with any text editor (e.g., Notepad).

Note: This format is machine-readable, and it can be easily processed and used by a computer. Human-readable data (including visual formatting) is easy for users to read and understand.

KML- Keyhole Markup Language

A file that provides information related to an area (e.g., phone number, address, average rainfall, number of owl sightings in 2011 etc.) and its geospatial location (i.e., points/lines).

How to access the data: Open with a geospatial software application that supports the KML format (e.g., Google Earth).

Note: This format is machine-readable, and it can be easily processed and used by a computer. Human-readable data (including visual formatting) is easy for users to read and understand.

IVT – Table files

This format contains files with data from tables used for statistical analysis and data visualization of Statistics Canada census data.

How to access the data: Open with the Beyond 20/20 application.

Application/ msaccess – MS Access Database

A database which links and combines data from different files or applications (including HTML, XML, Excel, etc.).

The database file can be converted to a CSV/TXT to make the data machine-readable, but human-readable formatting will be lost.

How to access the data: Open with Microsoft Office Access (a database management system used to develop application software).

PDF - Portable Document Format

A file that keeps the original layout and formatting of a page. The contents of a PDF cannot be edited directly.

How to access the data: Open with web browsers, PDF readers, and word processors.

DOC/DOCX – Word Document

A text file which can include images, tables and many other formatting options.

How to access the data: Open with doc/docx compatible word processors.

RTF – Rich Text Format

A text file which can include basic text formatting and images.

How to access the data: Opens with most word processors (e.g., OpenOffice).

Interpreting data

Open data on the Ontario Data Catalogue will vary in how it is organized and formatted. Depending on the licence agreement for a dataset, you can choose to organize and format the file to suit your intended use.

Some of the open data on the Ontario Data Catalogue will have a data dictionary. A data dictionary can help you understand what the dataset means.

If you need further understanding the data, reach out to the dataset point of contact.

What do we tell you about data (metadata)?

Each dataset listed in the catalogue might have the following details as part of its description:

ID (database name: id)

A unique set of numbers and letters.

Example value: 027f8366-1edb-45e7-9631-8e6a2b28267e

URL (database name: url)

The URL of the page in the catalogue that lists the dataset.

Example value: https://data.ontario.ca/dataset/ontario-top-baby-names-male

Title (database name: title)

A unique title that describes the data.

Example value: Abandoned airports

Name (database name: name)

Same as title, except with dashes ("-") instead of spaces (" ").

Example value: abandoned-airports

Description (database name: notes)

A brief introduction that helps users understand the data.

Example value: *Abandoned airports and airports that are not recognized at the national level with an airport identifier.

This layer is derived from data provided by Nav Canada. This layer should not be used for navigation purposes.

Official LIO title: Airport Other*

Last Validated Date (database name: current_as_of)

The last date someone responsible for the data reviewed the data files and their information to confirm it was still current.

Example value: 20190516T16:06:15+00:00

Date Opened (database name: opened_date)

The date that the data files were first posted to the Open Data Catalogue (and shared with the public).

Example value: 20190516T16:06:15+00:00

Update Frequency (database name: update_frequency)

How often the data maintainers plan to update the data.

Example value: yearly

Date created (database name: metadata_created)

The date that the data is first listed on the open data catalogue.

Example value: 20190516T16:06:15+00:00

Tags (database name: keywords)

Terms that you might use when describing this data.

Example value: airports, aviation, transportation, land use

Geographic Coverage (database name: geographic_coverage)

A term describing the geographic boundaries of this data.

Example value: Ontario

Access Level (database name: access_level)

The status of data that has been reviewed or is being reviewed to see if it is safe to share with the public on the Open Data Catalogue.

Example value: open

Access Instructions (database name: access_instructions)

Instructions on accessing the data if it isn’t directly accessible (i.e. accessible through one click).

Example value: Navigate to https://www.javacoeapp.lrc.gov.on.ca/geonetwork/srv/en/main.home?uuid=1d9372b7-3786-448f-b72a-1da2aa1ab6ce and select the “Airport Other - shape file download package” link. Then provide your email, name, and organization and the shapefile will be emailed to the address you provided.

Licence (database name: license_id)

An ID that corresponds to the terms and conditions of the licence.

Example value: Ontario Open Government Licence

Creator (database name: creator_user_id)

The ID of the employee who initially listed the data in the catalogue.

Example value: Joe Smith

Ministry (database name: owner_org)

The ministry that is responsible for the data.

Example value: Ministry of Labour

Maintainer (database name: maintainer)

The name of the person or group that can be contacted with questions about the data.

Example value: Centre of Excellence for Evidence-Based Decision Making

Maintainer Email (database name: maintainer_email)

The email address of the person or group that can be contacted with questions about the data.

Example value: joe.smith@ontario.ca

Author (database name: author)

The name of the person or group that is the original author of the data.

Example value: Centre of Excellence for Evidence-Based Decision Making

Author Email (database name: author_email)

The email address of the person or group that is the original author of the data.

Example value: putting_outcomes_first@ontario.ca

Maintainer Branch (database name: maintainer_branch)

The Ontario government branch responsible for maintaining the data.

Example value: CoE for Evidence-Based Decision Making

Exemption (database name: exemption)

The reason why the data was not shared with the public.

Example value: privacy

Exemption Rationale (database name: exemption_rationale)

An explanation as to why the data was not shared with the public.

Example value: Dataset contains confidential client contact/personal information.

Files

Each dataset listed in the catalogue might include multiple related files. Each file might have the following details in its description:

ID (database name: id)

A unique set of numbers and letters.

Example value: 027f8366-1edb-45e7-9631-8e6a2b28267e

File (database name: url)

The web address where you can find the file on the internet.

Example value: https://www.javacoeapp.lrc.gov.on.ca/geonetwork/srv/en/main.home?uuid=1d9372b7-3786-448f-b72a-1da2aa1ab6ce

Name (database name: name)

A unique title for the file that helps you quickly understand what it is.

Example value: Airport Other - shape file download package

Description (database name: description)

A text introduction to the data that helps you understand the file in detail.

Example value: Airports that have been classified as abandoned or are not recognized at the national level with an airport identifier. This layer is derived from data provided from Nav Canada as well as from the existing airports NRVIS data class.

Data Range Start (database name: data_range_start)

The start-date and time for the data in the file.

Example value: 2019-05-16

Data Range End (database name: data_range_end)

The end-date and time for the data in the file.

Example value: 2019-05-16

Data made public date (database name: date_publicly_available)

The date the file became available to the public.

Example value: 2019-05-16

Data last updated (database name: data_last_updated)

The date the data within the file was last updated.

Example value: 2019-05-16

Added to Catalogue (database name: created)

The date and time that the file was first listed in the catalogue.

Example value: 20190516T16:06:15+00:00

Type (database name: type)

A label that helps you understand whether a file contains data or just helps you use or understand data.

Example value: data

Format (database name: format)

The extension of the file.

Example value: xls

Language (database name: language)

The language the information in the file is stored in.

Example value: English

Size (database name: size)

The amount of space the data file takes up on a computer.

Example value: 1078

Engagement

We built Ontario’s data catalogue to give you better access to government data.

Here is how you can give us feedback:

  • Contact us.

    As we improve the catalogue, we want you to tell us what you think of it. This catalogue is for you, so the more feedback we get, the better our final product.

  • Participate in user testing.

    Work with us directly by testing the catalogue in our offices. If you are interested in doing this, contact us.

  • Complete a survey.

    You may come across a survey while using the catalogue. Please complete the survey and tell us how we’re doing.