This public dataset contains approximately 9 million URLs and metadata for images that have been annotated with labels spanning more than 6,000 categories.
You can start exploring this data in the BigQuery console.
Sample queries
Here are some examples of SQL queries you can run on this data in BigQuery.
These samples use BigQuery’s support for standard SQL. Use the #standardSQL prefix to let BigQuery know you want to use standard SQL. For more information about the #standardSQL prefix, see Setting a query prefix.
Which labels are in the dataset?
#standardSQL
SELECT
*
FROM
`bigquery-public-data.open_images.dict`
LIMIT
10;
The results are shown here:
+------------+--------------------+
| label_name | label_display_name |
+------------+--------------------+
| /m/0h989 | go |
| /m/03bx7vb | ox |
| /m/0m09 | ale |
| /m/0_k2 | ant |
| /m/01hf_2 | ape |
| /m/0dzf4 | arm |
| /m/0jjw | art |
| /m/0n5v01m | bag |
| /m/01nz0z | bar |
| /m/01h44 | bat |
+------------+--------------------+
Which labels have "bus" in their display names?
#standardSQL
SELECT
*
FROM
`bigquery-public-data.open_images.dict`
WHERE
label_display_name LIKE '%bus%'
LIMIT
20;
The results are shown here:
+------------+--------------------+
| label_name | label_display_name |
+------------+--------------------+
| /m/01bjv | bus |
| /m/04yqq2 | bust |
| /m/015zfz | airbus |
| /m/02539r | sorbus |
| /m/045jsc | minibus |
| /m/01jw_1 | bus stop |
| /m/0c5q0q | mi rebus |
| /m/05jlh5 | saltbush |
| /m/016_bh | shadbush |
| /m/02yvhj | school bus |
| /m/0f6pl | trolleybus |
| /m/015zbk | airbus a330 |
| /m/018rl2 | airbus a380 |
| /m/03qk36c | airport bus |
| /m/03_k0c | busy lizzie |
| /m/0hgryjx | business bag |
| /m/01kqwy | business jet |
| /m/012t_z | businessperson |
| /m/02w11w8 | tour bus service |
| /m/03n9vx | double-decker bus |
+------------+--------------------+
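The LIKE '%bus%' pattern matches any display name that contains the substring "bus", which is why labels such as "bust" and "saltbush" appear in the results. The following Python sketch shows one way to narrow the list to names where "bus" is a separate word. It runs locally over the 20 display names shown above and does not query BigQuery; the function name whole_word_matches is made up for this example.

```python
import re

# Display names returned by the LIKE '%bus%' query above.
display_names = [
    "bus", "bust", "airbus", "sorbus", "minibus", "bus stop", "mi rebus",
    "saltbush", "shadbush", "school bus", "trolleybus", "airbus a330",
    "airbus a380", "airport bus", "busy lizzie", "business bag",
    "business jet", "businessperson", "tour bus service", "double-decker bus",
]

# \b marks a word boundary, so "bus" must appear as a standalone word.
WHOLE_WORD_BUS = re.compile(r"\bbus\b")


def whole_word_matches(names):
    """Return only the names that contain "bus" as a separate word."""
    return [name for name in names if WHOLE_WORD_BUS.search(name)]


bus_labels = whole_word_matches(display_names)
print(bus_labels)
# ['bus', 'bus stop', 'school bus', 'airport bus', 'tour bus service', 'double-decker bus']
```

Note that a whole-word match also drops compound names such as "trolleybus" and "minibus", so it suits cases where you want only the word "bus" itself. In BigQuery, the REGEXP_CONTAINS function should let you apply a similar pattern directly in the WHERE clause, although that variant has not been run against this dataset here.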
How many images of a trolleybus are in the dataset?
#standardSQL
SELECT
COUNT(*)
FROM
`bigquery-public-data.open_images.labels` a
INNER JOIN
`bigquery-public-data.open_images.images` b
ON
a.image_id = b.image_id
WHERE
a.label_name='/m/0f6pl'
AND a.confidence > 0.5;
The results are shown here:
+------+
| f0_ |
+------+
| 3550 |
+------+
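The count query above hard-codes the label ID /m/0f6pl and keeps only rows with a confidence above 0.5. The following Python sketch illustrates the same pattern with one extra join to the dict table, so that you can filter by display name instead of by label ID. It uses an in-memory SQLite database as a stand-in for BigQuery. The table layouts are reduced to the columns used in the queries on this page, the image IDs and confidence values are invented for illustration, and the function name count_images is hypothetical.

```python
import sqlite3

# In-memory SQLite database standing in for BigQuery (toy data only).
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE dict   (label_name TEXT, label_display_name TEXT);
CREATE TABLE labels (image_id TEXT, label_name TEXT, confidence REAL);
CREATE TABLE images (image_id TEXT, subset TEXT);

-- Label IDs and display names are taken from the query results above.
INSERT INTO dict VALUES ('/m/0f6pl', 'trolleybus'), ('/m/01bjv', 'bus');

-- Hypothetical image IDs and confidences, invented for this sketch.
INSERT INTO images VALUES
  ('img1', 'train'), ('img2', 'train'), ('img3', 'validation');
INSERT INTO labels VALUES
  ('img1', '/m/0f6pl', 0.9),
  ('img2', '/m/0f6pl', 0.3),
  ('img3', '/m/0f6pl', 1.0),
  ('img3', '/m/01bjv', 0.8);
""")

# Same shape as the count query above, plus a join to dict so the
# filter can use the display name rather than the label ID.
COUNT_BY_DISPLAY_NAME = """
SELECT COUNT(*) AS image_count
FROM labels AS l
INNER JOIN images AS i ON l.image_id = i.image_id
INNER JOIN dict   AS d ON l.label_name = d.label_name
WHERE d.label_display_name = ?
  AND l.confidence > 0.5
"""


def count_images(display_name):
    """Count label rows above 0.5 confidence for the given display name."""
    row = conn.execute(COUNT_BY_DISPLAY_NAME, (display_name,)).fetchone()
    return row[0]


trolleybus_count = count_images("trolleybus")
print(trolleybus_count)  # 2: the row with confidence 0.3 is filtered out
```

The AS image_count alias gives the count column a readable name in place of the f0_ shown above. The counts in this sketch come from the toy rows only and say nothing about the real dataset.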
What are some landing pages of images with a trolleybus?
#standardSQL
SELECT
original_landing_url,
confidence
FROM
`bigquery-public-data.open_images.labels` l
INNER JOIN
`bigquery-public-data.open_images.images` i
ON
l.image_id = i.image_id
WHERE
label_name='/m/0f6pl'
AND confidence = 1
AND subset='validation'
LIMIT
10;
The results are shown here:
+----------------------------------------------------------+------------+
| original_landing_url | confidence |
+----------------------------------------------------------+------------+
| https://www.flickr.com/photos/gazeronly/6356698903 | 1.0 |
| https://www.flickr.com/photos/hisgett/3453032426 | 1.0 |
| https://www.flickr.com/photos/metrocincinnati/4400806389 | 1.0 |
| https://www.flickr.com/photos/tjc/165330995 | 1.0 |
| https://www.flickr.com/photos/koraxdc/10888199614 | 1.0 |
| https://www.flickr.com/photos/toms/128871696 | 1.0 |
| https://www.flickr.com/photos/tadokoro/8615989093 | 1.0 |
| https://www.flickr.com/photos/sergejf/8706867707 | 1.0 |
| https://www.flickr.com/photos/daveiam/3492373572 | 1.0 |
| https://www.flickr.com/photos/cityoftoronto/10732215443 | 1.0 |
+----------------------------------------------------------+------------+
Sample images:
5 FULTON Transbay Terminal by torbakhopper under CC BY 2.0
Trolley Bus by Tony Hisgett under CC BY 2.0
Trolley bus #1472 by Metro Bus under CC BY 2.0
Intersection by TimothyJ under CC BY 2.0
Which images with cherries are in the training set?
#standardSQL
SELECT
i.image_id AS image_id,
original_url,
confidence
FROM
`bigquery-public-data.open_images.labels` l
INNER JOIN
`bigquery-public-data.open_images.images` i
ON
l.image_id = i.image_id
WHERE
label_name='/m/0f8sw'
AND confidence >= 0.85
AND subset='train'
LIMIT
10;
The results are shown here:
+------------------+-----------------------------------------------------------------+------------+
| image_id | original_url | confidence |
+------------------+-----------------------------------------------------------------+------------+
| 16abc5e3dd5aee38 | https://c2.staticflickr.com/4/3276/2734551390_b3b1f46826_o.jpg | 0.9 |
| 275344e5e05fbd55 | https://c2.staticflickr.com/6/5515/11645877016_a813d091c1_o.jpg | 0.9 |
| cd9f51a7d2909088 | https://c1.staticflickr.com/3/2661/3704400114_0f37df3c76_o.jpg | 0.9 |
| 87754460acc77207 | https://c1.staticflickr.com/5/4138/4913426822_a1539dc915_o.jpg | 0.9 |
| d923fb3fdb415915 | https://c1.staticflickr.com/9/8352/8303394799_d321c27b35_o.jpg | 0.9 |
| 0fbbf595e9eb88b1 | https://c1.staticflickr.com/1/202/508179173_0a112bdedd_o.jpg | 0.9 |
| 0485896eb3297811 | https://c2.staticflickr.com/8/7458/9221235455_02570f8348_o.jpg | 0.9 |
| 118491448098cb46 | https://c1.staticflickr.com/3/2428/3769786809_e12895e412_o.jpg | 0.9 |
| 847212d0c174bff7 | https://c2.staticflickr.com/4/3124/2397453988_1b3819bde3_o.jpg | 0.9 |
| 26ed578010c2126b | https://c2.staticflickr.com/8/7557/15584458587_beeaf99d1d_o.jpg | 0.9 |
+------------------+-----------------------------------------------------------------+------------+
Sample images:
Cherries by Kevin under CC BY 2.0
Bowl of cherries by Rebecca Wilson under CC BY 2.0
Cherries by liz west under CC BY 2.0
Cherry-o by d3adcrab under CC BY 2.0
About the data
Dataset Source: https://github.com/openimages/dataset
Category: Image, Creative Commons
APA-style citation: Google Research (2016). The Open Images dataset [Image URLs and labels]. Available from GitHub: https://github.com/openimages/dataset.
Use: The annotations are licensed by Google Inc. under the CC BY 4.0 license.
The images referenced in the dataset are listed as having a CC BY 2.0 license. Note: while we tried to identify images that are licensed under a Creative Commons Attribution license, we make no representations or warranties regarding the license status of each image and you should verify the license for each image yourself.
Update Frequency: Quarterly
View in BigQuery: Go to Open Images dataset







