Google Research Blog
The latest news from Research at Google
See through the clouds with Earth Engine and Sentinel-1 Data
Monday, August 03, 2015
Posted by Luc Vincent, Engineering Director, Geo Imagery
This year the Google Earth Engine team attended the European Geosciences Union General Assembly meeting in Vienna, Austria, to engage with a number of European geoscientific partners. This was just the first of a series of European summits the team has attended over the past few months, including, most recently, the IEEE Geoscience and Remote Sensing Society meeting held last week in Milan, Italy.
Noel Gorelick presenting Google Earth Engine at EGU 2015.
We are very excited to be collaborating with many European scientists from esteemed institutions such as the European Commission Joint Research Centre, Wageningen University, and the University of Pavia. These researchers are using the Earth Engine geospatial analysis platform to address issues of global importance in areas such as food security, deforestation detection, urban settlement detection, and freshwater availability.
Thanks to the enlightened free and open data policy of the European Commission and the European Space Agency, we are pleased to announce the availability of Copernicus Sentinel-1 data through Earth Engine for visualization and analysis. Sentinel-1, a radar imaging satellite that can see through clouds, is the first of at least six Copernicus satellites going up in the next six years.
Sentinel-1 data visualized using Earth Engine, showing Vienna (left) and Milan (right).
Wind farms seen off the Eastern coast of England.
This radar data offers a powerful complement to the optical and thermal data from satellites like Landsat that are already available in the Earth Engine public data catalog. If you are a geoscientist interested in accessing and analyzing the newly available EC/ESA Sentinel-1 data, or anything else in our multi-petabyte data catalog, please sign up for Google Earth Engine.
We look forward to further engagements with the European research community and are excited to see what the world will do with the data from the European Union's Copernicus program satellites.
A Multilingual Corpus of Automatically Extracted Relations from Wikipedia
Tuesday, June 02, 2015
Posted by Shankar Kumar, Google Research Scientist and Manaal Faruqui, Carnegie Mellon University PhD candidate
In Natural Language Processing, relation extraction is the task of assigning a semantic relationship to a pair of arguments. For example, the relationship between the phrases “Ottawa” and “Canada” is “is the capital of”. Extracted relations like these can be used in a variety of applications, ranging from Question Answering to building databases from unstructured text.
While relation extraction systems work accurately for English and a few other languages, where tools for syntactic analysis such as parsers, part-of-speech taggers, and named entity analyzers are readily available, relatively little work has been done for most of the world's languages, where such linguistic analysis tools do not yet exist. Fortunately, because we do have translation systems between English and many other languages (such as Google Translate), we can translate text from a non-English language to English, perform relation extraction there, and project the extracted relations back to the original language.
Relation extraction in a Spanish sentence using the cross-lingual relation extraction pipeline.
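The projection idea above can be sketched in a few lines of Python. This is a toy illustration, not Google's actual system: every function here is a hypothetical stand-in (a real pipeline would use a machine translation service that exposes word alignments and a trained English open relation extractor), and the Spanish sentence is hard-coded for the example.

```python
# Toy sketch of cross-lingual relation projection. All functions are
# illustrative stand-ins, hard-coded for the single example below.

def translate_with_alignment(source_tokens):
    """Hypothetical MT step: English translation plus a word-alignment
    map from English token index -> source-language token index."""
    english = "Ottawa is the capital of Canada".split()
    alignment = {0: 0, 1: 1, 2: 2, 3: 3, 4: 4, 5: 5}
    return english, alignment

def extract_relation_en(tokens):
    """Hypothetical English open relation extractor: returns
    (arg1, relation, arg2) spans as half-open token-index ranges."""
    return (0, 1), (1, 5), (5, 6)

def project(span, alignment):
    """Map an English token span back to source-language indices."""
    indices = [alignment[i] for i in range(*span) if i in alignment]
    return min(indices), max(indices) + 1

sentence_es = "Ottawa es la capital de Canadá".split()
english, alignment = translate_with_alignment(sentence_es)
arg1, rel, arg2 = extract_relation_en(english)

# Project each English span back to Spanish and read off the tuple.
tuple_es = tuple(
    " ".join(sentence_es[slice(*project(span, alignment))])
    for span in (arg1, rel, arg2)
)
print(tuple_es)  # ('Ottawa', 'es la capital de', 'Canadá')
```

With a real aligner the mapping is many-to-many and noisy, which is exactly why the released dataset includes human annotations to measure projection quality.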
In Multilingual Open Relation Extraction Using Cross-lingual Projection, which will appear at the 2015 Conference of the North American Chapter of the Association for Computational Linguistics – Human Language Technologies (NAACL HLT 2015), we use this idea of cross-lingual projection to develop an algorithm that extracts open-domain relation tuples, i.e. tuples in which an arbitrary phrase can describe the relation between the arguments, in multiple languages from Wikipedia. In this work, we also evaluated the quality of the extracted relations using human annotations in French, Hindi, and Russian.
Since no such publicly available corpus of multilingual relations exists, we are releasing a dataset of automatically extracted relations from the Wikipedia corpus in 61 languages, along with manually annotated relations in three languages (French, Hindi, and Russian). We hope this data will help researchers working on natural language processing and encourage novel applications in a wide variety of languages. More details on the corpus and the file formats can be found in this README file.
We wish to thank Bruno Cartoni, Vitaly Nikolaev, Hidetoshi Shimokawa, Kishore Papineni, John Giannandrea, and their teams for making this data release possible. This dataset is licensed by Google Inc. under the Creative Commons Attribution-ShareAlike 3.0 License.
Teaching machines to read between the lines (and a new corpus with entity salience annotations)
Monday, August 25, 2014
Posted by Dan Gillick, Research Scientist, and Dave Orr, Product Manager
Language understanding systems are largely trained on freely available data, such as the Penn Treebank, perhaps the most widely used linguistic resource ever created. We have previously released lots of linguistic data ourselves, to contribute to the language understanding community as well as to encourage further research in these areas.
Now, we’re releasing a new dataset based on another great resource: the New York Times Annotated Corpus, a set of 1.8 million articles spanning 20 years. 600,000 articles in the NYTimes Corpus have hand-written summaries, and more than 1.5 million of them are tagged with the people, places, and organizations mentioned in the article. The Times encourages use of the metadata for all kinds of things, and has set up a forum to discuss related research.
We recently used this corpus to study a topic called “entity salience”. To understand salience, consider: how do you know what a news article or a web page is about? Reading comes pretty easily to people -- we can quickly identify the places or things or people most central to a piece of text. But how might we teach a machine to perform this same task? This problem is a key step towards being able to read and understand an article.
One way to approach the problem is to look for words that appear more often than their ordinary rates. For example, if you see the word “coach” 5 times in a 581-word article, and compare that to the usual frequency of “coach” -- more like 5 in 330,000 words -- you have reason to suspect the article has something to do with coaching. The term “basketball” is even more extreme, appearing 150,000 times more often than usual. This is the idea behind the famous TF-IDF score, long used to index web pages.
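The term-ratio intuition works out to a one-line calculation. Here is a minimal sketch using the counts quoted above (this is the intuition only, not Google's indexing code or the full TF-IDF formula):

```python
# Minimal sketch of the term-ratio intuition: how much more frequent
# is a term in this document than in background text overall?
def frequency_ratio(count_in_doc, doc_len, background_count, background_len):
    doc_rate = count_in_doc / doc_len
    background_rate = background_count / background_len
    return doc_rate / background_rate

# "coach": 5 times in a 581-word article vs. roughly 5 per 330,000 words.
ratio = frequency_ratio(5, 581, 5, 330_000)
print(round(ratio))  # about 568x the usual rate: evidence the article is about coaching
```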
Congratulations to Becky Hammon, first female NBA coach! Image via Wikipedia.
Term ratios are a start, but we can do better. Search indexing these days is much more involved, using, for example, the distances between pairs of words on a page to capture their relatedness. Now, with the Knowledge Graph, we are beginning to think in terms of entities and relations rather than keywords. “Basketball” is more than a string of characters; it is a reference to something in the real world, which we already know quite a bit about.
Background information about entities ought to help us decide which of them are most salient. After all, an article’s author assumes her readers have some general understanding of the world, and probably a bit about sports too. Using background knowledge, we might be able to infer that the WNBA is a salient entity in the Becky Hammon article even though it only appears once.
To encourage research on leveraging background information, we are releasing a large dataset of annotations to accompany the New York Times Annotated Corpus, including resolved Freebase entity IDs and labels indicating which entities are salient. The salience annotations are determined by automatically aligning entities in the document with entities in the accompanying human-written abstracts. Details of the salience annotations and some baseline results are described in our recent paper: A New Entity Salience Task with Millions of Training Examples (Jesse Dunietz and Dan Gillick).
Since our entity resolver works better for named entities like WNBA than for nominals like “coach” (this is the notoriously difficult word sense disambiguation problem, which we’ve previously touched on), the annotations are limited to names.
Below is sample output for a document. The first line contains the NYT document ID and the headline; each subsequent line includes an entity index, an indicator for salience, the mention count for this entity in the document as determined by our coreference system, the text of the first mention of the entity, the byte offsets (start and end) for the first mention of the entity, and the resolved Freebase MID.
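A record in that shape could be parsed along the following lines. This is a speculative sketch based only on the field list above: the tab separator and the sample line are assumptions invented for illustration, not taken from the released data.

```python
# Speculative parser for the entity-annotation record described above:
# entity index, salience indicator, mention count, text of the first
# mention, byte offsets (start, end) of that mention, and Freebase MID.
# Field order matches the post's description; the separator is assumed.
from typing import NamedTuple

class EntityAnnotation(NamedTuple):
    index: int
    salient: bool
    mention_count: int
    first_mention: str
    start: int
    end: int
    freebase_mid: str

def parse_entity_line(line: str) -> EntityAnnotation:
    idx, sal, count, mention, start, end, mid = line.rstrip("\n").split("\t")
    return EntityAnnotation(int(idx), sal == "1", int(count),
                            mention, int(start), int(end), mid)

sample = "0\t1\t4\tWNBA\t112\t116\t/m/0c41n"  # hypothetical record
ann = parse_entity_line(sample)
print(ann.first_mention, ann.salient)  # WNBA True
```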
Features like mention count and document positioning give reasonable salience predictions. But because they only describe what’s explicitly in the document, we expect a system that uses background information to expose what’s implicit could give better results.
Download the data directly from Google Drive, or visit the project home page at our Google Code site for more information. We look forward to seeing what you come up with!
CDC Birth Vital Statistics in BigQuery
Friday, January 13, 2012
Posted by Dan Vanderkam, Software Engineer
Google’s BigQuery Service lets enterprises and developers crunch large-scale data sets quickly. But what if you don’t have a large-scale data set of your own?
To help the data-less masses, BigQuery offers several large, public data sets. One of these is the natality data set, which records information about live births in the United States. The data is derived from the Division of Vital Statistics at the Centers for Disease Control and Prevention, which has collected an electronic record of birth statistics since 1969. It is one of the longest-running electronic records in existence.
Each row in this database represents a live birth. Using simple queries, you can discover fascinating trends from the last forty years.
For example, here’s the average age of women giving birth to their first child:
The average age has increased from 21.3 years in 1969 to 25.1 years in 2008. Using more complex queries, one could analyze the factors that have contributed to this increase, e.g. whether it can be explained by the changing racial/ethnic composition of the population.
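The query behind a chart like this is a simple group-by average. The sketch below simulates it in plain Python on a few invented rows so it runs without a BigQuery account; the column names (year, mother_age, ever_born) follow the public natality sample schema as we understand it, and should be treated as assumptions.

```python
# Roughly the query behind the chart:
#   SELECT year, AVG(mother_age) FROM natality
#   WHERE ever_born = 1 GROUP BY year
# simulated here over a handful of invented rows.
from collections import defaultdict

rows = [  # (year, mother_age, ever_born) -- illustration data only
    (1969, 20, 1), (1969, 22, 1), (1969, 30, 3),
    (2008, 24, 1), (2008, 26, 1), (2008, 35, 2),
]

totals = defaultdict(lambda: [0, 0])  # year -> [sum of ages, count]
for year, mother_age, ever_born in rows:
    if ever_born == 1:  # restrict to first children
        totals[year][0] += mother_age
        totals[year][1] += 1

avg_age = {year: s / n for year, (s, n) in sorted(totals.items())}
print(avg_age)  # {1969: 21.0, 2008: 25.0}
```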
You can see more examples like this one on the BigQuery site.
More Google Cluster Data
Tuesday, November 29, 2011
Posted by John Wilkes, Principal Software Engineer
Google has a strong interest in promoting high quality systems research, and we believe that providing information about real-life workloads to the academic community can help.
In support of this, we published a small (7-hour) sample of resource-usage information from a Google production cluster in 2010 (see our research blog post on Google Cluster Data). Approximately a dozen researchers at UC Berkeley, CMU, Brown, NCSU, and elsewhere have made use of it.
Recently, we released a larger dataset. It covers a longer period of time (29 days) for a larger cell (about 11k machines) and includes significantly more information, including:
the original resource requests, to permit scheduling experiments
request constraints and machine attributes
machine availability and failure events
some of the reasons for task exits
(obfuscated) job and job-submitter names, to help identify repeated or related jobs
more types of usage information
CPI (cycles per instruction) and memory traffic for some of the machines
Note that this trace primarily provides data about resource requests and usage. It contains no information about end users, their data, or access patterns to storage systems and other services.
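As a quick aside on one of the new fields: CPI is simply cycles divided by retired instructions over a sampling interval. The counter values below are invented for illustration.

```python
# CPI (cycles per instruction), one of the per-machine measurements
# included in the larger trace. Counter values here are invented.
def cycles_per_instruction(cycles, instructions):
    """CPI = CPU cycles / retired instructions for an interval."""
    return cycles / instructions

# e.g. 3.0e9 cycles and 2.0e9 instructions in a sampling window:
cpi = cycles_per_instruction(3.0e9, 2.0e9)
print(cpi)  # 1.5 -- a higher CPI often indicates stalls, e.g. cache misses
```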
More information can be found via this link, which will (after a short questionnaire) take you to a site that provides access instructions, a description of the data schema, and information about how the data was derived and what it means.
We hope this data will facilitate a range of research in cluster management. Let us know if you find it useful, are willing to share tools that analyze it, or have suggestions for how to improve it.
Slicing and dicing data for interactive visualization
Monday, February 28, 2011
Posted by Benjamin Yolken, Google Public Data Product Manager
A year ago, we introduced the Google Public Data Explorer, a tool that allows users to interactively explore public-interest datasets from a variety of influential sources like the World Bank, IMF, Eurostat, and the US Census Bureau. Today, users can visualize over 300 metrics across 31 datasets, including everything from labor productivity (OECD) to Internet speed (Ookla) to gender balance in parliaments (UNECE) to government debt levels (IMF) to population density by municipality (Statistics Catalonia), with more data being added every week.
Last week, as part of the launch of our dataset upload interface, we released one of the key pieces of technology behind the product: the Dataset Publishing Language (DSPL). We created this format to address a key problem in the Public Data Explorer and other similar tools: existing data formats don’t provide enough information to support easy yet powerful data exploration by non-technical users.
DSPL addresses this by adding a layer of metadata on top of the raw, tabular data in a dataset. This metadata, expressed in XML, describes the concepts in the dataset, for instance “country”, “gender”, “population”, and “unemployment”, giving descriptions, URLs, formatting properties, etc. for each. These concepts are then referenced in slices, which partition them into dimensions (i.e., categories) and metrics (i.e., quantitative values) and link them to the underlying data tables (provided in CSV format). This structure, along with some additional metadata, is what allows us to provide rich, interactive dataset visualizations in the Public Data Explorer.
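To make the concept/slice structure concrete, here is a heavily abbreviated sketch of what such a definition might look like. It is based on our reading of the format description above, omits required namespaces and attributes, and is not a complete or validated DSPL file.

```xml
<!-- Abbreviated DSPL-style sketch (not a valid, complete file):
     a "country" dimension concept, a "population" metric concept,
     and a slice tying them to an underlying CSV-backed table. -->
<dspl>
  <concepts>
    <concept id="country">
      <info><name><value>Country</value></name></info>
    </concept>
    <concept id="population">
      <info><name><value>Population</value></name></info>
    </concept>
  </concepts>
  <slices>
    <slice id="population_by_country">
      <dimension concept="country"/>
      <metric concept="population"/>
      <table ref="population_table"/>  <!-- rows supplied as CSV -->
    </slice>
  </slices>
</dspl>
```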
With the release of DSPL, we hope to accelerate the process of making the world’s datasets searchable, visualizable, and understandable, without requiring a PhD in statistics. We encourage you to read more about the format and try it out yourself, both in the Public Data Explorer and in your own software. Stay tuned for more DSPL extensions and applications in the future!