Dealing with Data

This document describes how to deal with BigQuery data, such as setting parameters and handling nested and repeated fields.

Parameters

REST API methods accept three types of parameters: path parameters, query parameters, and body parameters. The following method signature demonstrates all three parameter types:

PUT https://www.googleapis.com/bigquery/v2/projects/{projectId}/datasets/{datasetId}/tables/{tableId}?userIp="192.0.2.211"
{
   "friendlyName": string,
   "description": string
}

projectId , datasetId, and tableId are all path parameters
userIp is a query parameter
friendlyName and description are both body parameters

The API documentation lists all the query parameters defined specifically by BigQuery. Query parameters that apply to all operations are shown below.

Setting Parameters

Different client libraries expose different techniques for setting these different types of parameters. For example, when using the Python client, you set path and query parameters the same way, but use a different method to set body parameters:

updateResponse = tableCollection.update(projectId='1234',           # Path param
                                        datasetId='5678',           # Path param
                                        tableId='9012',             # Path param
                                        userIp='192.0.2.211'        # Query param
                                        body={'friendlyName':'Donut Count',                 # Body params
                                              'description':'Worldwide donut usage count'}) #

Query parameters that apply to all Google BigQuery API operations are shown in the table below.

Notes (on API keys and auth tokens):

The key parameter is required with every request, unless you provide an OAuth 2.0 token with the request.
You must send an authorization token with every request that requires an OAuth scope. OAuth 2.0 is the only supported authorization protocol.
You can provide an OAuth 2.0 token with any request in one of two ways:
- Using the access_token query parameter like this: ?access_token=oauth2-token
- Using the HTTP Authorization header like this: Authorization: Bearer oauth2-token

All parameters are optional except where noted.

Parameter	Meaning	Notes
`access_token`	OAuth 2.0 token for the current user.	One possible way to provide an OAuth 2.0 token.
`callback`	Callback function.	Name of the JavaScript callback function that handles the response. Used in JavaScript JSON-P requests.
`fields`	Selector specifying a subset of fields to include in the response.	For more information, see the partial response documentation. Use for better performance.
`key`	API key. (REQUIRED*)	*Required unless you provide an OAuth 2.0 token. Your API key identifies your project and provides you with API access, quota, and reports. Obtain your project's API key from the Google Cloud Platform Console.
`prettyPrint`	Returns response with indentations and line breaks.	Returns the response in a human-readable format if `true`. Default value: `true`. When this is `false`, it can reduce the response payload size, which might lead to better performance in some environments.
`quotaUser`	Alternative to `userIp`.	Lets you enforce per-user quotas from a server-side application even in cases when the user's IP address is unknown. This can occur, for example, with applications that run cron jobs on App Engine on a user's behalf. You can choose any arbitrary string that uniquely identifies a user, but it is limited to 40 characters. Overrides `userIp` if both are provided. Learn more about Capping API usage.
`userIp`	IP address of the end user for whom the API call is being made.	Lets you enforce per-user quotas when calling the API from a server-side application. Learn more about Capping API usage.

Paging Through list Results

All collection.list methods return paginated results under certain circumstances. The number of results per page is controlled by the maxResults property.

Method	Pagination criteria	Default `maxResults` value	Maximum `maxResults` value
`Tabledata.list`	Returns paginated results if the response size is more than 10 MB of serialized JSON or more than `maxResults` rows.	100,000	100,000
All other `collection.list` methods	Returns paginated results if the response is more than `maxResults` rows.	50	1,000

If you set maxResults to a value greater than the maximum value listed above, the results are paginated based on the maximum value.

A page is a subset of the total number of rows. If your results are more than one page of data, the result data will have a pageToken property. To retrieve the next page of results, make another list call and include the token value as a URL parameter named pageToken.

The bigquery.tabledata.list method, which is used to page through table data, uses a row offset value or a page token. See Browsing Through Table Data for information.

The following samples demonstrate paging through bigquery results.