How to Connect to BigQuery and Manage Your Dataset – Streamline

Integrating Google BigQuery with Streamline lets you leverage cloud-native analytics at scale. This guide walks you through how to connect a BigQuery data source and create reusable datasets for your workflows.

Note: This article is specific to Google BigQuery. If you're working with another database, see our Data Source Library for the appropriate guide.

Part 1: Connect to a BigQuery Data Source

Step 1: Go to the Integrations Page

From your dashboard, navigate to the Integrations section. Click + Add New connection.

Step 2: Select BigQuery as Your Data Source

In the list of data sources, select BigQuery.

Name your connection clearly (e.g., Marketing_BQ, BQ_Analytics_Prod).

Tip: Use a name that helps team members identify the connection’s data purpose or source project.

Step 3: Authenticate Your Account

To connect BigQuery, you'll need to authenticate using Project ID and Sa Key:

Security Tip: Limit access to only the necessary datasets or projects via IAM roles in GCP.

Step 4: Select the Schema

Select your Schema & click Connect to complete setup. Your BigQuery source will now appear in your Integrations list, ready to use.

Step 5: Explore Your BigQuery Data

Navigate to the Data Catalog to see the available BigQuery tables and fields. Use Entity View to browse datasets and tables.

Use Data Field View to explore individual fields, including types and classifications.

Tip: You can filter by Data Source type, search, and sort through tables to quickly find what you need for dataset creation.

Part 2: Create and Manage a Dataset from BigQuery

With BigQuery connected, you can organize cloud-based analytics data into structured datasets for use in projects and workflows.

Step 1: Access the Datasets Section

Go to the Datasets section from the platform’s navigation. Click + Create New Dataset.

Step 2: Define Your Dataset

Enter a descriptive name for the dataset & select your BigQuery data source.

Choose a Primary Entity (a BigQuery table) that will serve as the main data source.

Optionally, assign a label for easier understanding (e.g., rename customer_id to Client ID).

Step 3: Configure Field Access

Review and select fields to include in the dataset. Configure write access if applicable (BigQuery is generally read-only, unless you're using advanced write-back tools).

Click Create when ready.

Alternative: Create a Dataset from the Data Catalog

You can also create a dataset directly from the schema browser:

In the Data Catalog, select a BigQuery table. Click New Dataset from Entity.

Choose the fields and optionally update classifications as covered in previous steps.

You can link additional BigQuery tables to your dataset:

While building your Dataset, click Add Related Entity. Select from available BigQuery tables that have logical relationships with your primary table (e.g., join orders to customers).

Note: BigQuery doesn't enforce foreign keys, but common key fields (like customer_id) are used to relate data.

Optional: Edit an Existing Dataset

To update a dataset:

Go to Datasets and click Edit on the one you want to modify.

You can:

Add/remove related entities
Update labels or classifications
Reorder or hide fields

Click Next & Update to save changes.

Summary

By connecting to BigQuery, you can tap into scalable, cloud-native datasets and organize them for streamlined use in your workflows. Whether you’re building datasets from the catalog or refining them through the dataset manager, you’re in control of how cloud data fits into your project’s structure.

Need help troubleshooting? Check out the BigQuery FAQ or reach out to Support for assistance with credentials, IAM roles, or schema setup.