Google Storage Connector
Access documents stored in Google Cloud Storage buckets
How it works
The Google Cloud Storage connector pulls in all documents from the specified GCS bucket. It supports various file formats including PDF, DOC, DOCX, TXT, and more.
Documents are updated every 1 day.
Setting up
Authorization
-
Log into your Google Cloud Console.
-
Navigate to “IAM & Admin” > “Service Accounts”.
-
Click “Create Service Account”.
-
Set a name for the new service account (e.g., “onyx-gcs-connector”) and click “Create”.
-
For the role, select “Storage Object Viewer” or another appropriately permissive role and click “Continue”.
-
Click “Done” to finish creating the service account.
-
On the Service Accounts page, find your newly created account and click on it.
-
In the “Keys” section, click “Add Key” > “Create new key”.
-
Choose “JSON” as the key type and click “Create”.
-
A JSON file will be downloaded to your computer. This file contains your credentials.
-
Open the JSON file and note down the following information:
project_id
client_id
(this will be used as your Access Key ID)private_key
(this will be used as your Secret Access Key)
Indexing
- Navigate to the Admin Dashboard and select the Google Cloud Storage Connector.
- In Step 1, provide your GCS credentials:
- Provide your GCS Project ID, Access Key ID, and Secret Access Key for authentication.
- These credentials will be used to access your GCS buckets.
GCS Project ID: Access Key ID: Secret Access Key:
-
Click “Update” to save your credentials.
-
In Step 2, specify which GCS bucket you want to make searchable:
- Click “Connect” to begin indexing.
Understanding Google Cloud Storage Structure
Google Cloud Storage organizes data into buckets. Each bucket can contain an unlimited number of objects (files). You can think of a bucket as a root directory, and the objects as files within that directory.
For more information on Google Cloud Storage structure, visit the Google Cloud Storage documentation.