How it works

The Google Cloud Storage connector pulls in all documents from the specified GCS bucket. It supports various file formats including PDF, DOC, DOCX, TXT, and more. Documents are updated every 1 day.

Setting up

Authorization

  1. Log into your Google Cloud Console.
  2. Navigate to “Cloud Storage” > “Settings” > “Interoperability”. Google Cloud Storage Interoperability
  3. Click “Create a key for a service account”.
  4. Click “Create new account” (or select an existing account and click Create key > skip to Step 8). New service account
  5. Set a name for the new service account (e.g., “onyx-gcs-connector”) and click “Create and continue”. Service account configuration
  6. Click “Select a role” and select Storage object viewer or another, more permissive role.
  7. Click “Done”. Service account permissions
  8. You should be back on the Cloud Storage Interoperability page and see the new credentials. Record both Access key and Secret. HMAC keys

Indexing

  1. Navigate to the Admin Panel > Add Connector > Google Storage.
  2. Click Create New to create a new credential. Onyx Google Cloud Storage credentials
  3. Paste your Access key and Secret (from Step 8) in the form. Click Create and Continue.
  4. Name your connector, specify the GCS bucket you would like to index, and click Create Connector. Onyx Connector creation

Understanding Google Cloud Storage Structure

Google Cloud Storage organizes data into buckets. Each bucket can contain an unlimited number of objects (files). You can think of a bucket as a root directory, and the objects as files within that directory. For more information on Google Cloud Storage structure, visit the Google Cloud Storage documentation.