
Google Cloud Storage File

Google Cloud Storage is a managed service for storing unstructured data.

This guide covers how to load document objects from a Google Cloud Storage (GCS) file object (blob).

%pip install --upgrade --quiet  langchain-google-community[gcs]
from langchain_google_community import GCSFileLoader
loader = GCSFileLoader(project_name="aist", bucket="testing-hwc", blob="fake.docx")
loader.load()
/Users/harrisonchase/workplace/langchain/.venv/lib/python3.10/site-packages/google/auth/_default.py:83: UserWarning: Your application has authenticated using end user credentials from Google Cloud SDK without a quota project. You might receive a "quota exceeded" or "API not enabled" error. We recommend you rerun `gcloud auth application-default login` and make sure a quota project is added. Or you can use service accounts instead. For more information about service accounts, see https://cloud.google.com/docs/authentication/
warnings.warn(_CLOUD_SDK_CREDENTIALS_WARNING)
[Document(page_content='Lorem ipsum dolor sit amet.', lookup_str='', metadata={'source': '/var/folders/y6/8_bzdg295ld6s1_97_12m4lr0000gn/T/tmp3srlf8n8/fake.docx'}, lookup_index=0)]

By default, the downloaded file is parsed with UnstructuredFileLoader. If you want to use an alternative loader, you can pass a custom function as loader_func, for example:

from langchain_community.document_loaders import PyPDFLoader


def load_pdf(file_path):
    # Receives the local path of the downloaded blob and returns a loader for it.
    return PyPDFLoader(file_path)


loader = GCSFileLoader(
    project_name="aist", bucket="testing-hwc", blob="fake.pdf", loader_func=load_pdf
)
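
The same hook can choose a loader based on the blob's file extension. The following is a minimal sketch, not part of the library: the names LOADER_BY_SUFFIX, loader_name_for, and load_by_extension are hypothetical, and the extension-to-loader mapping is an assumption chosen for illustration. It only relies on loader classes exported by langchain_community.document_loaders (PyPDFLoader, TextLoader, UnstructuredFileLoader), which are imported lazily so the helper can be defined without the package installed.

```python
from pathlib import Path

# Hypothetical mapping from file suffix to a loader class name exported by
# langchain_community.document_loaders. Extend it as needed.
LOADER_BY_SUFFIX = {
    ".pdf": "PyPDFLoader",
    ".txt": "TextLoader",
}

# Assumed fallback for any suffix not listed above.
DEFAULT_LOADER = "UnstructuredFileLoader"


def loader_name_for(file_path):
    """Return the loader class name to use for a path (case-insensitive suffix)."""
    suffix = Path(file_path).suffix.lower()
    return LOADER_BY_SUFFIX.get(suffix, DEFAULT_LOADER)


def load_by_extension(file_path):
    """Build a loader for the downloaded file; suitable as a loader_func."""
    # Imported here so this module can be loaded without langchain installed.
    import langchain_community.document_loaders as loaders

    loader_cls = getattr(loaders, loader_name_for(file_path))
    return loader_cls(file_path)


# Usage (requires GCS credentials, so it is left commented out):
# loader = GCSFileLoader(
#     project_name="aist",
#     bucket="testing-hwc",
#     blob="fake.pdf",
#     loader_func=load_by_extension,
# )
# docs = loader.load()
```

Keeping the suffix lookup separate from the loader construction makes the selection logic easy to check without touching GCS or installing any parser.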
