BoxLoader and BoxBlobLoader
The langchain-box package provides two ways to index your files from Box: BoxLoader and BoxBlobLoader. BoxLoader ingests the text representation of any file that has one in Box. BoxBlobLoader lets you download the blob for any document or image file and process it with the blob parser of your choice.
This notebook covers getting started with both loaders. For detailed documentation of all features and configurations, head to the API Reference pages for BoxLoader and BoxBlobLoader.
Overview
The BoxLoader class helps you get your unstructured content from Box in LangChain's Document format. You can do this with either a List[str] containing Box file IDs or a str containing a Box folder ID.
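The two input modes can be sketched as follows. This is a minimal illustration, not the package's own example: the keyword names box_file_ids, box_folder_id, and box_developer_token are assumptions to verify against the API Reference, box_loader_kwargs and load_documents are hypothetical helpers, and the IDs are placeholders. The import is deferred so the argument-building logic runs without Box credentials.

```python
from typing import Any, Dict, List, Optional


def box_loader_kwargs(
    file_ids: Optional[List[str]] = None,
    folder_id: Optional[str] = None,
) -> Dict[str, Any]:
    """Hypothetical helper: build keyword arguments for exactly one input mode."""
    # Exactly one of the two sources must be given.
    if (file_ids is None) == (folder_id is None):
        raise ValueError("Pass either file_ids or folder_id, not both or neither.")
    if file_ids is not None:
        # Assumed keyword name: box_file_ids
        return {"box_file_ids": list(file_ids)}
    # Assumed keyword name: box_folder_id
    return {"box_folder_id": folder_id}


def load_documents(token: str, **source: Any):
    """Load Documents from Box; needs langchain-box installed and a valid token."""
    # Deferred import: only required when actually talking to Box.
    from langchain_box.document_loaders import BoxLoader

    loader = BoxLoader(box_developer_token=token, **source)
    return loader.load()


# Example argument sets (placeholder IDs, not real Box IDs).
by_files = box_loader_kwargs(file_ids=["12345", "67890"])
by_folder = box_loader_kwargs(folder_id="13579")
```

With real credentials, load_documents(token, **by_files) would return the Documents for those files, assuming the keyword names above match the installed version.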
The BoxBlobLoader class helps you get your unstructured content from Box in LangChain's Blob format. You can do this with a List[str] containing Box file IDs, a str containing a Box folder ID, a search query, or a BoxMetadataQuery.
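As a rough sketch of choosing among these four sources: the keyword names box_file_ids, box_folder_id, query, and box_metadata_query are assumptions to check against the API Reference, and pick_blob_source and iter_blobs are hypothetical helpers. Requiring exactly one source is a conservative choice made for this sketch, not a documented rule of the loader.

```python
from typing import Any, Dict, Iterator

# Assumed keyword names for BoxBlobLoader; verify against the API Reference.
BLOB_SOURCES = ("box_file_ids", "box_folder_id", "query", "box_metadata_query")


def pick_blob_source(**candidates: Any) -> Dict[str, Any]:
    """Hypothetical helper: keep the single source that was actually supplied."""
    unknown = set(candidates) - set(BLOB_SOURCES)
    if unknown:
        raise ValueError(f"Unknown source(s): {sorted(unknown)}")
    # Drop sources left as None, then insist on exactly one remaining.
    chosen = {key: value for key, value in candidates.items() if value is not None}
    if len(chosen) != 1:
        raise ValueError("Supply exactly one source.")
    return chosen


def iter_blobs(token: str, **source: Any) -> Iterator[Any]:
    """Yield Blobs from Box; needs langchain-box installed and a valid token."""
    # Deferred import: only required when actually talking to Box.
    from langchain_box.blob_loaders import BoxBlobLoader

    loader = BoxBlobLoader(box_developer_token=token, **pick_blob_source(**source))
    # yield_blobs() is the standard LangChain BlobLoader interface.
    yield from loader.yield_blobs()


# Example: select a search query as the source (placeholder query text).
search_source = pick_blob_source(query="quarterly report")
```

Each yielded blob can then be handed to whichever blob parser suits the file type.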
When loading files from a folder by folder ID, you can also set a boolean flag that tells the loader to include all sub-folders of that folder.
A Box instance can contain petabytes of files, and folders can contain millions of files. Be intentional about which folders you index. We recommend never loading all files from folder 0 recursively; folder ID 0 is your root folder.
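One way to make this recommendation hard to violate by accident is a small guard in front of the loader. The sketch below assumes the sub-folder flag is exposed as a recursive keyword alongside box_folder_id (verify both in the API Reference); folder_source is a hypothetical helper and the folder ID is a placeholder.

```python
from typing import Any, Dict

ROOT_FOLDER_ID = "0"  # Folder ID 0 is the Box root folder.


def folder_source(folder_id: str, recursive: bool = False) -> Dict[str, Any]:
    """Hypothetical helper: build folder arguments, refusing a recursive root crawl."""
    if recursive and folder_id == ROOT_FOLDER_ID:
        raise ValueError(
            "Refusing to load folder 0 recursively; pick a narrower folder."
        )
    # Assumed keyword names: box_folder_id and recursive.
    return {"box_folder_id": folder_id, "recursive": recursive}


# A specific folder with its sub-folders is allowed (placeholder ID).
project_tree = folder_source("13579", recursive=True)
```

The resulting dictionary can be unpacked into the loader's constructor, so the check runs before any request reaches Box.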
The BoxLoader will skip files without a text representation, while the BoxBlobLoader will return blobs for all document and image files.
Integration details
Class | Package | Local | Serializable | JS support |
---|---|---|---|---|
BoxLoader | langchain_box | ✅ | ❌ | ❌ |
BoxBlobLoader | langchain_box | ✅ | ❌ | ❌ |