Add arguments to pass document parsing and chunking options to google_discovery_engine_data_store #17554

etekin-amfam · 2024-03-12T17:46:14Z

Google's VertexAI Search & Conversation Data Stores have further options to refine how documents are ingested that can significantly improve performance. However, these options need to be passed at creation, and are currently not supported by terraform. As such, we're having to revert to deploying these data stores manually instead of terraform., making things harder to manage. Adding these options to the terraform module so we can deploy data stores via terraform would make maintenance significantly easier and less manual.
For the arguments I'm describing, see https://cloud.google.com/generative-ai-app-builder/docs/parse-chunk-documents#parse-chunk-rag, mainly, a documentProcessingConfig field needs to be specifiable, as shown in https://cloud.google.com/generative-ai-app-builder/docs/parse-chunk-documents#example and https://cloud.google.com/generative-ai-app-builder/docs/parse-chunk-documents#turn-on-chunking

b/330174625

dariowho · 2024-04-29T16:14:06Z

Hi @rileykarson, I can take this one

dariowho · 2024-06-11T08:38:51Z

@etekin-amfam the PR for this issue does not include Layout Parsing and Chunking configuration, as those options were not available in either v1 or v1beta when this was implemented.

Those parameters are now available, I opened a separate issue to include them: #18390

rileykarson added enhancement size/s labels Mar 18, 2024

rileykarson added this to the Goals milestone Mar 18, 2024

rileykarson added the service/discoveryengine label Mar 18, 2024

modular-magician added the forward/linked label Mar 18, 2024

rileykarson assigned dariowho Apr 29, 2024

dariowho mentioned this issue May 22, 2024

[discoveryengine] Add documentProcessingConfig field to DataStore resource GoogleCloudPlatform/magic-modules#10765

Merged

dariowho mentioned this issue Jun 11, 2024

Add layout_parsing_config and chunking_config arguments to google_discovery_engine_data_store #18390

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add arguments to pass document parsing and chunking options to google_discovery_engine_data_store #17554

Add arguments to pass document parsing and chunking options to google_discovery_engine_data_store #17554

etekin-amfam commented Mar 12, 2024 •

edited by modular-magician

Loading

dariowho commented Apr 29, 2024

dariowho commented Jun 11, 2024

Add arguments to pass document parsing and chunking options to google_discovery_engine_data_store #17554

Add arguments to pass document parsing and chunking options to google_discovery_engine_data_store #17554

Comments

etekin-amfam commented Mar 12, 2024 • edited by modular-magician Loading

dariowho commented Apr 29, 2024

dariowho commented Jun 11, 2024

etekin-amfam commented Mar 12, 2024 •

edited by modular-magician

Loading