I've set up automatic data ingestion from a Cloud Storage bucket to a Datastore following the instructions in the official documentation (https://cloud.google.com/generative-ai-app-builder/docs/create-data-store-es#storage-periodic-sync). However, I'm experiencing some issues and would appreciate your insights:
1. Expected vs. Actual Ingestion Time:
- I set up the ingestion yesterday, but as of today, no txt files have been imported as documents in the Datastore.
- What's the typical timeframe for this process to complete?
- In the documentation, the default is 1 hour, but this was not the case.
```
After you set up your data source and import data the first time, data is synced from that source at a frequency that you select during setup. About an hour after the data connector is created, the first sync occurs.
```
- Is there a way to check the progress or status of the ingestion?
- Connector state is Active.
2. Troubleshooting Steps:
- Are there any common issues that might cause delays in ingestion?
- What logs or metrics should I check to diagnose potential problems?
3. Configuration Verification:
- How can I verify that my ingestion setup is correct?
- Are there any specific permissions or settings that are often overlooked?
In the documentation, it is specified that data cannot be manually refreshed.
Any guidance or insights would be greatly appreciated. Thank you in advance for your help!
Hi @pedropcamellon,
Welcome to Google Cloud Community!
Here are some insights into the issues you’re facing with the delayed data ingestion from Cloud Storage to Datastore:
1. Expected vs. Actual Ingestion Time:
According to this documentation:
Depending on the size of your data, ingestion can take several minutes to several hours.
After you set up your data source and import data the first time, data is synced from that source at a frequency that you select during setup. About an hour after the data connector is created, the first sync occurs. The next sync then occurs around 24 hours, 72 hours, or 120 hours later.
Data ingestion time can vary depending on factors like:
For checking the progress of ingestion process, you may try the following:
2. Troubleshooting Steps:
Here are some common issues that might cause delays in ingestion:
Here are some logs and metrics to check to diagnose potential problems:
3. Configuration Verification:
Here are some additional tips to consider:
I hope the above information is helpful.
Is there a way to force the SYNC?
I had erros one day and it stopped updating the data
Hi @gerson_neto ,
For connecting to Cloud Storage with periodic syncing:
Data is synced periodically to the entity data store. You can specify synchronization daily, every three days, or every five days.
I hope this helps.
Hi @ruthseki !
I have it set up to update daily, but since I had an error because I uploaded the wrong extension it is not updating anymore.
That is why I asked if is there a way to force update it.
Thanks for the