<@U04SE163WC8> how to backfill google search conso...
# ask-ai
s
@kapa.ai how to backfill google search console data with Airbyte
k
To backfill Google Search Console data with Airbyte, you need to set up the Google Search Console connector and configure it to replicate data from a specific start date. Here are the steps to achieve this: ### Prerequisites - A verified property in Google Search Console (or the list of the
Site URLs
(Website URL Properties)). - Google Search Console API enabled for your project (for Airbyte Open Source). ### Step-by-Step Guide #### For Airbyte Cloud: 1. Log in to your Airbyte Cloud account: [Airbyte Cloud](https://cloud.airbyte.com/workspaces). 2. In the left navigation bar, click Sources. In the top-right corner, click + New source. 3. Find and select Google Search Console from the list of available sources. 4. For Source name, enter a name to help you identify this source. 5. For Website URL Property, enter the specific website property in Google Search Console with data you want to replicate. 6. For Start Date, enter the date from which you want to start backfilling data in the format
YYYY-MM-DD
. Any data created on or after this date will be replicated. 7. To authenticate the connection: - Select Oauth from the Authentication dropdown, then click Sign in with Google to authorize your account. 8. (Optional) For End Date, you may optionally provide a date in the format
YYYY-MM-DD
. Any data created between the defined Start Date and End Date will be replicated. Leaving this field blank will replicate all data created on or after the Start Date to the present. 9. (Optional) For Custom Reports, you may optionally provide an array of JSON objects representing any custom reports you wish to query the API with. Refer to the [Custom reports](https://docs.airbyte.com/integrations/sources/google-search-console#custom-reports) section for more information. 10. (Optional) For Data Freshness, you may choose whether to include "fresh" data that has not been finalized by Google. It is recommended to leave this option to its default value of
final
if you are using Incremental sync mode. 11. Click Set up source and wait for the tests to complete. #### For Airbyte Open Source: 1. Follow steps 1-6 as mentioned above. 2. To authenticate the connection: - (Recommended) Select Service Account Key Authorization from the Authentication dropdown, then enter the Admin Email and Service Account JSON Key. - Alternatively, select Oauth from the Authentication dropdown, then enter your Client ID, Client Secret, Access Token, and Refresh Token. 3. Follow steps 8-11 as mentioned above.
To backfill Google Search Console data with Airbyte, you need to set up the Google Search Console connector and configure it to replicate data from a specific start date. Here are the steps to achieve this: ### Prerequisites - A verified property in Google Search Console (or the list of the
Site URLs
(Website URL Properties)). - Google Search Console API enabled for your project (for Airbyte Open Source). ### Step-by-Step Guide #### For Airbyte Cloud: 1. Log in to your Airbyte Cloud account: [Airbyte Cloud](https://cloud.airbyte.com/workspaces). 2. In the left navigation bar, click Sources. In the top-right corner, click + New source. 3. Find and select Google Search Console from the list of available sources. 4. For Source name, enter a name to help you identify this source. 5. For Website URL Property, enter the specific website property in Google Search Console with data you want to replicate. 6. For Start Date, enter the date from which you want to start backfilling data in the format
YYYY-MM-DD
. Any data created on or after this date will be replicated. 7. To authenticate the connection: - Select Oauth from the Authentication dropdown, then click Sign in with Google to authorize your account. 8. (Optional) For End Date, you may optionally provide a date in the format
YYYY-MM-DD
. Any data created between the defined Start Date and End Date will be replicated. Leaving this field blank will replicate all data created on or after the Start Date to the present. 9. (Optional) For Custom Reports, you may optionally provide an array of JSON objects representing any custom reports you wish to query the API with. Refer to the [Custom reports](https://docs.airbyte.com/integrations/sources/google-search-console#custom-reports) section for more information. 10. (Optional) For Data Freshness, you may choose whether to include "fresh" data that has not been finalized by Google. It is recommended to leave this option to its default value of
final
if you are using Incremental sync mode. 11. Click Set up source and wait for the tests to complete. #### For Airbyte Open Source: 1. Follow steps 1-6 as mentioned above. 2. To authenticate the connection: - (Recommended) Select Service Account Key Authorization from the Authentication dropdown, then enter the Admin Email and Service Account JSON Key. - Alternatively, select Oauth from the Authentication dropdown, then enter your Client ID, Client Secret, Access Token, and Refresh Token. 3. Follow steps 8-11 as mentioned above. ### Supported Sync Modes The Google Search Console Source connector supports the following sync modes: - [Full Refresh - Overwrite](https://docs.airbyte.com/understanding-airbyte/connections/full-refresh-overwrite/) - [Full Refresh - Append](https://docs.airbyte.com/understanding-airbyte/connections/full-refresh-append) - [Incremental - Append](https://docs.airbyte.com/understanding-airbyte/connections/incremental-append) - [Incremental - Append + Deduped](https://docs.airbyte.com/understanding-airbyte/connections/incremental-append-deduped) The granularity for the cursor is 1 day, so Incremental Sync in Append mode may result in duplicating the data. For more detailed information, you can refer to the [Google Search Console setup guide](https://docs.airbyte.com/integrations/sources/google-search-console#step-2-set-up-the-google-search-console-
s
@kapa.ai how to find the earliesr available date for Google Search console data
k
A message from kapa.ai
To find the earliest available date for Google Search Console data, you can follow these steps: ### Step-by-Step Guide 1. Log in to Google Search Console: - Go to the [Google Search Console](https://search.google.com/search-console) and log in with your Google account. 2. Select the Property: - Choose the website property for which you want to find the earliest available data. 3. Navigate to Performance Report: - In the left-hand menu, click on Performance. 4. Set the Date Range: - Click on the date filter at the top of the report. - Select the Custom date range option. - Manually set the start date to a very early date (e.g., several years back) and the end date to today. - Apply the date range. 5. Check the Data Availability: - The report will show data starting from the earliest available date up to the end date you set. - Note the earliest date shown in the report. This is the earliest available date for your Google Search Console data. ### Using Airbyte to Backfill Data When setting up the Google Search Console connector in Airbyte, you can use the earliest available date as the Start Date to backfill data from that point onwards. #### For Airbyte Cloud: 1. Log in to your Airbyte Cloud account: [Airbyte Cloud](https://cloud.airbyte.com/workspaces). 2. In the left navigation bar, click Sources. In the top-right corner, click + New source. 3. Find and select Google Search Console from the list of available sources. 4. For Source name, enter a name to help you identify this source. 5. For Website URL Property, enter the specific website property in Google Search Console with data you want to replicate. 6. For Start Date, enter the earliest available date you found in the format
YYYY-MM-DD
. 7. Complete the authentication and other optional fields as needed. 8. Click Set up source and wait for the tests to complete. For more detailed information, you can refer to the [Google Search Console setup guide](https://docs.airbyte.com/integrations/sources/google-search-console#step-2-set-up-the-google-search-console-connector-in-airbyte). ### Additional Information - Google Search Console retains data for websites from the last 16 months. Any data prior to this cutoff point will not be accessible. For more information, see the [Data retention](https://docs.airbyte.com/integrations/sources/google-search-console#limitations--troubleshooting) section in the Airbyte documentation. By following these steps, you can determine the earliest available date for your Google Search Console data and configure Airbyte to backfill data from that date.
s
@kapa.ai do Google Search console exclude any data before sending it via Airbyte? Because the numbers I see from google search console UI is different from the google search console report
k
A message from kapa.ai
New message text here