docs: Comprehensive SAP HANA Enterprise CDC documentation update #69105

devin-ai-integration · 2025-10-30T17:06:22Z

What

Updates the SAP HANA Enterprise connector documentation to provide comprehensive CDC setup guidance and fix technical inaccuracies. This addresses the user request to improve CDC documentation accuracy and model the structure after the Db2 connector documentation.

Key improvements:

Complete CDC setup section with Python automation script
Detailed CDC behavior and prerequisites documentation
Fixed typos and formatting issues (e.g., "n." typo, "Prequisities" → "Prerequisites", markdown escaping)
Enhanced configuration reference with missing parameters (database, filters)
Updated features table to include "Replicate Incremental Deletes"

Requested by ian.alton@airbyte.io - Session: https://app.devin.ai/sessions/275c29b177454508a5b15b7945240b3e

How

Restructured documentation following Db2 connector model:
- Simplified Getting Started section
- Comprehensive CDC section with prerequisites, setup script, configuration, behavior, and considerations
- Enhanced data type mapping table with consistent formatting
Added Python CDC setup script (330 lines) to automate:
- CDC schema creation (_ab_cdc)
- Tracking table creation with before/after columns
- Trigger creation for INSERT/UPDATE/DELETE operations
- Supports multiple workflows (all tables in schema, specific tables, file input)
Documented CDC behavior including:
- Initial snapshot phase vs incremental CDC phase
- Automatic cleanup of processed change records
- CDC metadata fields added to records
- Permission requirements (setup-time vs runtime)
Fixed technical issues:
- Corrected markdown escaping \(Yes/No\) → (Yes/No)
- Fixed typo "n." at end of opening paragraph
- Fixed spelling "Prequisities" → "Prerequisites"
- Changed uppercase Airbyte type names to lowercase (e.g., BOOLEAN → boolean)
- Added grammar fix "with is" → "which is"

Review guide

High Priority - Requires Testing

Python CDC Setup Script (cdc_setup_sap_hana.py in documentation)
- ⚠️ NOT TESTED on actual SAP HANA database - created from code analysis
- Review SQL syntax for SAP HANA compatibility (especially CREATE TRIGGER syntax)
- Verify column type handling for VARCHAR, DECIMAL, etc.
- Check trigger naming convention and referencing syntax
- Validate connection handling with hdbcli library
Trigger Creation Syntax
- Lines 268-287: Verify REFERENCING NEW AS N / OLD AS O syntax
- Lines 289-298: Validate trigger body INSERT statement structure
- Confirm AFTER INSERT/UPDATE/DELETE vs BEFORE timing

Medium Priority - Technical Accuracy

CDC Behavior Documentation
- Lines 459-497: Verify CDC behavior descriptions match actual connector implementation
- Confirm initial snapshot timeout behavior
- Validate automatic cleanup description
Permission Requirements
- Lines 485-494: Validate setup-time vs runtime permission breakdown
- Confirm CREATE SCHEMA, CREATE TABLE, CREATE TRIGGER privileges are accurate
- Verify SELECT and DELETE permissions on tracking tables
Configuration Parameters
- Line 545: Added database parameter - verify this is correct for multi-tenant systems
- Line 547: Added filters parameter - confirm this exists in connector spec

Low Priority - Editorial

Content Structure
- Compare with Db2 docs to ensure similar flow
- Verify all sections are appropriate for SAP HANA (some details may differ from Db2)

User Impact

Positive:

Users can now easily set up CDC for SAP HANA using the provided Python script
Clear documentation of CDC prerequisites, behavior, and permissions reduces setup errors
Fixed typos and improved formatting enhance readability
Comprehensive examples help users get started quickly

Potential Negative:

If Python script has bugs, users may encounter errors during CDC setup (requires DBA involvement to fix)
Documentation may need updates after testing script on actual SAP HANA

Can this PR be safely reverted and rolled back?

YES 💚

This is a documentation-only change with no code modifications to the connector itself. Reverting would restore the previous (less comprehensive) documentation.

Note: The Python CDC setup script included in the documentation has not been tested on an actual SAP HANA database and should be validated before users rely on it for production setups.

- Restructure documentation to follow Db2 connector model - Add comprehensive CDC setup guide with Python script - Include detailed trigger-based CDC implementation explanation - Add CDC prerequisites, configuration, and behavior sections - Improve data type mapping table formatting - Add configuration reference with database parameter - Fix typos and improve clarity throughout Co-Authored-By: ian.alton@airbyte.io <ian.alton@airbyte.io>

devin-ai-integration · 2025-10-30T17:06:25Z

Original prompt from ian.alton@airbyte.io

@Devin Run !connectordocs on the SAP Hana Enterprise Connector. No specific commit, just use the current state of the connector today. Focus in particular on ensuring CDC documentation is correct.

As much as possible, it would be helpful if the SAP Hana documentation had a similar flow to the Db2 documentation here: <https://docs.airbyte.com/integrations/enterprise-connectors/source-db2-enterprise>. Although it's unlikely the two work in exactly the same way, you can use the Db2 content as a model for the type of result you want to have here. Just know that SAP Hana and Db2 are not the same thing.

As a starting point, here is the investigation you completed earlier:

_"SAP Hana Enterprise Connector CDC Implementation_

I've thoroughly investigated the SAP Hana Enterprise connector's Change Data Capture (CDC) implementation. Here's a comprehensive breakdown of how it works:

_Overview: Trigger-Based CDC Approach_

The SAP Hana connector uses a _trigger-based CDC approach_ rather than log-based CDC (like Debezium). This means it relies on database triggers that capture changes to trigger tables in a dedicated schema.

<https://github.com/airbytehq/airbyte/blob/074e6ccd/docs/integrations/enterprise-connectors/source-sap-hana.md#L43-L53|airbytehq/airbyte:source-sap-hana.md:43-53>



_Configuration_

CDC is configured through two main settings:



1. _Initial Load Timeout_: Determines how long the initial snapshot can run before switching to incremental CDC (4-24 hours, default 8 hours)
2. _Invalid CDC Cursor Position Behavior_: Controls whether to fail sync or re-sync data when the cursor position becomes invalid
<https://github.com/airbytehq/airbyte-enterprise/blob/bf5df191/airbyte-integrations/connectors/source-sap-hana/src/main/kotlin/io/airbyte/integrations/source/sap_hana/SapHanaSourceConfigurationSpecification.kt#L322-L353|airbytehq/airbyte-enterprise:SapHanaSourceConfigurationSpecification.kt:322-353>

The configuration is parsed and stored as a `CdcIncremental... (14502 chars truncated...)

devin-ai-integration · 2025-10-30T17:06:26Z

🤖 Devin AI Engineer

I'll be helping with this pull request! Here's what you should know:

✅ I will automatically:

Address comments on this PR. Add '(aside)' to your comment to have me ignore it.
Look at CI failures and help fix them

Note: I can only respond to comments from users who have write access to this repository.

⚙️ Control Options:

Disable automatic comment and CI monitoring

github-actions · 2025-10-30T17:06:50Z

👋 Greetings, Airbyte Team Member!

Here are some helpful tips and reminders for your convenience.

Helpful Resources

Breaking Changes Guide - Breaking changes, migration guides, and upgrade deadlines
Developing Connectors Locally
Managing Connector Secrets
On-Demand Live Tests
On-Demand Regression Tests
#connector-ci-issues
#connector-publish-updates
#connector-build-statuses

PR Slash Commands

Airbyte Maintainers (that's you!) can execute the following slash commands on your PR:

/format-fix - Fixes most formatting issues.
/bump-version - Bumps connector versions.
- You can specify a custom changelog by passing changelog. Example: /bump-version changelog="My cool update"
- Leaving the changelog arg blank will auto-populate the changelog from the PR title.
/run-cat-tests - Runs legacy CAT tests (Connector Acceptance Tests)
/build-connector-images - Builds and publishes a pre-release docker image for the modified connector(s).
JVM connectors:
- /update-connector-cdk-version connector=<CONNECTOR_NAME> - Updates the specified connector to the latest CDK version.
  Example: /update-connector-cdk-version connector=destination-bigquery
- /bump-bulk-cdk-version type=patch changelog='foo' - Bump the Bulk CDK's version. type can be major/minor/patch.
Python connectors:
- /poe connector source-example lock - Run the Poe lock task on the source-example connector, committing the results back to the branch.
- /poe source example lock - Alias for /poe connector source-example lock.
- /poe source example use-cdk-branch my/branch - Pin the source-example CDK reference to the branch name specified.
- /poe source example use-cdk-latest - Update the source-example CDK dependency to the latest available version.

📝 Edit this welcome message.

devin-ai-integration · 2025-10-30T17:07:06Z

AI-Generated Documentation Update

This PR was created by Devin (AI) based on a request from ian.alton@airbyte.io to update the SAP HANA Enterprise connector documentation with comprehensive CDC setup guidance.

Review Notes:

The documentation has been restructured to follow the Db2 connector documentation model
A Python CDC setup script has been added but has not been tested on an actual SAP HANA database
The script was created through code analysis of the connector implementation
Reviewers should validate the SQL syntax and trigger creation logic for SAP HANA compatibility

Reviewer Actions:

✅ Merge if the documentation improvements are accurate and helpful
✏️ Modify if changes are needed (feel free to push commits or request changes)
❌ Close if you disagree with the approach or content

Devin Session: https://app.devin.ai/sessions/275c29b177454508a5b15b7945240b3e

You can also review and modify this work directly in the Devin webapp IDE at the session link above.

github-actions · 2025-10-30T17:08:07Z