Hello Airbyte Team,
I have been using Airbyte for approximately six months and am genuinely impressed with its capabilities!
However, I am encountering challenges with efficiently inserting data into my AWS S3-Glue data lake.
I have experimented with several destinations, but each seems to have its own issue:
1. Destination-Data-Lake: Performance is slow and maintenance seems to be lacking. I have had an open PR for about four months without resolution.
2. Destination-S3: Encountering a bug related to data type handling when writing to Parquet (issue with dictionary data types).
3. Destination-Glue: Only supports JSON format, which is not optimal for our needs.
4. Destination-Iceberg: Does not support the Glue data catalog.
Given these challenges, are there any plans on the roadmap to improve support for data lake operations? I believe my use case is fairly common, and robust support could benefit many users.