Patterns for Ingesting SaaS Data into AWS Data Lakes AWS Whitepaper
python custom connector SDK as well as java custom connector SDK for AppFlow. The AppFlow
Custom Connector SDK enables customers and third-party developers to build custom source
and/or destination connectors for the AppFlow service. With the SDK, you can connect to private
APIs, on-premise proprietary systems, and other cloud services by adding to AppFlow's library of
connectors.
If any of the following scenarios apply, other ingestion patterns discussed in this paper may be a
better fit for your type of ingestion:
• A supported application is heavily customized.
• Your use case exceeds any of the application-specific limitations.
For every SaaS application that Amazon AppFlow supports, there are a set of limitations included.
For example, if you are transferring more than one million Salesforce records, you cannot choose
any Salesforce compound field. Before using Amazon AppFlow, look for the limitations for the
application that you are planning to connect, evaluate your use case against those limitations, and
see if the service is still a good fit for what you are trying to do.
SaaS applications are sometimes heavily customized, so it’s always good to make sure the
edge cases can be solved with Amazon AppFlow. You can find the list of known limitations and
considerations in the notes section of the Amazon AppFlow documentation. For example, the
known limitations for Salesforce as a source are listed here.
Also, consider the Amazon AppFlow service quotas to ensure your use case fits well within those
limitations.
Extract, transform and load (ETL) using custom connectors with
Apache Spark
AWS Glue: Introduction
AWS Glue is a serverless data integration service that makes it easy to discover, prepare, and
combine data for analytics, machine learning, and application development.AWS Glue provides all
the capabilities needed for data integration so that you can start analyzing your data and putting it
to use in minutes instead of months.
AWS Glue custom connectors makeit easy to discover and integrate with a variety of additional
data sources, such as SaaS applications or your custom data sources. With just a few clicks, you can
Extract, transform and load (ETL) using custom connectors with Apache Spark 4