Talend Adds ‘Easy Button’ to Automate Big Data Integration in AWS Environments

Talend, a provider of big data integration software, has unveiled a new version of its Talend Integration Cloud, an integration platform-as-a-service (iPaaS) solution with enhancements to help companies manage on-premises, cloud, and hybrid integration workloads in their Amazon Web Services (AWS) environments.

Talend says its Integration Cloud Spring ’16 release, available now, provides an “easy button” for data integration on AWS, allowing IT organizations to execute big data and data integration projects running on AWS Redshift or AWS Elastic MapReduce (EMR) with greater ease, using fewer resources and at a reduced cost.

“If you look at somebody trying to build a data flow and pulling information from sensors, there is a tremendous amount of hand-coding required and that person has to understand each of the services on top of the Amazon platform,” said Ashley Stirrup, chief marketing officer, Talend. “But instead of having to hire high-end developers to do all that with hand-coding you can actually visually build a data flow inside the Talend Studio.” That, he said, opens up the cloud “to a much broader set of developers.”

The updated Talend Integration Cloud solution also enables developers and architects to bring native Spark processing capabilities into an iPaaS solution in order to support departments across the enterprise with real-time analytics.

According to Talend, the continued growth of big data, the Internet of Things (IoT), and expanding corporate data lakes is driving companies to move an increasing amount of data processing to the cloud. Citing Gartner’s prediction that enterprise iPaaS will continue to grow at a double-digit rate over the next five years, Talend says that its Integration Cloud provides the features that businesses require to keep pace with their growing data stores and the related need for faster insights.

“Talend Integration Cloud is built on top of the Amazon and we have done some things with this release in particular to optimize for the Amazon platform, but we can absolutely orchestrate and run data integration jobs anywhere - whether that is on-premise, on Google, or on Azure, so customers have a tremendous amount of flexibility. And, just like they wouldn’t care what data center Salesforce runs on, customers shouldn’t care where Talend Integration Cloud runs,” Stirrup said. In addition, he noted, the solution handles integration from both traditional and newer data sources.

Key features and benefits of the newest version of Talend Integration Cloud include Apache Kafka, Spark Batch, Spark Streaming, and Spark MLlib (machine learning) capabilities that allow IT teams to enrich their real-time analytics and build end-to-end intelligent data pipelines connecting sensors, weblogs, clickstreams, and other big data sources.

“We have found that Spark running on top of Hadoop runs five times faster than MapReduce and that is for the batch data processing – and, with the Spark Streaming, you can support real-time use cases on top of that so there is a tremendous performance benefit,” said Stirrup. In addition, the support for machine learning allows IT to partner with data scientists so they can build their machine-learning algorithms and deploy them very quickly into production, he noted.

Beyond automating AWS EMR and Redshift jobs to help IT teams execute big data integration projects faster and control costs, the new release adds the ability to scale and optimize hybrid integration patterns, enabling IT to expand structured and unstructured datasets without over-extending budgets. It also improves visibility and control of enterprise cloud environments, the vendor says.
