Published In
Publication Number
Page Numbers
Paper Details
Leveraging Cloudera Big Data Platform with Spark ETL and Kafka for Data Processing in the Travel Industry with GDS Integration
Authors
Syed Ziaurrahman Ashraf
Abstract
Integrating Cloudera’s Big Data platform with Apache Kafka and Apache Spark creates a powerful architecture for real-time and batch data processing across industries, particularly in travel. This paper explores how Global Distribution Systems (GDS) in the travel industry can leverage these technologies to optimize data processing, enhance customer experiences, and improve operational efficiencies. We delve into the architecture, use cases, and benefits of this stack within the travel sector. The paper includes technical diagrams, pseudocode, and visual aids to provide an in-depth understanding of the implementation and its impact on GDS.
Keywords
Cloudera, Apache Spark, Apache Kafka, ETL, Real-time Processing, GDS, Travel Industry, Big Data, Data Pipeline, Streaming Data
Citation
Leveraging Cloudera Big Data Platform with Spark ETL and Kafka for Data Processing in the Travel Industry with GDS Integration. Syed Ziaurrahman Ashraf. 2019. IJIRCT, Volume 5, Issue 6. Pages 1-6. https://www.ijirct.org/viewPaper.php?paperId=2410015