Top 5 challenges and recipes when starting Apache Spark project

Intro Apache Spark is the most advanced and powerful computation engine intended for big data analytics. Beside data engine it also provides libraries for streaming (Spark Streaming), machine learning (MLlib) and graph processing (GraphX). Historically Spark emerged as a successor of Hadoop ecosystem with a following key advantages: Spark provides…

How to work with Big Data from Java Spring applications

Intro This article shows how to get around all tough stuff related to Big Data infrastructure, how to work with data fast and comfortably, without thinking about code deployment, keeping focused on business goals and getting things done as quickly as possible. And Zentadata Platform is an answer. It is…

How Data Driven Enterprise makes business effective?

Intro There is a lot of information about benefits of Data Driven Enterprise. With the most prominent characteristics like following: Leverage data to prove multiple theories and choose the best one Continuously research business data to find new opportunities Seamless integration of ML into business processes to gain extra revenue…