Scalable Analytics on Heterogenous Semi-Structured Data
May 6, 2024 12:00 PM
In the modern big data ecosystem, there are plenty of solutions that allow one to perform fast analytics queries on very large datasets, but what if you have a few petabytes of arbitrary JSON? Well, then youโre going to need to build your own solution. In this talk, Dan will give an overview of how this issue was solved at Coralogix with DataPrime, a custom query language and distributed execution engine designed to enable fast analytics on arbitrary semi-structured data (aka JSON) in object storage.