Apache Spark has become the de facto standard for processing data at scale, whether for querying large datasets, training machine learning models to predict future trends, or processing streaming data ...
What I'd like to cover here goes beyond those AI headlines, however, and involves a special nugget just for folks doing data engineering, analytics and machine learning work with Apache Spark.