Apache Spark | WiseWithData

When did PySpark become THE modernization path for legacy SAS?

by Ian J. Ghent | Mar 2, 2023 | Apache Spark, Apache Spark Cafe, Python, SAS

Back in 2015, when my colleagues and I first started reading about and experimenting with Apache Spark and PySpark, we knew there was something special brewing in open analytics. Being in the data and analytics world since 1997, I’ve seen lots of trends come and go....

What does a 100x improvement really look like?

by Ian J. Ghent | Sep 28, 2022 | Apache Spark, Apache Spark Cafe, Python

I’m often called upon to provide guidance on why adopting modern analytics is so important. In 2022, all the best features and functionality are in the open source world, especially the market leading Databricks and the PySpark language. Those capabilities are...

My Tumultuous, Sometimes Frustrating Yet Thrilling Journey from SAS to Python and PySpark

by Luigi (Lou) Di Serio | Aug 16, 2022 | Apache Spark, Apache Spark Cafe, Python, SAS

Mental Illness and Obsolete Skills October 2019 was the beginning of a very difficult and dark period for me. In truth, the decline began like a slow-motion train wreck during Christmas of 2018, but the main crisis point and collapse was triggered in October 2019. In...

Planning a Prison Break – Breaking Free of Vendor Lock-in

by Ian J. Ghent | Jul 25, 2022 | Apache Spark Cafe, Cloud Computing, Python, SAS

It seems like everything these days is only for rent. The way we consume movies, music, books, computing, even the features in our cars, have all moved to be subscription model services, not products you buy and own. Pay once, own forever has increasingly become pay...

Foolishly Automating Or Automating Fools

by Ian J. Ghent | Mar 18, 2022 | Apache Spark, Apache Spark Cafe, Cloud Computing, Python, SAS

What an absurd title! A strange twist on the “working hard or hardly working” cliché, but read on and I promise it will all make sense. Here’s a tale of a court jester and his great invention. Long ago in a vast and complex kingdom, there lived a great king. The...

Why R is not ouR target – problems with the open source SAS competitor

by Ian J. Ghent | Mar 1, 2022 | Apache Spark, Apache Spark Cafe, Python, R, SAS

Back in 2015, when we set out to build SPROCKET, the World’s only SAS modernization solution, one key design question plagued our thoughts. Scalable, simple, fast and open-source, it was obvious from the early days of Apache Spark that it was analytics platform...

« Older Entries