With Scala Days 2015 San Francisco just around the corner (and only 15% of tickets left), I have been thinking quite a bit about how much the ecosystem has expanded since I first became involved with the conference in 2011.

The rapidly growing Scala community has evolved from a largely academic and research-oriented crew, with some early champions like Twitter and Foursquare, into one whose language has become a standard for enterprises, start-ups and universities alike.

But even as companies and individuals use Scala to build their own new ideas, they also use other excellent tools like Play Framework, Akka, Apache Spark and Kafka, which are not only some of the hottest tools and projects on the market right now, but were also intentionally built in Scala (for many reasons…).


In my previous post, "Why Enterprises of different sizes are adopting ‘Fast Data’ with Apache Spark", I gave a quick introduction to how massive petabyte data sets proved impossible to manage cost-effectively with traditional tools, which paved the way for Hadoop and NoSQL databases. Hadoop has traditionally been an environment for batch processing, while NoSQL databases have provided some subset of record-oriented CRUD operations. More recently, the need to process event streams has grown in importance. My Typesafe colleague Jonas Bonér calls this “Fast Data”.



A couple of weeks ago, Typesafe launched the results of a survey in which over 2,000 people were asked about the explosive adoption of Apache Spark. In the Slideshare presentation embedded above, you can see a sneak preview of some of the results of "Apache Spark: Preparing for the Next Wave of Reactive Big Data", but the full version has a lot more to offer. The Scala community is showing intense interest in Apache Spark as well (according to the report, 88% of Spark users are working in Scala, 44% in Java and 22% in Python). So, as the resident “Apache Spark guy”, I thought it would be nice to put the popularity of Apache Spark in context, looking at what led us here, how enterprises are reacting, and what the needs of the mid-market really are.


Back in September, we ran a survey to gather people’s thoughts and upgrade plans around Java 8. We were surprised to find that, among the 3,000 respondents, more than 17% were already using Apache Spark in production. Considering that Spark support from the major Hadoop vendors is only about a year old, that is a remarkably high number.