A Map of Akka

26.7.2015 | 8 minutes reading time

The amazing Akka project was started by Jonas Bonér in 2009 with the aim to bring the actor model , which has proven to deliver an availability of six nines (99.9999%) and even more, to the JVM. Akka, which is open source and available under the Apache 2 license , offers APIs for both Java and Scala. If you are interested in Akka’s history, take a look at the Akka 5 years anniversary blog post.

Over the years Akka has matured, is widely used and recently even won the 2015 JAX Award for Most Innovative Open Source Technology. Since its early days Akka has grown a lot which can easily been seen by looking at the number of sub-projects under the root project on GitHub .

So why should you consider using Akka? What does it offer? In this blog post we take a look at the most important sub-projects and their features from a bird’s eye perspective in order to give you an overview of Akka’s overall capabilities. We are planning – no promise made – to take some deep-dives in follow-up posts.

Akka Actors

The akka-actor module is Akka’s heart and soul, it’s the foundation on top of which all other modules and features are built. Essentially, it provides an implementation of the actor model without any notion of remoting, cluster awareness, persistence, etc.

Interestingly Jonas Bonér once told me that remoting, which initially was an integral part of Akka actors , would never be factored out into some sub-module – as you can see, some things change. What remained, though, was the design for distribution: In Akka, everything is distributed by default. The network and its peculiarities are not hidden away, but instead embraced.

So what does akka-actor, which defines actors as the fundamental building blocks of your programs, give you? Here are the main features:

Loose coupling through share-nothing and asynchronous messaging
Resilience because of compartmentalization and delegation of failure handling
Elasticity thanks to location transparency

Before we take a closer look at these features, we want to encourage you to read the Reactive Manifesto which describes typical requirements and traits of “modern” systems, e.g. highly available websites or other mission critical servers. It’s a quick read and while not entering completely unknown terrain, it defines a coherent vocabulary to talk about some of the things which matter in IT today.

Let’s get back to the features of Akka actors. Basically, in Akka everything is an actor and – according to Carl Hewitt, the inventor of the actor model, “one actor is no actor” – they come in systems. Actors share nothing, i.e. they discard the “shared” from “shared mutable state” – the root of all evil from a concurrency perspective. Actors exclusively communicate via asynchronous messages which – along with the share nothing approach – leads to rigorous decoupling and gives the other party the chance to be temporarily unavailable. Contrast that with synchronous method calls like known from mainstream imperative OO programming: Until the called object gets back with a return value, the caller is blocked. Ouch!

Another nasty thing which might happen when using synchronous method calls are exceptions. Well, on the one hand you know that something went wrong. But on the other hand it has become your responsibility to take action to fix the problem. To make this more obvious, think of a vending machine which took your money but didn’t deliver the snack you eagerly wanted to eat. What do you do? Maybe kick the machine, but certainly not fix it, that’s someone other’s job. Most probably you’ll survive without the snack or just try to find some other machine that works.

With actors, in the case of failure, you just don’t get an answer to your message – that’s like not getting your snack. But the failure is delegated to some other actor that supervises the faulty one, because in Akka every actor has a parent which supervises all its child actors. It’s the supervisor’s responsibility to decide how to proceed with the faulty actor, e.g. restart or stop it. As a result, communication – sending a message and hoping for a response – is decoupled from failure handling. That means that failure is restricted to the faulty actor and its supervisor and it doesn’t spread towards the caller. In other words, failure is compartmentalized, which means that only a part of the system is affected instead of the whole one.

Last but not least, it’s not important to know the physical location of an actor to talk to it – this is called location transparency. That’s because each actor has a logical address you use to talk to it; it’s physical location is hidden from you, decoupling you from it. Therefore, even if an actor resides on a remote node – which requires using Akka Remoting mentioned below – someone can send messages to the remote actor’s address without being aware of the fact that the actor isn’t part of the local actor system.

To sum it up, Akka actors enable you – while on a very low level – to write systems which are quite reactive. Of course you need distribution for real resilience and scalability, but Akka actors bring all the foundations needed – the rest is covered by other modules and features.

Akka Remoting

akka-remote is an extremely important module, because it enables remote communication and real location transparency. But apart from a couple of configuration settings, it keeps back and essentially works as an enabler.

If you want to enable remoting, you just have to override some default configuration settings:

akka {
  actor {
    // The default is "akka.actor.LocalActorRefProvider"
    provider = "akka.remote.RemoteActorRefProvider"
  }
  remote {
    netty.tcp {
      hostname = "127.0.0.1" // that's the default
      port     = 9001        // the default is 2552
    }
  }
}

Essentially, you just have to configure the RemoteActorRefProvider. This allows you to have actors deployed on remote actor systems, including remote death watch, failure detection, etc. While this is fantastic, it’s too low level for most cases, because it requires you to know the exact remote addresses of the collaborating actor systems.

Akka Cluster

This is where Akka Cluster – which is comprised of several modules, e.g. akka-cluster, akka-cluster-tools or akka-cluster-sharding – gets into the game. At its core it provides a membership service which allows actor systems to join and/or leave a cluster. Any actor can register as a listener for cluster events, e.g. MemberUp or MemberRemoved, which allows these actors to dynamically gain knowledge about potential remote communication partners. In order to provide a consistent view of the current cluster state, a distributed failure detector monitors the health of the individual member nodes and possibly declares member nodes unreachable which results in UnreachableMember events.

While you can use the cluster events directly, you most probably encounter them implicitly, because they are the foundation of a couple of higher-level features, e.g.:

Cluster-aware routers: routees can either be created or looked up on remote member nodes
Cluster Singleton: only one instance of a particular actor in the cluster
Cluster Sharding: distribute a potentially large number of actors across the member nodes
Distributed Data: consistent data replication without central coordination based on CRDTs

Akka Persistence

There are various reasons for an actor to get restarted, e.g. in reaction to a program failure (exception) or a hardware or network failure (remote nodes becoming unavailable). As actors totally hide away their internal state – if any –, the only general way to restore an actor into the same state after restarting is via sending it the same messages like before.

Obviously this is a great fit for Event Sourcing and that’s exactly what Akka Persistence is all about: restoring an actor’s state by applying the concepts of Event Sourcing. Therefore it distinguishes between commands and events. If a persistent actor receives a command, it might create an event, ask Akka Persistence’s journal – there are numerous journal backends, e.g. based on Cassandra or Kafka – to persist it and once that’s confirmed apply the event to its state. During recovery all the events are replayed which leads to the same state like before. Of course there’s also support for snapshots to avoid long recovery times for large numbers of events.

Akka Streams and Akka HTTP

Akka Streams and Akka HTTP are the new kids on the block. They are still experimental and not yet part of the “official” Akka distribution, meaning that they have their own version number – 1.0 at the time of writing this post. It’s planned to make them proper citizens of Akka 2.4, though, which is supposed to be released in the foreseeable future.

Akka Streams is an implementation of Reactive Streams which has been specified and implemented by a number of parties including Reactor, RxJava, Slick and Vert.x. Reactive Streams is all about asynchronous stream processing with non-blocking back pressure and Akka Streams – obviously – uses actors for the implementation.

A perfect use case for Akka Streams is Akka HTTP which is the evolution of the very successful spray project: A HTTP server accepts a stream of HTTP requests and produces a stream of HTTP responses. Also, the bodies of HTTP entities which essentially are one or even more chunks of data, can be nicely expressed as streams of bytes.

Conclusion

We have given an overview of a couple of Akka modules from the very low-level and essential Akka actors which “simply” implement the actor model up to high-level abstractions like Akka Cluster, Akka Persistence and Akka HTTP which are all built on top of the foundation provided by Akka actors. Therefore each of the modules gives you the benefits of the actor-model: loose coupling, resilience and elasticity.

As already mentioned we are planning to write some follow-up posts which cover the individual modules in greater depth. Questions and feedback are highly appreciated.

Was this post helpful?

Blog author

Heiko Seeberger

Do you still have questions? Just send me a message.

fromHeiko Seeberger

Scala Arrays – functional vs imperative

The Scala collections , which are part of the standard library, are known for their vast amount of high-level functional operations like map, flatMap, filter, sliding or groupBy, just to name a handful. These not only allow for high developer productivity...

Scala

15.2.2016 | 5 minutes reading time

Heiko Seeberger

Phantom Types in Scala

Inspired by a recent conversation with my former colleague Brendan McAdams and my current coworker Markus Hauck , I decided to put together a quick post about phantom types, a topic perfectly suited for demonstrating the power of the type system of ...

Scala

5.2.2016 | 5 minutes reading time

Heiko Seeberger

Getting started with Akka Cluster

In a previous part of this series about Akka we introduced the core abstraction provided by Akka: actors. Now we want to take a look at how these can be used in a cluster, i.e. in a distributed system. Cluster Membership The akka-cluster module provides...

Reactive Programming

11.1.2016 | 5 minutes reading time

Heiko Seeberger

Monads demystified

In this short post I want to take a look at monads from a pragmatic perspective, i.e. why and how monads can be useful for developers. I won’t talk about any theory, but instead show code examples in Scala. I’ll even call things monad which don’t fully...

Functional programming
Scala

8.12.2015 | 3 minutes reading time

Heiko Seeberger

Introduction to Akka Actors

In the first part of this series we gave you a high-level overview of Akka – now we are going to take a deep dive into the realm of Akka actors as provided by the akka-actor module which lay the foundations for all other Akka modules. As we believe...

Reactive Programming

16.8.2015 | 11 minutes reading time

Heiko Seeberger

Your job at codecentric?

Jobs

Agile Developer und Consultant (w/d/m)

Alle Standorte

Macro annotations in Scala 3

In a previous blog post we took a look at macro annotations in Scala 2, where they have been present for a while. Only recently they have been added to Scala 3 as well, specifically in the pre-release version 3.3.0-RC2 of the Dotty compiler. Same as...

Scala

4.4.2023 | 9 minutes reading time

Lukas Lehmann

Macro annotations in Scala 2

In this blog post we will take a look at macro annotations, a powerful tool for code transformation and generation in Scala. Macro annotations allow us to transform the code of a definition, e.g., a class or method, at compile time. This can be used ...

Scala

28.3.2023 | 12 minutes reading time

Lukas Lehmann

Hit me baby one more time – What are cache hits and why should you care...

MotivationWhen reasoning about algorithm performance we often look at complexity. Especially when comparing different algorithms, looking at asymptotic complexity (e.g. the big-O notation) is useful. We have to keep in mind, however, that the big-O ...

APM
Software development
Scala

6.12.2019 | 11 minutes reading time

Microbenchmarking your Scala code

Motivation I am sure you recognize this loading spinner icon. I do not know anyone who likes to wait for the computer. However, when writing software I usually favour readability, maintainability, and extensibility over speed. I agree with Donald Knuth...

Microservices
APM
Scala

29.11.2019 | 11 minutes reading time

Custom Akka HTTP PathMatcher

Akka HTTP provides some nice PathMatchers such as JavaUuid and number-specific matchers next to the string-based matchers to handle path parameters. But sometimes this is not sufficient. A path parameter has to be transformed into a specific data structure...

Reactive Programming
Java

7.11.2017 | 4 minutes reading time

JWT authentication with Akka HTTP

The authentication of RESTful APIs is quite an often asked question, so I decided to demonstrate basic authentication via JWT (JSON Web Token) in an example of an API built with Akka HTTP.JWT working conceptBefore we start with the actual coding, we ...

Reactive Programming
IT-Security
Scala

19.9.2017 | 6 minutes reading time

Gatling Load Testing Part 1 – Using Gatling

Gatling is a Scala-based load testing tool developed by the Gatling Corp. The tool itself is open source and can be found on GitHub . On top of the open part, an enterprise edition exists.Load tests in Gatling are written in Scala. The API for writing...

Testing
APM
Scala

20.6.2017 | 20 minutes reading time

Lookup additional data in Spark Streaming

When processing streaming data, the raw data from the events are often not sufficient. Additional data must be added in most cases, for example metadata for a sensor, of which only the ID is sent in the event.In this blog post I would like to discuss...

Software architecture
Scala
Big Data
Data
Streaming

1.6.2017 | 8 minutes reading time

Matthias Niehoff

Akka Best Practices: Defining Actor Props

Akka provides an implementation of the actor model for building reactive applications . So in Akka, an application is made up of actors rather than of plain old objects. When creating actors, we need to pass Props instances. So in this blog post I’...

Reactive Programming
Scala

10.3.2017 | 4 minutes reading time

Ad hoc polymorphism in Scala for the mere mortals

In this blog post we are going to discuss ad hoc polymorphism and the Type Class Pattern in Scala in very simple terms. No knowledge of algebraic structures is required. Starting with a simple function for adding a pair of integers, we will progress ...

Scala
Software development

23.2.2017 | 11 minutes reading time

SMACK stack from the trenches

This is going to be a sum-up of the experience gathered on various projects done with the SMACK stack. For details about the SMACK stack you might want to take a look at the following blog – The SMACK Stack – Hands on . Apache Spark – the S in SMACK...

Reactive Programming
NoSQL
Big Data
Messaging

19.1.2017 | 12 minutes reading time

Hello gRPC! (with ScalaPB)

gRPC is a modern RPC framework developed by Google. It picks up the traditional idea of RPC frameworks – call remote methods as easily as if they were local – while trying to avoid mistakes made by its predecessors and focusing on requirements of microservice...

Scala

10.1.2017 | 7 minutes reading time

IoT Analytics Platform

The Internet of Things a.k.a. the next industrial revolution is the current hype, but what kinds of challenges do we face with the consumption of big amounts of data? One variant is to collect all the data and do post processing in batches. However, ...

Cloud
IoT
NoSQL
Scala
Big Data

13.7.2016 | 15 minutes reading time

Spam classification using Spark’s DataFrames, ML and Zeppelin (Part 1)

This is the first entry in a series of blog posts about building and validating machine learning pipelines with Apache Spark . Its main concern is to show how to explore data with Spark and Apache Zeppelin notebooks in order to build machine learning...

Scala
Big Data
Data
Machine Learning

22.6.2016 | 16 minutes reading time

Swagger for Akka HTTP

In this post I’m hoping to give you a good insight in how to setup Swagger for Akka HTTP. Swagger is a tool to document REST API’s and overall works quite nicely to test some basic API commands. It already has integrations with many languages / REST...

Reactive Programming
API

31.3.2016 | 11 minutes reading time

Lazy Vals in Scala: A Look Under the Hood

Scala allows the special keyword lazy in front of val in order to change the val to one that is lazily initialized. While lazy initialization seems tempting at first, the concrete implementation of lazy vals in scalac has some subtle issues. This article...

Scala

24.2.2016 | 9 minutes reading time