Remult, a CRUD Framework for Fullstack TypeScript

MMS Founder
MMS Guy Nesher

Article originally posted on InfoQ. Visit InfoQ

Remult is a full-stack CRUD library that simplifies development by leveraging TypeScript models, providing a type-safe API client and query builder.

In software development, two data models must be managed and synchronized to ensure proper system functionality: server and client models. Server models specify the structure of the database and API, while client models define the data transmitted to and from the API.

However, maintaining separate sets of models and validators can result in redundancy, increased maintenance overhead, and the potential for errors when the models become out of sync.

Remult solves this problem by providing a single integrated model that defines the database schema, exposes simple CRUD APIs, and supports client-side integration, enabling developers to query the database from the front end while maintaining type safety.

Defining Entities

Remult utilizes decorators to transform basic JavaScript classes into Remult Entities. Developers can accomplish this easily by adding the Entity decorator to the class and applying the relevant field decorators to each property.

Using decorators, Remult simplifies the process of creating entities and their associated fields, making it more efficient and intuitive for developers. 

import { Entity, Fields } from "remult"

@Entity("contacts", {
  allowApiCrud: true
})

export class Contact {
  @Fields.autoIncrement()
  id = 0
  @Fields.string()
  name = ""
  @Fields.string()
  number = ""
}

Server Side Setup

To start using Remult, register it alongside the necessary entities with the chosen server. 

Fortunately, Remult provides out-of-the-box integrations for several popular server frameworks, including Express, Fastify, Next.js, Nest, and Koa.

import express from "express";
import { remultExpress } from "remult/remult-express";
import { Contact } from "../shared/Contact";

const app = express();

app.use(
  remultExpress({
    entities: [
      Contact
    ]
  })
);

Client Side Integration

After configuring the backend and entities, the next step is integrating Remult with the application’s front end. 

Fortunately, Remult’s client integration is designed to be library-agnostic, meaning it can operate over the browser’s built-in fetch API or an HTTP client such as Axios.
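
As a small sketch of that flexibility, the HTTP client can be swapped at application startup. The remult.apiClient configuration object used here follows Remult's documented client settings; treat the exact property names as assumptions to verify against the version in use.

import axios from "axios"
import { remult } from "remult"

// Route all repository calls through Axios instead of the default fetch transport
remult.apiClient.httpClient = axios

// Optionally point the client at a non-default API base path
remult.apiClient.url = "/api"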

To illustrate the client integration in a React application, consider the following example:

import { useEffect, useState } from "react"

import { remult } from "remult"
import { Contact } from "./shared/Contact"

const contactsRepo = remult.repo(Contact)

export default function App() {
  const [contacts, setContacts] = useState<Contact[]>([])

  useEffect(() => {
    contactsRepo.find().then(setContacts)
  }, [])

  return (
    <div>
      <h1>Contacts</h1>
      <ul>
        {contacts.map(contact => (
          <li key={contact.id}>
            {contact.name} | {contact.number}
          </li>
        ))}
      </ul>
    </div>
  )
}

This example demonstrates the ease and flexibility with which Remult can be incorporated into the front end of an application, allowing developers to seamlessly leverage the power and functionality of Remult across the entire stack.
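
Beyond the basic find() call shown above, the same typed repository covers filtered queries and the rest of the CRUD surface. The snippet below is a minimal sketch intended to run inside an async function; the where-clause operators such as $contains follow Remult's documented filter syntax, and the sample values are purely illustrative.

// Find contacts whose name contains "smith", newest first, ten per page
const smiths = await contactsRepo.find({
  where: { name: { $contains: "smith" } },
  orderBy: { id: "desc" },
  limit: 10,
  page: 1
})

// Insert a new contact; the returned object carries the auto-incremented id
const created = await contactsRepo.insert({ name: "Jane Doe", number: "555-1234" })

// Updates and deletes go through the same repository, keeping everything type-safe
await contactsRepo.save({ ...created, number: "555-9999" })
await contactsRepo.delete(created.id)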

Remult is open-source software available under the MIT license. Contributions are welcome via the Remult GitHub repository.



Podcast: Hien Luu on ML Principles at DoorDash

MMS Founder
MMS Hien Luu

Article originally posted on InfoQ. Visit InfoQ

Transcript

Introduction [00:17]

Roland Meertens: Welcome everyone to the InfoQ podcast. My name is Roland Meertens and I’m your host for today. I am interviewing Hien Luu, the head of ML platform at DoorDash. We are talking to each other in person at the QCon London conference, just after he gave the presentation Strategy and Principles to Scale and Evolve MLOps at DoorDash. Make sure to watch his presentation, as it contains many insights into the principles Hien Luu uses to improve the machine learning capabilities at DoorDash. During today’s interview, we will dive deeper into these principles, and I hope you enjoy it and you can learn from it. Hien Luu, welcome to the InfoQ podcast.

Hien Luu: Thank you for inviting me.

Roland Meertens: We are here at QCon London, and you just gave your talk about Strategy and Principles to Scale and Evolve MLOps at DoorDash. Welcome. Could you maybe give a three-minute summary of your talk?

Hien Luu: Sure. The summary is essentially about design for any company, any organizations, that are adopting machine learning operations, building out infrastructure, how they should think about what strategies to use, and which principles might be helpful for them as they’re building out infrastructure. The strategies are fairly a little bit biased and opinionated from my perspective and experience, but I think they fairly, at a high level, that most companies can consider leveraging for their specific organization, because each organization has their own specific ML use cases, the culture of their organization, the people that they work with, the maturity of their technologies. So hopefully it’s a framework that’s broad enough that they can use and apply it to their own specific organization’s needs. And similarly, the principles are also very, very high level as well in terms of how do they go about supporting the customers, how to make sure they’re making progress, how do they align with their stakeholders, decision makers and so on?

Roland Meertens: What are the principles you have, then? I think you had three principles, right?

ML Principles at DoorDash [02:15]

Hien Luu: Yeah. The first one is about think big, start small; and the second is about 1% better every day; and the third one is customer focus or customer obsessed, essentially.

Roland Meertens: I think one of the things I really took as an interesting fact from the start of your presentation was the 85% failure rate of machine learning projects, like 85% of the machine learning projects fail. What’s the main reason behind this? Why is this field so bad at creating projects?

Hien Luu: First, that statistic was called out I believe two or three years ago. That’s at the time when a lot of companies or enterprises are jumping onto leveraging the ML to add value to the products. Internet companies like Google, Facebook or Meta, or LinkedIn, they had done this for a long, long time; but enterprises, this is something new to them, and if they jump in without any proper planning or thinking, and the infrastructures are not there, and I think that’s part of the reasons why the failure rate is so high, because it’s new. It’s not something that you just do. You need to have the proper team, you got to have the right approach to applying ML, you need to have the right data, and you need to integrate that into the online products or services.

So it’s a whole structure of things that need to come together. It’s not something you can just do. So maybe that’s my guess in terms of why the failure rate is so high, but everything as an industry in the MLOps space, it has matured quite a bit in the last three years. There’s a lot of startups, a lot of money that went to that, and startups are now… The products are maturing, and also the best practices are also being shared and developed. So I think my hope is the next time somebody does the same survey, the number will be much lower.

What makes ML difficult? [04:11]

Roland Meertens: And do you think, if you want to point fingers, is it mostly an organization problem, is it simply that people don’t have an MLOps team, or are the data scientists not good enough at engineering?

Hien Luu: I would say it’s a combination of those things. It’s not just one, because each organization is very, very different. Where they start from, what talent they have, do they have enough data for their use cases or not? So I think for each one of those, it’s a little bit unique in their own, but the number one thing I would say it’s about the engineering mentality that was not there in order to productionalize machine learning models. Data scientists might build a model and they just throw it over a wall, for example, and there’s no reproducibility, right? There’s no way of treating ML artifacts as code. There’s no such mentality until the whole MLOps discipline has been codified or formalized in the last few years.

Roland Meertens: So it’s really also a bit of getting the discipline, getting more structure, maybe.

Hien Luu: Because evolving model is not a one-time thing, like other things, like other maybe applications. It’s a continuous basis of, you have version 1.0, but data changes, your use case might change as requirements might change. So they might have to go to and develop the second version of that, for example. And if the engineering disciplines are not followed, then the second time building that model would be really challenging. Data scientists might have left a company and nobody knows how to rebuild that model, for example.

Roland Meertens: Normally the second time you build something, it should be easier and faster to do, and not equally difficult.

Hien Luu: Right, exactly. And it might not be the case for ML, and that’s why it takes so long to get model to production, and some cases you don’t even get to production because of those reasons.

Concrete tips for improving [05:58]

Roland Meertens: So if we’re taking, then, your principle 1% better every day, do you have some tips on how people should get started? What are some concrete tips for data scientists to, with simple things, make their output way better?

Hien Luu: I think from the MLOps infrastructure, there’s a lot that can be done, but you don’t have to build everything at once. You want to build enough at the current state to unblock whatever it is blocking that data scientist or that organization of going from ideas to production. And then incrementally, over the months and quarters, then you build out the bells and whistles and the things that they need. But the initial point is just how do we get from ideas to production quickly, and building just enough for those tools? It might look ugly initially, but it actually unblocks somebody to get to the next stage. So it is all about not aiming for perfection, but building incrementally, just enough so the data scientist can be unblocked and do whatever they need to do.

Biggest blockers [06:58]

Roland Meertens: And do you see most problems, then, for example, with the data versioning or logging or compute, what are some of these main blockers you mostly see at DoorDash or at other companies?

Hien Luu: I can talk about DoorDash and then we can talk about other companies as well. I think the DoorDash story is quite interesting because of the high growth that we have gone through in the last few years, and that high growth creates a lot of interesting challenges in terms of the large volume of data that we need to be able to ingest, move it, and compute; and doing that reliably and with high quality. I think that’s what we’ve been learning to do better and better throughout quarters. Initially the infrastructure’s not quite there to be able to do that easily and in a cost-effective way. It’s not like we don’t know how to do, it’s just the growth is so fast. And sometimes we don’t have enough manpower to do it because it’s just the growth was tremendous in the last three years.

I think for other companies, obviously each organization is at a different state in terms of their adoption of the cloud. Do they have enough of a data engineering team? Do they have a solid experimentation platform? Do they have a way to easily get access to compute resources without requiring a data scientist to do all that low-level stuff needed to get to machines? So therefore, data scientists now cannot focus on their main task, but end up doing all this engineering work. So it depends on the size of the organization and where they are at in terms of their maturity at each of those infrastructures that are commonly needed for doing machine learning development.

Roland Meertens: And would you say that this is some kind of hidden tech debt where, if people didn’t set up their MLOps stack in the right way?

Hien Luu: I wouldn’t say tech debts yet, because a lot of these enterprises just starting in their journey, so it’s more about how to get there, because they don’t have the right infrastructure, tooling, data, talent for example. I think in the coming years, the bigger story will be about tech debts, but in the last two years, more about how to get from A to B than anything else.

How to improve as a company [09:07]

Roland Meertens: And is that then something where the company culture maybe has to change, where the company culture has to say, okay, we try to focus on hiring more of these people, or is it something else?

Hien Luu: Yeah, I think partially it’s like organizations that are new to machine learning, they don’t quite know how it works. They don’t quite know what the process looks like. They don’t quite know how machine learning can bring value. They read, they understand the report, but they don’t know, they haven’t gone through that experience. And they might have high expectations initially, but then the model, turns out, doesn’t do exactly quite what they want or expect, then they treat as a failure, for example. So machine learning is about iterations, getting better and better. Once you have more data you can train at larger scale and so on. So expecting the first model to hit the ball out of the park, I think that may be one of the reasons.

Roland Meertens: Yeah, I think that’s one of the things which I took away from your talk, which I found quite interesting. Whilst identifying and aligning the needs of customers and the different stakeholders and getting all the decision-makers together and setting the right expectations, do you have any tips for that, for either people who expect that you have a massive improvement with a new model, or how do you go about this?

Hien Luu: I think the first thing is aligning on what the business problem is in terms of applying machine learning to solve for that problem, and going through the process of determining whether ML is still the right solution or not. And then the second part is setting the expectation around the successful metrics, the business metrics that we want to use to evaluate that ML against, and then go through iterations of building a first version of the model and see if that actually meets, and then iterate from there. And also setting expectation about how long would it take either, because if there’s something that’s new to an organization, they might not know. And the expectation might come from building online applications, which is a very different process than machine learning because there’s a lot of unknowns in machine learning until you get down to it.

Roland Meertens: I think also one thing in machine learning is that you are not just dealing with an algorithm which gives you the right output, but you will also have to deal with false positives and false negatives.

Hien Luu: Yes.

Roland Meertens: How do you work with stakeholders to get them to decide what to do with these cases, for example?

Hien Luu: It comes back to that educating the stakeholder about machine learning itself and the uncertainty involved in machine learning. And then also the next step I would say is about setting expectation about iteration. I think iteration is probably one of the key things. Setting the right expectation up front that the first model might not work out well, and it will take time, and I think that’s maybe hard for business to understand initially, but that is something that needs to be expressed and shared. Otherwise, the expectation’s going to be there, and then frustrations will come if the first model doesn’t work out the way it was expected.

Roland Meertens: I think the other difficulties always that’s in machine learning, the boundary between what’s possible and what’s impossible is razor-thin. Do you have any way to set the right expectations, then?

Hien Luu: If you hire experienced data scientists, they can bring their stories, they can bring their experience, or maybe look at use cases that are reported out there in the industry in terms of how that went. I think that using those data points maybe help these organizations to lower their expectation, or at least be educated about these scenarios where things might not work out initially.

Roland Meertens: The other thing which I think was funny in your talk was this customer obsession, and you were talking about the french fry moment. What is a french fry moment?

Hien Luu: This phrase was coined by the folks from Google, essentially delighting your customers with solutions that they don’t expect. The french fry moment is about, if somebody orders a burger and they don’t order fries, and you delighting them with delivering that french fry with them, because they go together, the burgers and fries normally go together. So it’s about delighting your customers with solutions that they don’t prompt for it, or they might not know that they need it. But once you built it and show it to them, they say, wow, this is great. That’s what I call a french fry moment. It’s not something they come and ask you to build it. It’s based on what you observe, based on what you know, and you come up with the idea and built it and then show it to them.

Feature stores [13:48]

Roland Meertens: If we get a bit more technical, you mentioned that you’re using Redis as a feature store, and you also have some tips to improve it. So how do you, like for example, using Redis as a feature store, and did you think of any other alternatives?

Hien Luu: Our journey of our feature store is quite fascinating. For our use cases, for online prediction use cases, low latency is one of the key requirements. So therefore there are solutions out there besides Redis, like Cassandra, but for us, Redis has been working really well in our company, and we natively support that within the infrastructure team, so it’s easy to get that going. So Redis is in-memory, the majority of it, and then it’s very suitable for low latency use cases. Do all use cases need low latency? Maybe not.

So therefore there are certain sets of use cases that still need low latency but not very, very low. Then we can consider our other options that are more cost-effective, essentially. And we have evolved our feature store to support also another storage engine called CockroachDB in the last few years. So we have a mix of these backend storages, Redis and CockroachDB. So depending on the use case and the needs, we might send those features to the right storage engine, and can still meet the latency requirements, and then can be more cost-effective as well.

Roland Meertens: Interesting. And so at the end of the talk, you were talking about some future-oriented things. What do you think are the key trends and advances that companies or MLOps teams should anticipate in the near future?

Hien Luu: It also depends on the need of the company as well, because for us, obviously we have more and more data scientists, we have more and more data, and there’s a need for us to increase the performance of the model. And deep learning is one of the techniques to do that. So from our perspective is how do we support model training at scale, and using GPUs for example. And then similarly on the prediction side, how do we support prediction with low latency for those large, complex deep learning models? So there’s many, many techniques for that, but also one of the easiest ways is leveraging GPUs again. So that is going to be our focus in 2023, is how to support that across training and model serving for deep learning models.

GPU Computing [16:08]

Roland Meertens: And to end with, do you have any tips, then, for people who want to get started? Do you have any specific tools which you like to use here to get these models in production or to get to leverage your GPUs?

Hien Luu: So GPUs are not cheap, I’m sure you are aware. So it’s also coming back to the ROI of a use case, if it makes sense to leverage GPU or not, based on the impact that use case might have to the company. Obviously money, it’s all companies worry about. So it’s all about making sure you’re using the right GPUs for the right use cases. So that’s going to be that learning process of testing out a model on GPUs and see what the performance differences would be like, and using the tooling to understand how efficient those GPUs are being used. They’re not sitting idle for that use case. We are going to be using a lot of tooling to help us understand what is the right use case for GPUs, and how efficient are we using the GPUs for that use case. So that will be a very key aspect as we move forward.

Roland Meertens: So do you actively try to deploy the same model, then, maybe on a CPU and a GPU, and you see what the impact is on your business?

Hien Luu: Correct, exactly. That’s going to be one of the steps as we move into this world leveraging more and more GPUs. It’s understanding would they reduce latency, how much business impact would that bring to the company, and whether that would justify the additional costs of GPUs?

Roland Meertens: Interesting. Do you have any tips for specific tooling here, which I can try out or which people can try out?

Hien Luu: Yeah, I think there’s toolings that from, again, we’re still early in this journey, but there are toolings available from NVIDIAs for profiling the GPUs and such, that you can do a Google search or whatever that might be to find out. For us, it’s still in… We are at the beginning of this journey, so we’re still in a learning phase there.

Roland Meertens: Thank you very much for joining the podcast. Enjoy the rest of your conference.

Hien Luu: Thank you.



Kubernetes 1.27 Released with Freeze on Legacy Image Registry and In-Place Update of Pod Resources

MMS Founder
MMS Mostafa Radwan

Article originally posted on InfoQ. Visit InfoQ

The Cloud Native Computing Foundation (CNCF) released Kubernetes 1.27 with the name “Chill Vibes”. The release has new features, such as In-place Update of Pod Resources, Limit of Parallel Image Pulls, and Multiple Service CIDRs.

Also, there are beta features included in the release, such as automatic removal of PVCs created by StatefulSets, addition of a new subresource flag to kubectl commands, and Component Health SLIs.

Several features have been marked generally available or stable, such as the freeze on the legacy container image registry k8s.gcr.io, default usage of seccomp profiles, and support for OpenAPI v3.

In version 1.27, the SecurityContextDeny admission plugin is deprecated and Azure disk in-tree storage plugin is removed.

In the new release, there is a freeze on the usage of the legacy container image registry, which is hosted on a custom Google Container Registry domain. Since March 20th, traffic from k8s.gcr.io has been redirected to the community-owned registry.k8s.io with the intention of sunsetting k8s.gcr.io. The change should provide users with high availability and reduced latency by hosting the community container registry on multiple cloud providers instead of a single entity.

Xander Grzywinski from the release team and senior product manager at Microsoft commented on this particular feature.

I know we have been talking a lot about this throughout the whole release cycle, and I’m sure some folks are tired of hearing about it, but it bears mentioning again. The 1.27 images are not going to be published to the old registry. Any requests to the old registry will be redirected to the new one, but if you’re behind a proxy, there’s work that needs to be done.

In-place Update of Pod Resources has been introduced to provide the ability to change container resource requests and limits without having to restart the pod. The feature is turned off by default, and it allows users to specify, via the container's resizePolicy field, whether a restart is required when a resource is resized.

Mark Rossetti from the release team and principal software engineer at Microsoft commented on this.

A lot of people have been waiting for this feature. With this functionality, you can add or subtract resources to containers without needing to restart them. This will help a lot of workloads scale faster and with fewer interruptions.

Also in the new release, a limit can be set on the number of parallel image pulls performed by the kubelet, to avoid consuming too much network bandwidth and disk I/O, which can degrade the performance of the cluster.

In addition, there are no longer size limitations associated with the service-cluster-ip-range flag when it comes to the internal IP addresses of service CIDRs. This is turned off by default, and it enables users to easily inspect the IP addresses assigned to their services.

StatefulSet PVC Auto-Deletion moved to beta in version 1.27 and is turned on by default. In the past, when a StatefulSet was deleted, the associated PVCs and the underlying volumes were retained. This generated manual work and incurred costs for the unused storage.

The --subresource flag has been added to kubectl in beta, allowing users to fetch and update the status and scale subresources of API resources. More subresources will be added in the coming releases.

SLI metrics for Kubernetes components graduated to beta, allowing users to monitor and measure the availability of Kubernetes internals. Two metric types will be exposed for each component: gauge and counter.

Using a seccomp profile by default became generally available in this release. This feature improves the security of pod containers in Kubernetes by executing them with a seccomp profile restricting allowed system calls.

Support for OpenAPI v3, a language-agnostic standard interface to APIs, became generally available and enabled by default. This allows Kubernetes to publish a description of APIs in the OpenAPI v3 format.

Kubernetes is an open source container orchestration software for deploying and managing containerized applications at scale.

According to the release notes, Kubernetes version 1.27 has 60 enhancements including 18 entering alpha, 13 becoming generally available or stable, and 29 graduating to beta. In addition, 14 features are being deprecated or removed.

CNCF held a webinar on April 14, 2023, to discuss the changes from the release team.



Java News Roundup: JEPs for JDK 21, Spring Cloud AWS 3.0, OptaPlanner to Timefold

MMS Founder
MMS Michael Redlich

Article originally posted on InfoQ. Visit InfoQ

This week’s Java roundup for May 1st, 2023 features news from OpenJDK, JDK 21, Spring Boot 3.1.0-RC2, Spring Modulith 0.6, Spring Cloud for Amazon Web Services 3.0.0, Spring Cloud Data Flow 2.10.3, Spring Tools 4.18.2, Infinispan 14.0.9.Final, Open Liberty 23.0.0.4, Quarkus 3.0.2 and 2.16.7, Helidon 3.2.1, Apache Camel 4.0.0-M3, Arquillian 1.7.0 and OptaPlanner transitions to Timefold.

OpenJDK

JEP 448, Vector API (Sixth Incubator), has been promoted from Candidate to Proposed to Target for JDK 21. This JEP, under the auspices of Project Panama, incorporates enhancements in response to feedback from the previous five rounds of incubation: JEP 438, Vector API (Fifth Incubator), delivered in JDK 20; JEP 426, Vector API (Fourth Incubator), delivered in JDK 19; JEP 417, Vector API (Third Incubator), delivered in JDK 18; JEP 414, Vector API (Second Incubator), delivered in JDK 17; and JEP 338, Vector API (Incubator), delivered as an incubator module in JDK 16. This feature proposes to enhance the Vector API to load and store vectors to and from a MemorySegment as defined by JEP 424, Foreign Function & Memory API (Preview). The review is expected to conclude on May 9, 2023.

JEP 445, Unnamed Classes and Instance Main Methods (Preview), has been promoted from Candidate to Proposed to Target status for JDK 21. This feature JEP, formerly known as Flexible Main Methods and Anonymous Main Classes (Preview) and Implicit Classes and Enhanced Main Methods (Preview), proposes to “evolve the Java language so that students can write their first programs without needing to understand language features designed for large programs.” This JEP moves forward the September 2022 blog post, Paving the on-ramp, by Brian Goetz, Java language architect at Oracle. Gavin Bierman, consulting member of technical staff at Oracle, has published the first draft of the specification document for review by the Java community. The review is expected to conclude on May 12, 2023. InfoQ will follow up with a more detailed news story.

JEP 441, Pattern Matching for switch, has been promoted from Candidate to Proposed to Target for JDK 21. This JEP also finalizes this feature and incorporates enhancements in response to feedback from the previous four rounds of preview: JEP 433, Pattern Matching for switch (Fourth Preview), delivered in JDK 20; JEP 427, Pattern Matching for switch (Third Preview), delivered in JDK 19; JEP 420, Pattern Matching for switch (Second Preview), delivered in JDK 18; and JEP 406, Pattern Matching for switch (Preview), delivered in JDK 17. This feature enhances the language with pattern matching for switch expressions and statements. The review is expected to conclude on May 11, 2023. InfoQ will follow up with a more detailed news story.

JEP 440, Record Patterns, has been promoted from Candidate to Proposed to Target for JDK 21. This JEP finalizes this feature and incorporates enhancements in response to feedback from the previous two rounds of preview: JEP 432, Record Patterns (Second Preview), delivered in JDK 20; and JEP 405, Record Patterns (Preview), delivered in JDK 19. This feature enhances the language with record patterns to deconstruct record values. Record patterns may be used in conjunction with type patterns to “enable a powerful, declarative, and composable form of data navigation and processing.” Type patterns were recently extended for use in switch case labels via: JEP 420, Pattern Matching for switch (Second Preview), delivered in JDK 18, and JEP 406, Pattern Matching for switch (Preview), delivered in JDK 17. The most significant change from JEP 432 removed support for record patterns appearing in the header of an enhanced for statement. The review is expected to conclude on May 11, 2023. InfoQ will follow up with a more detailed news story.

JEP 439, Generational ZGC, has been promoted from Candidate to Proposed to Target for JDK 21. This JEP proposes to “improve application performance by extending the Z Garbage Collector (ZGC) to maintain separate generations for young and old objects. This will allow ZGC to collect young objects, which tend to die young, more frequently.” The review is expected to conclude on May 10, 2023. InfoQ will follow up with a more detailed news story.

JEP 404, Generational Shenandoah (Experimental), has been promoted from Candidate to Proposed to Target status for JDK 21. This JEP proposes to “enhance the Shenandoah garbage collector with generational collection capabilities to improve sustainable throughput, load-spike resilience, and memory utilization.” Compared to other garbage collectors, such as G1, CMS and Parallel, Shenandoah currently requires additional heap headroom and has a more difficult time recovering space occupied by unreachable objects. The review is expected to conclude on May 12, 2023. InfoQ will follow up with a more detailed news story.

JEP 450, Compact Object Headers (Experimental), has been promoted from its JEP Draft 8294992 to Candidate status. Under the auspices of Project Lilliput, this JEP proposes to reduce the size of Java object headers from 96 or 128 bits to 64 bits. Project Lilliput, created by Roman Kennke, principal engineer at Amazon Web Services, reached milestone 1 in May 2022 by achieving 64-bit headers.

Daniel Smith, Programming Language Designer at Oracle, has announced that JEP 401, formerly known as Null-Restricted Value Object Store (Preview) and Primitive Classes (Preview), has been renamed to Flattened Heap Layouts for Value Objects. Smith has provided an updated specification document for review by the Java community.

JDK 21

Build 21 of the JDK 21 early-access builds was also made available this past week featuring updates from Build 20 that include fixes to various issues. Further details on this build may be found in the release notes.

For JDK 21, developers are encouraged to report bugs via the Java Bug Database.

Spring Framework

The second release candidate of Spring Boot 3.1.0 ships with new features such as: change the default shutdown in the DockerComposeProperties class to stop; automatically apply the TestcontainersLifecycleApplicationContextInitializer class for context tests; and the addition of Docker Compose service connection support for the SQL Server, Oracle Database, Liquibase, Flyway and Cassandra databases. There was also a deprecation of the Couchbase SSL keystore properties, spring.couchbase.env.ssl.key-store and spring.couchbase.env.ssl.key-store-password, in favor of SSL bundle support in Couchbase. More details on this release may be found in the release notes.

The release of Spring Modulith 0.6 delivers bug fixes, dependency upgrades and notable new features such as: auto-configuration for MongoDB transactions if the event publication registry is used; the event publication registry now enables asynchronous processing and shutdown behavior; the @EnableScenario annotation for using the Scenario Testing API with @SpringBootTest integration tests; and support for jMolecules architecture stereotypes in the Application Module Canvas. The Spring Modulith team has also decided to elevate this project into a top-level, non-experimental Spring project. The plan is to release a 1.0-M1 version after the GA release of Spring Boot 3.1. Further details on this release may be found in the release notes.

Version 3.0.0 of Spring Cloud for Amazon Web Services has been released with new features: compatibility with Spring Boot 3.0; built on top of the AWS SDK for Java V2; a completely re-written SQS integration module; and a new integration with DynamoDB. More details on this release may be found in the release notes.

The release of Spring Cloud Data Flow 2.10.3 primarily addresses security issues in transitive dependencies such as: spring-security-oauth2-client-5.4.2; spring-expression-5.2.11; spring-webmvc-5.3.25; json-smart-2.3; and jettison-1.51. There were also dependency upgrades to Spring Boot 2.7.11 and Spring Cloud sub-projects. Further details on this release may be found in the release notes.

Spring Tools 4.18.1 has been released featuring enhancements such as: support for navigating to a Spring property file when inspecting on an @Value annotation; support for the @ConditionalOnProperty annotation in property navigation; and early access to Eclipse 2023-06 milestone builds. The Spring Tools team anticipates version 4.19.0 to be released in late June 2023. More details on this release may be found in the release notes.

Infinispan

Infinispan 14.0.9.Final has been released with notable changes such as: fix the failure of Infinispan third party integration tests with JDK17; document how to monitor cross-site replication; remove the dependency Jaeger test containers; and fix the port number in the properties file. Further details on this release may be found in the changelog.

Open Liberty

IBM has released Open Liberty 23.0.0.4 featuring: container images for the ARM64 architecture along with the existing AMD64, PPC64LE and S390X architectures; and a resolution for CVE-2023-24998, a vulnerability in Apache Commons FileUpload in which an attacker can trigger a denial-of-service with malicious uploads because the number of processed request parts is not limited.

Quarkus

Quarkus 3.0.2.Final, the second maintenance release, ships with notable changes such as: rename the server-list file to hosts in the Infinispan Dev Services guide; Dev UI2 displaying the wrong Java version; the k3s flavor name is not properly documented in the Kubernetes Dev Services guide; and RESTEasy Reactive streaming resource methods leads to NoSuchMethodException exception in native mode. More details on this release may be found in the release notes.

Quarkus 2.16.7.Final has also been released featuring: a fix for the algorithm comparison bug in OIDC code loading the token decryption key; a minor update to the OIDC UserInfo class throwing NullPointerException if a string or boolean property with a given name does not exist; Quarkus dev mode not working with a certain type of project directory tree when using the @ApplicationScoped annotation; and throw an exception if the OIDC client fails to acquire a token. Further details on this release may be found in the release notes.

Helidon

Oracle has released Helidon 3.2.1 with new features such as: an enabled flag to the JpaExtension class to permit subsequent refactoring and replacement; integration changes with the MicroProfile Rest Client and Fault Tolerance specifications to handle async calls due to an issue with the default invocation context in the Weld specification; and support for different propagators with integration of Jaeger OpenTelemetry. More details on this release may be found in the release notes.

Apache Software Foundation

The third milestone release of Apache Camel 4.0.0 features bug fixes, dependency upgrades and improvements such as: change the default Micrometer meter names to follow the Micrometer naming conventions; support for Micrometer Observation; directly use the HTTP server in the implementation of Spring Boot; and add a listener for added/removed HTTP endpoints that make it easier for runtimes, such as Spring Boot, to use platform-http with Camel and its own HTTP server. Further details on this release may be found in the release notes.

Arquillian

Arquillian 1.7.0.Final has been released featuring: support for Jakarta Servlet 6.0; support for HTTPS in URLs injected with the @ArquillianResource annotation; and a fix for a NoClassDefFoundError exception from the LoggerFactory class when using TestNG 7.5+. More details on this release may be found in the changelog.

OptaPlanner Transitions to Timefold

OptaPlanner, an open source AI constraint solver for software developers, will transition to Timefold, a new planning optimization company created by Maarten Vandenbroucke, co-founder and CEO, and Geoffrey De Smet, co-founder and CTO. Created by De Smet while working at Red Hat as a senior principal software engineer, OptaPlanner has matured under the auspices of Red Hat by providing their own build of OptaPlanner. InfoQ will follow up with a more detailed news story.



.NET Community Toolkit 8.2: MVVM Toolkit Attributes, Performance Enhancements, and More

MMS Founder
MMS Almir Vuk

Article originally posted on InfoQ. Visit InfoQ

Microsoft has released the latest version of its .NET Community Toolkit, version 8.2, with several new enhancements and features. According to the release, this new version brings performance enhancements both at runtime and in the MVVM Toolkit source generators, new code fixers aimed at boosting productivity, and a number of user-requested features.

The first notable addition to the MVVM Toolkit is the support for custom attributes with RelayCommand. This was suggested by users on GitHub and builds on the work done in the previous release. The new version leverages the native field: and property: C# syntax to indicate targets of custom attributes, giving users full control over attributes for all generated members when using RelayCommand to generate an MVVM command. This feature is particularly useful when using a view model that needs to support JSON serialization and requires the ability to explicitly ignore generated properties.

The original blog post provides the following code examples regarding the custom attributes with RelayCommand.

[RelayCommand]
[property: JsonIgnore]
private void DoWork()
{
    // other code.
}

As a result, the previous code sample will produce the following generated members.

private RelayCommand? _doWorkCommand;

[JsonIgnore]
public IRelayCommand DoWorkCommand => _doWorkCommand ??= new RelayCommand(DoWork);

Furthermore, in the 8.2 release of the Toolkit, developers can now take advantage of two new property change hooks that have been added to the ObservableProperty fields. These two new hooks aim to address a common scenario in MVVM, where an observable property such as a “selected item” needs to be modified, requiring changes to both old and new instances.

Sergio Pedri, Software Engineer II on the Microsoft Store client team and author of the original blog post, states the following:

Previously, this was a scenario where using [ObservableProperty] wasn’t ideal, as it didn’t have the necessary infrastructure to easily inject such logic to perform the necessary state changes on the old and new values being set. To fix this, starting from the 8.2 release of the MVVM Toolkit there are two new property change hooks being generated for all [ObservableProperty] fields.

The blog post also provides detailed code examples regarding the usage of the ObservableProperty attribute and it is worth checking out.

By using the ObservableProperty attribute, developers can ensure that the selected view model will always be correctly reported as being selected. The attribute now includes built-in support for this functionality, eliminating the need for fallback methods. The MVVM Toolkit is also optimized to automatically detect usage of this attribute and optimize code generation accordingly. Additionally, the Roslyn compiler will remove calls to any partial methods that are not implemented.

According to the original blog post, the MVVM Toolkit has introduced new diagnostic analyzers in its latest release, which can detect and warn users when they access a field marked with the ObservableProperty attribute incorrectly, or when they declare a type with similar attributes without using inheritance. Moreover, the latest release of the toolkit also includes built-in code fixers for these two analyzers. As per the report, users can now easily fix their code by selecting the code fix suggested by the IntelliSense light bulb whenever the analyzers produce a warning. Additionally, the code fixers support bulk fixes, enabling users to rectify all their errors with just one click.

Regarding the MVVM Toolkit, this version also brings some performance improvements to its source generators. The primary focus was on optimizing the incremental pipelines to minimize memory usage and ensure that no unnecessary objects would be kept alive across concurrent executions. Several pull requests were made, including moving two additional diagnostics to a diagnostic analyzer, which can run concurrently and out of process. Other changes include removing some Roslyn symbols from the incremental pipeline and resolving necessary analyzer symbols early during the initial callback setup to speed up callback executions in each compilation instance.

Lastly, this release includes enhancements and fixes, such as resolving build errors in VB.NET projects and fixing forwarded double attribute parameters. The release now supports partial methods with RelayCommand and open generic types in ToTypeString. In addition, MemberNotNull is now emitted in ObservableProperty setters, and complete XML docs are available for all generated types and members for better understanding.

The complete list of changes made in this release can be viewed on the GitHub release page.



Vercel Announces New Storage and Security Offerings for the Edge

MMS Founder
MMS Steef-Jan Wiggers

Article originally posted on InfoQ. Visit InfoQ

Vercel recently announced a suite of serverless storage offerings for their cloud platform with Vercel KV, Postgres, and Blob, powered by the company’s infrastructure partners, Neon and Upstash. In addition, the company also launched  Vercel Secure Compute, Visual Editing, and Spaces.

The new storage offerings are intended to provide frontend developers building applications for the edge with fast and efficient access to data. In particular, these offerings are:

  • Vercel KV (Key-Value) is a serverless Redis solution powered by Upstash, allowing developers to create Redis-compatible databases that can be written to and read from Vercel’s Edge Network in regions they specify (a short usage sketch follows this list).
  • Vercel Postgres is a serverless SQL database built for the front end powered by Neon, providing developers with a fully managed, highly scalable, fault-tolerant database that delivers high performance and low latency for web applications.
  • Vercel Blob is a solution to upload and serve files at the edge currently in preview. The offering provides an API built entirely on top of web standards without the need to configure buckets or implement SDKs.
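
As an indication of what the KV offering looks like in application code, the sketch below assumes the @vercel/kv client package and a Next.js App Router route handler; the key names and returned payload are illustrative only.

import { kv } from "@vercel/kv"

export async function GET() {
  // Count page views with a plain Redis-style increment
  const views = await kv.incr("pageviews")

  // Store and read back a JSON-serializable value
  await kv.set("last-visit", { at: new Date().toISOString() })
  const lastVisit = await kv.get("last-visit")

  return Response.json({ views, lastVisit })
}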

Source: https://vercel.com/blog/vercel-storage

In a company blog post, the authors explain the rationale behind these storage offerings:

As the world moves away from monolithic architectures to composable ones, there’s no shortage of options for backends and databases. But for new projects, the choice can still be paralyzing. In the spirit of being the end-to-end solution for building on the web, we are introducing solutions that are open, easy to use, and scale as efficiently as our frontends.

Furthermore, the company has additional offerings, which include:

  • Vercel Secure Compute allows developers to create private connections between serverless functions and protect their backend cloud to increase security, compliance, and privacy obligations on the Web. Accompanying this offering is Vercel Firewall.
  • Vercel Visual Editing resulted from a partnership with Sanity to introduce a new open standard for content source mapping for headless CMSs, which allows live visual editing of content directly on a website and provides a tunnel directly to the content’s source.
  • Vercel Spaces provides powerful tools and conventions designed to integrate with a developer’s monorepo setup to help scale efficiently while retaining quality.

Malte Ubl, CTO at Vercel, told InfoQ:

Developers need tools to build and deploy applications at scale. Our new offerings, which include Storage databases, Vercel Secure Compute, Live Editing, and Vercel Spaces, provide the frontend ecosystem with durable and seamless solutions allowing them to innovate faster. Our expanded suite of cloud-native tools ladders up to our vision for the Frontend Cloud, empowering developers to build, test and deploy high-quality web applications efficiently.

Various people responded to announcements from Vercel; Yehuda Fruchter, an internet entrepreneur and investor, tweeted:

Vercel is becoming a monolith.

With Guillermo Rauch, CEO at Vercel, responding:

Everything about the platform remains composable. In fact, today’s announcements are about precisely that: meeting you where you are backend-wise, securely.

Followed by Fruchter’s response:

Yeah, sure I get that, but practically-speaking this will ultimately be Vercel controlling everything end-to-end. I don’t think that’s a problem, frankly, as the industry needs one-player to “consolidate” all these microservices b/c there r too many options.

Lastly, the pricing details of the offering are available on the respective pricing pages: Vercel KV pricing page, Vercel Postgres pricing page, Vercel Blob pricing page, Vercel Secure Compute, Vercel Visual Editing, and Spaces via Enterprise plans.



Diving into AWS Databases: Amazon RDS and DynamoDB Explained – The New Stack

MMS Founder
MMS RSS

Posted on nosqlgooglealerts. Visit nosqlgooglealerts

A look at the differences between these popular options, and between relational and nonrelational databases.



The increasing demand for data processing and manipulation has led organizations to prioritize efficiency, security, scalability and availability in their data management strategies. Companies that leverage top tools in the market enjoy a higher chance of success, which is why Amazon Web Services (AWS) should be a top contender when considering database storage solutions.

AWS offers an extensive range of database services for various use cases. In this article, we’ll explore the primary differences between two popular options:

  1. Relational Database Service (RDS)
  2. DynamoDB

Before we delve into the specifics of each service, let’s first look at the distinctions between relational and nonrelational databases.

Relational (SQL) vs. Nonrelational (NoSQL) Databases

Relational databases use predefined schemas and store data in rows and columns, resembling a spreadsheet. In contrast, nonrelational databases like DynamoDB feature dynamic schemas, are document-oriented and scale horizontally.

Relational (SQL) Databases

Relational databases employ Structured Query Language (SQL) for interaction and use predefined schemas. SQL is a widely recognized query language familiar to most database administrators (DBAs).

SQL databases contain tables with columns (attributes) and rows (records) and use keys with constrained logical relationships to maintain consistency and integrity. These databases can be scaled vertically by enhancing processing hardware power (for instance increasing RAM, CPU or solid-state disks).

Advantages of relational (SQL) databases:

  • Use of SQL language
  • Atomicity of database operations
  • Flexible query capabilities

Disadvantages of relational (SQL) databases:

  • Requires careful upfront design
  • Schema changes may cause downtime
  • Limited horizontal scalability

Nonrelational (NoSQL) Databases

NoSQL databases are nonrelational database management systems with dynamic schemas for unstructured data. These databases can be categorized based on their data models:

  • Document
  • Key value
  • Wide column
  • Graph

NoSQL databases are suitable for large volumes of data or frequently changing data sets. Document databases are particularly useful for handling vast amounts of unstructured data. Unlike SQL databases, which scale vertically, NoSQL databases scale horizontally, making it easier to expand capacity by adding more servers or nodes.

Advantages of nonrelational (NoSQL) databases:

  • Easy scalability and high availability
  • Flexible database models
  • High performance

Disadvantages of nonrelational (NoSQL) databases:

  • Some databases lack atomicity and data integrity
  • Absence of standardization

Now that we’ve covered the basics of relational and nonrelational databases, let’s examine the database options provided by AWS.

Amazon RDS (SQL) vs. DynamoDB (NoSQL)

Both RDS and DynamoDB are fully managed by AWS, meaning the company handles the underlying operating system and core components. AWS automates routine tasks such as provisioning, patching, backup, recovery, failure detection and repair, reducing administrative overhead.

Let’s examine the details of each service.

Amazon RDS

Amazon RDS enables users to set up, operate and scale relational (SQL) databases on AWS. It simplifies replication to improve availability and reliability for production workloads. AWS offers six SQL-based database engine options:

  • Amazon Aurora
  • MySQL
  • MariaDB
  • PostgreSQL
  • Oracle
  • Microsoft SQL Server

AWS provides various instance types with differing combinations of CPU, memory, storage and networking capacity to suit workload requirements.

Amazon RDS features:

  • Multi-Availability Zone (Multi-AZ) deployment for high availability
  • Read replicas for read-heavy workloads
  • Automatic backups and patching
  • Monitoring
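As a rough illustration of how such an instance might be provisioned programmatically, here is a minimal sketch using the AWS SDK for Python (boto3). The identifiers, engine, instance class and credentials are placeholders; a real deployment would also configure networking, security groups and secrets management.

import boto3

rds = boto3.client("rds", region_name="us-east-1")

# Provision a small PostgreSQL instance with Multi-AZ for high availability.
# All names and sizes below are illustrative placeholders.
rds.create_db_instance(
    DBInstanceIdentifier="example-postgres",
    Engine="postgres",
    DBInstanceClass="db.t3.micro",
    AllocatedStorage=20,          # GiB
    MasterUsername="admin_user",
    MasterUserPassword="change-me-please",
    MultiAZ=True,                 # standby replica in a second Availability Zone
    BackupRetentionPeriod=7,      # automated backups kept for a week
)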

Both AWS RDS and DynamoDB provide businesses with fully managed cloud-service options. AWS or a managed cloud-service company, such as Mission, takes care of routine tasks like provisioning, patching, backup, recovery, failure detection and repair. Ultimately, the choice between these two services depends on your specific needs and preferences.

RDS is often favored for enterprise resource planning (ERP), customer relationship management (CRM), financial data and transactional applications. It enables you to establish, operate and scale relational (SQL) databases on AWS, offering a variety of instance types to choose from.

DynamoDB

On the other hand, AWS DynamoDB is a serverless solution that automatically adjusts table capacity to accommodate demand, requiring no administrative effort from you. Typical use cases for DynamoDB include real-time bidding, shopping carts, mobile applications and high I/O requirements.
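To illustrate the serverless, pay-per-request model, here is a minimal boto3 sketch that creates an on-demand table and writes a shopping-cart item. The table name, key and item contents are illustrative only.

import boto3

dynamodb = boto3.resource("dynamodb", region_name="us-east-1")

# On-demand (PAY_PER_REQUEST) tables scale read/write capacity automatically with traffic.
table = dynamodb.create_table(
    TableName="carts",
    KeySchema=[{"AttributeName": "user_id", "KeyType": "HASH"}],
    AttributeDefinitions=[{"AttributeName": "user_id", "AttributeType": "S"}],
    BillingMode="PAY_PER_REQUEST",
)
table.wait_until_exists()

# Write and read a cart item; there are no servers or capacity units to manage.
table.put_item(Item={"user_id": "u-123", "items": ["sku-1", "sku-2"]})
print(table.get_item(Key={"user_id": "u-123"})["Item"])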

When you need help deciding what database to migrate to, an AWS Premier Tier Partner like Mission Cloud can advise you in determining the right database for your needs. Contact Mission Cloud to set up your complimentary one-hour session with a Mission solutions architect to discuss your database questions.




Open Liberty 23.0.0.3 Unveiled: Embracing Cloud-Native Java Microservices, Jakarta EE 10 and Beyond

MMS Founder
MMS A N M Bazlur Rahman

Article originally posted on InfoQ. Visit InfoQ

IBM unveiled Open Liberty 23.0.0.3, boasting support for Java SE 20, Jakarta EE 10 and MicroProfile 6.0. This significant release introduces the Jakarta EE 10 Core Profile, Web Profile, and Platform, as well as enhancements to various features that comprise the profiles. Additionally, the release includes the new MicroProfile Telemetry 1.0 specification and updates to the Metrics, OpenAPI, and JWT Authentication specifications. Open Liberty 23.0.0.3 marks a milestone in the runtime’s development since its inception over five years ago.

The Jakarta EE 10 release signifies a major milestone: it is the first release to deliver updated specifications since Java EE 8 in 2017, and the first major update since Oracle donated Java EE to the Eclipse Foundation. The release includes numerous updates to existing specifications and introduces the Core Profile, tailored for lightweight runtimes like Open Liberty, to optimize the operation of cloud-native Java microservices.

The Jakarta EE Core Profile, new for Jakarta EE 10, features Context and Dependency Injection 4.0 Lite, JSON Binding 3.0, RESTful Web Services 3.1, JSON Processing 2.1, Annotations 2.1, Interceptors 2.1, and Dependency Injection 2.0. Jakarta Contexts and Dependency Injection (CDI) 4.0 Lite further enhances support for lightweight runtimes and microservices. This streamlined version of CDI 4.0 provides developers with the essential features for building cloud-native Java applications while minimizing resource consumption, improving startup times, and optimizing overall performance. With CDI 4.0 Lite, developers can now leverage the power of CDI in a more efficient and agile manner to meet the ever-evolving demands of modern Java development.

This release also includes Jakarta EE Web Profile 10, encompassing Jakarta EE Core Profile 10, Authentication 3.0, Context and Dependency Injection 4.0, Concurrency 3.0, Expression Language 5.0, Faces 4.0, Security 3.0, Servlet 6.0, Standard Tag Library 3.0, Persistence 3.1, Server Pages 3.1, and WebSocket 2.1. The release also presents Jakarta EE Platform 10, which includes Jakarta EE Web Profile 10, Authorization 2.1, Activation 2.1, Batch 2.1, Connectors 2.1, Mail 2.1, Messaging 3.1, XML Binding 4.0 (optional), and XML Web Services 4.0 (optional). To enable Jakarta EE Platform 10 or Web Profile 10 features, developers need to add the respective feature to their server.xml file.


<featureManager>
    <feature>jakartaee-10.0</feature>
</featureManager>

For the Core Profile, enable its equivalent by adding specific features to your server.xml file. To run Jakarta EE 10 features on the Application Client Container, developers need to add an entry in their application’s client.xml file. For more information, developers can leverage the Jakarta EE 10 specifications, Javadoc, and content on the differences between Jakarta EE 10 and 9.1.

MicroProfile 6.0 is also a part of Open Liberty 23.0.0.3, bringing with it Jakarta EE Core Profile 10 and enhancements to the MicroProfile ecosystem. The new MicroProfile Telemetry 1.0 feature, along with updates to MicroProfile Metrics 5.0, OpenAPI 3.1, and JWT Authentication 2.1, are all included in this release, ensuring compatibility with the latest industry standards.

Java SE 20 support is another key addition to the Open Liberty 23.0.0.3 release, offering developers access to the latest features and improvements in the Java ecosystem. Additionally, this update includes numerous bug fixes, further enhancing the stability and performance of the runtime.

Developers can get started with Open Liberty 23.0.0.3 using Maven, Gradle, or container images. The release is available for download on the official Open Liberty Downloads page, where the Jakarta EE 10 and MicroProfile 6 packages have been added. Developers seeking assistance can ask questions on Stack Overflow, where the community actively provides support and guidance.

About the Author



Presentation: On Beyond Serverless: CALM Lessons and a New Stack for Programming the Cloud

MMS Founder
MMS Joe Hellerstein

Article originally posted on InfoQ. Visit InfoQ

Transcript

Hellerstein: My name is Joe Hellerstein. I’m a professor at UC Berkeley in computer science, and a fellow at Sutter Hill Ventures. I’m going to be talking about serverless computing, CALM lessons, and a new stack for programming the cloud. There’s a story we like to tell in computing that with every new generation of platform that gets invented, a programming model emerges that allows third party developers to unlock the unique properties of that platform in new and unexpected ways. This story goes back at least as far as the minicomputers of the early ’70s, with Unix and C, for which Ritchie and Thompson were given Turing Awards. We see similar patterns as we look over time at the disruptive platforms that have been invented. What’s interesting is what’s missing from this slide, the biggest platform for computing that humankind has ever assembled, the cloud. We don’t have a programming environment that is suited to its unique physics.

The Big Question

The big question that I have that we’re going to talk about is how will folks program the cloud in that way that fosters the unexpected innovation that takes advantage of its properties. I’ll say before we get into it, that distributed programming is hard. It’s harder than the other challenges we faced on the previous platforms. It entails issues of parallel computing, of the consistency of data that’s replicated across multiple geographies. Of the possibility that pieces of the system have failed while it’s still running. Modern autoscaling challenges where things are supposed to grow as usage grows and shrink as usage shrinks to save money, that only makes it harder. In my mind, programming the cloud is one of the grand challenges for computing over the next decade. It’s something I’ve been working on and I’m very passionate about.

Outline

In this talk, we’re going to have four chapters. In the first we’ll talk about serverless computing, which is an early leading indicator of cloud programming. We’ll talk about state and coordination, which are the hardest things to do in a distributed programming language. Then we’ll talk about some foundations, the CALM theorem that help us figure out when it’s easy to deal with state and when it’s hard. Then finally, I’ll talk about work we’re doing in my group right now on a project called Hydro, which is building a language and compiler stack for programming the cloud.

Serverless: Signs of Interest from Cloud Vendors

Let’s begin with serverless. I view serverless computing as a leading indicator of interest from the cloud vendors in making the platform truly programmable to third parties. Work on serverless computing goes back at least as far as 2014. It’s not brand-new. Lambda was introduced on AWS in 2014. It’s really come to the fore in the last four to five years. The idea of serverless computing particularly as instantiated by something called Functions as a Service, or FaaS. The idea is really simple. The idea is you write a little function, you can write it in your favorite language, Python, JavaScript, Java, you launch it into the cloud. This little function can take an input and produce an output. Once you launch it to one of these FaaS platforms, clients from the internet can send inputs to this thing, and it’ll produce outputs for them at whatever scale of clients you manage to recruit. If you have a very popular function, the FaaS platform will scale up to meet all your demand. If you have a function that’s not being used, the FaaS platform will scale down so you’re not billed for any usage. It’s a really attractive way to think about the cloud and how to program it. The promise of FaaS then, is that it’s like a boundless computer. It’s programmable. It’s as big as you need it to be or as small as you need it to be. It knows no bounds on compute and storage. The reality of FaaS, however, is much less attractive. It’s what I like to call an elastic army of incommunicado amnesiacs. Let me tell you what I mean by that.

Serverless Function Limitations

Serverless functions have the positive aspects that they have this boundless compute. What’s negative about it is the functions themselves have to be simple enough to run on a single computer. They’re basically a laptop’s worth of computation. You can have as many laptops’ worth of computation in the cloud as you need, but each of those computations is completely isolated. This is caused by the fact that serverless platforms don’t let functions talk to each other, there’s no network messages. That means no distributed computation out of a platform that is fundamentally the biggest distributed computer you can imagine. We really handicapped ourselves in terms of the physics of the cloud, we’re not taking advantage of the ability to have functions that span multiple computers.

A second problem is there’s no low latency data storage or access from these functions. They can access remote cloud services like S3 storage, object storage, or remote databases, but they don’t have any local storage for very fast persistence, which means that typically, they’re stateless. They have no information that’s kept across invocations. Then the third big missing piece with these functions and Functions as a Service platforms is that they’re made to reboot every few minutes. Because they have no storage, they didn’t squirrel away in memory what they knew before they were rebooted. They’re basically reborn with no memory, they’re amnesiacs. This is why I call it an army of incommunicado amnesiacs. They have no ability to talk to each other, no ability to remember anything. We wrote about this in a paper that I’d encourage you to read. The spirit of this is not so much that serverless computing is bad, but rather that it skipped some fundamental challenges, and if we solve those challenges, we would really have a programmable cloud in our hands. Let’s get to work.

Serverless as a Leading Economic Indicator

Before we get into that work, though, I do want to point out that serverless computing is really interesting as a leading economic indicator. The first 15 to 20 years of the cloud is what I call the boring revolution, surely a revolution in the sense that folks like Jeff Bezos were able to overturn the industry and shift billions of dollars of revenue from legacy enterprise vendors like IBM and Oracle, to cloud vendors that didn’t previously exist, like AWS and Google Cloud. Of course, Microsoft was able to weather this transition internally. That’s a big deal. The truth is, it’s still legacy enterprise software. What can you get in the cloud today? Mostly, you get databases, and queuing systems, and application services, and load balancers, and all the stuff we already had before. From a fundamental computing perspective, it’s not that exciting. It seems somehow, we haven’t taken advantage of what the cloud could let us do in terms of innovation.

For 15 years or so, the likes of Jeff Bezos were off pulling off this revolution, they weren’t worried about enabling third-party developers to take advantage of the new platform. They were busy taking advantage of it themselves. Now they’ve won that battle. In 2022, they start to have incentives to grow by exposing the platform and fostering more innovation. I think the time is ripe to really start answering this question, how do we program the cloud? I think there’s economic incentives for it now. All we need is for the infrastructure folks who build the programming models, the systems that do the runtimes behind those programming models, we need to roll up our sleeves and say yes to the hard challenges that earlier generations like Functions as a Service took a pass on.

The good news is that in that 15 years, when the cloud vendors were stealing business from the enterprise vendors, a bunch of us were off doing research. We’ve been quietly able to do a bunch of research. Now it’s time to start to harvest that research, put it together into artifacts that people can really use to leverage the power of the cloud. We’re going to roll up our sleeves. I think we can do this. The goal really is general-purpose cloud computing, without compromises, without really narrow corner cases. How do you let most people write programs that harness the full power of the cloud? Three main goals, simplicity. It should be easy for developers to get on-ramped into this new programming platform: easy to learn, easy to debug, easy to operate. Correctness, we want the programs to run as intended. That is tricky in the cloud, as we’re going to talk about. Then, the dynamics of cost and performance. We want efficient code by which we mean, yes, it should run fast. It should also only consume the resources that it needs. It should be efficient in terms of my cloud bill, not just in terms of my time.

Toward Generality: Embracing State

To get to that general-purpose computing, the thing we’re going to have to do, the hard thing is to deal with what’s called state. In the animation, I’m embracing the state where I live, and you might want to embrace yours. What I mean by state here is the data that functions are computations, generate and manage within and across invocation. If you call a function, it generates some information that it keeps in RAM, and then if you call it a second time, it might need to remember some things from the past. I call all of that state. It’s really data that’s either in memory or could be on storage devices as well. There’s two challenges with state, one is hard, one is less hard. The really hard one is a correctness challenge called distributed consistency. We’re going to talk about it quite a bit. It’s a correctness problem. It’s a barrier to simplicity. It really makes programming distributed systems hard to get right, and it’s unavoidable. The second challenge is data placement and data movement. Which data should be where, when, for high performance? This performance problem is the stuff that we as engineers are quite good at. You do a prototype, it’s a little slow. You profile it, you realize the data that was over here should be over there. You do some adaptations, and over time your system gets fast. Challenge number one is the one I want to talk about.

The Challenge: Consistency

What is the challenge of consistency? The challenge is to ensure that agents that are separated by space, agree, or will agree on common knowledge. We have this lovely couple in the upper right, and they are an example of what I call simple data replication. All we’ve replicated is x, a variable. We haven’t replicated a program and all its meanings, we’ve just replicated some state, x equals heart. They both agree that this variable x, which could change, it’s mutable. Right now, it equals heart, and everything’s happy. What happens if they’re separated? They can’t talk to each other. One of them changes the value of that variable, so now unfortunately, the young woman on the left believes that x is poop. While the young man on the right believes that x is heart, what is going to happen? The problem is that with their beliefs, they will move forward and make more fateful decisions, and step down a forking path of these decisions to the point where if we have to put them back together later, they’ve made so many decisions based on different assumptions, that we can’t put them back together in any sensible way. This is what’s sometimes called split brain divergence. The computer brain across these two people has two different parts that can’t be put back together in a sensible way.

This is not a new problem. This is a classic problem in distributed computing, and it’s been solved at some level. On the right you see Leslie Lamport. He won the Turing Award for inventing a consensus protocol called Paxos. What’s a consensus protocol? It’s a protocol that allows multiple computers to agree on a value, like x equals heart. There’s similar protocols from database systems, like the two-phase commit protocol that gets a network of computers to agree on whether or not to commit a transaction. These protocols are well known. I will say, though, that they are tricky. Every time I have to teach Paxos, I really need to bone up on it before I go into the classroom, because otherwise students will get very lost. There’s a ton of details, and it’s not particularly intuitive. Sometimes in engineering, in math, in computer science, stuff is tricky, but you got to do it. It’s worth it because it’s cool. It solves a real problem. This stuff, unfortunately, is both tricky and also bad. We shouldn’t use this stuff. Don’t take my word for it. What do I mean by this?

Coordination

In the upper right of this slide is a picture of James Hamilton. James has been around forever. He was involved in the early IBM databases. He was involved in the Windows file system. Over the last 20, 25 years, he’s been involved in architecting two of the major clouds, both at Microsoft, and at Amazon. He is one of the key architects of the modern cloud. Some 13 years ago, he gave a talk, and the quote is so good I like to read it like poetry. He talks about it like this. “The first principle of successful scalability is to batter the consistency mechanisms down to a minimum, move them off the critical path, hide them in a rarely visited corner of the system, and then make it as hard as possible for application developers to get permission to use them.” It’s as if he’s saying, “Thank you Dr. Lamport for all the Paxos but we’re not going to use that.”

Why is he saying this? Why is he tying our hands and not letting us use the solution to the problem at hand? Coordination of the forms that he’s calling consistency mechanisms, so things like Paxos, two-phase commit, they involve computers waiting for each other to come to agreement. There’s a lot of messages that go back and forth that you have to wait for. That waiting causes other machines to wait, and it builds up queues. I may be waiting for you. Other services may be waiting for me. Services way up the chain may not even know what they’re waiting for anymore. What happens is if some junior developer at a big cloud vendor decides to call Paxos in the middle of their program, because they think they might have a bug otherwise, it can cause a cascading effect of queues throughout the cloud that bring things to their knees. This has been documented in some of the major clouds, especially in the first many years. James’s advice is very pragmatic, we don’t want to use coordination. If we have to use it, only the experts get to use it, and only in a corner of the system.

The problem with coordination mechanisms is that they're reasoning in a very conservative way, way down at the bottom of the system, at the level of memory accesses, things like memory access, read and write, or disk access, read and write. Databases are classic for this in terms of their transaction semantics. My former student, Peter Bailis, who is now at Stanford and founded the startup Sisu, had this great cartoon in his PhD thesis. It's an adaptation of The Far Side. It goes like this, what we say to databases is, "Ok, database, now be good. Please move one iPhone out of inventory and into Bob's cart." What the database storage manager actually hears is, "Blah, blah, blah, read, blah write, blah read, blah write." It knows nothing about all that application-level semantics about products and inventories and what it means for there to be less than zero things in inventory or too many things in a cart. It doesn't know any of that. It just knows reads and writes. It's very cautious about what reads and writes it will allow.

What we’re seeing is a generational shift in the way we’re going to solve the problem. The Lamport-era 20th century approach was to reason at the storage level, and make worst-case assumptions. What we’re going to talk about, and what we’re going to see in more modern research in the 21st century is going to look at application specific assumptions. What do we know about our application, and how it uses its data and its state, and how the computations in the application affect that state? At some level, the stuff about memory access is really not where the action is anymore. If we want to do this well, we’re going to be reasoning about programs, not about I/Os.

When and Why Do We Need Coordination?

The big question, when you start walking down this path of application semantics is, when do we truly need stuff, like Paxos, like Lamport’s work? Why do we need it, and when can we avoid it? This question, when do I need coordination and why? If you ask a traditional computer scientist, ask most computer scientists, why do we need a lock in our program? That’s a little example of a local form of coordination. Why do I need Paxos? They’ll tell you, “We have conflicts around some resource, and if we don’t coordinate the order in which people access that resource in space and time, bad things will happen.” A typical example in the physical world is this intersection, hasn’t been coordinated, and two cars can’t be in the same place at the same time, and so you get this problem. There’s an obvious solution to this thing, which is fix the stoplights. Stoplights are great, because then the north-south guys can go for a while. Then the east-west crowd can go for a while. Then the north-south again. At any given time, at least we’re utilizing that resource of the intersection. The problem with that, of course, is if the arrival rate of cars is faster than the drain rate that happens while the light is green, we get infinite pileups at these intersections, even with stoplights. What are we going to do? We have to have stoplights. Physics says you can’t be in the same place at the same time. Maybe there’s a clever engineering solution that doesn’t break physics. In fact, there is one and you see it all the time. It’s to use another dimension in space, we’ll build overpasses, and then we don’t have to have stoplights. These overpasses can go at full bandwidth all the time. Of course, you’ve been stuck in traffic. There’s other reasons why traffic backs up besides coordination. The idea that we can avoid coordination in cases where we might have thought we have a contention or a race condition makes the question really hard, when do we need coordination really? When is there a clever solution to work around it and not have coordination?

It’s a theory question really, at the end of the day. Let’s move to our whiteboard. Suppose you understand your program semantics, which programs have a coordination-free implementation? If we could just find it, a clever programmer could build it without using coordination. Those are the good programs, we put them in the green circle. The rest of the programs are the programs that require coordination. There is no programmer in the world who could come up with a solution to this problem without using coordination. It’s absolutely intrinsic to the statement of the problem. I want to know, what’s the green stuff, because the green stuff, I can make that autoscale and run faster in the cloud and all these good things. The red stuff, I’m going to have to bite the bullet and use coordination. That’s the stuff that James Hamilton doesn’t want the junior programmers to write. What is that green line? Can you give me a test that will separate the red programs from the green programs? This is a computational complexity or computability problem? What are the programs computable without coordination versus the programs that require coordination? What we’re going to do next is look at that theory. It’s called the CALM theorem. It’s going to inform how we can actually build systems. What we’re going to see is that monotonicity is going to be that green circle, the bright line test for whether coordination is needed. We’re going to talk about that, some lessons, and some performance payoffs.

CALM Theorem

Here’s the statement of the CALM theorem as it was conjectured in a talk I gave back in 2010, and then proved the very next year by an extremely brilliant graduate student at the University of Hasselt, in Belgium, Tom Ameloot. The theorem says this, a distributed program has a consistent and coordination-free distributed implementation, if and only if it is monotonic. The green line test is monotonicity of the problem. If it’s monotonic, we can get a consistent coordination-free implementation. If it is not monotonic, it’s outside the green circle, there is no consistent coordination-free distributed implementation. If you want to dig into this research, I recommend you start with a short six-page overview paper that Peter Alvaro and I wrote in CACM, just a couple years ago. It’s pretty accessible, and it points to the relevant papers where you can dig deeper. The papers are beautiful.

Let me give you some intuition. We’re going to need some definitions, particularly of our key words, consistency, monotonicity, and coordination. We won’t get formal definitions, that’s what the proofs are for, but we will get some intuition. For consistency, we’re not going to use an I/O consistency like Lamport did, we’re going to use an application-level consistency. I want the outcomes of the code of the application to be the same everywhere in the network, regardless of whatever shenanigans are going on with the network. Messages getting delayed, or reordered, or having to be resent multiple times, so we receive them lots of times. Regardless of all those shenanigans, I want everybody to compute the same thing. The test we’re going to apply is monotonicity. Monotonicity is a property of a function that says if you give it a bigger input, you get a bigger output. Formally, on the left, it says that f’s monotone is defined as if x is a smaller thing than the y, then the function on x is a smaller thing than the function on y. Put in a bigger thing, namely y, get a bigger outcome, namely f of y. The cool thing about this is you can think of this as a streaming environment, start by computing at x, all of its outputs are correct. Then if you grow to include the stuff that’s in y that wasn’t in x, you can compute that, and all the stuff that was from x will still be correct.

With monotonic programs, the early results are guaranteed to be in the final results, so you can stream output to the user without regret. This should give you a hint at why this is coordination free. Every computer in the system, the minute it knows something is true, is guaranteed that thing will be true in the final outcome, and so it can emit the things it knows true right away without asking anybody, no coordination. Which leads to the hardest question actually, turns out, how do we define what coordination is? That’s where Ameloot’s proof is really brilliant. He shows that the way to think about coordination is, there are the messages that we have to wait for, even if it turns out we have all the data. Suppose I know everything about the answer, and the only thing I don’t know is that no one else knows anything about the answer. Then I’m going to run around to everybody saying, do you know anything else I should know? Only after everybody else have no idea what you’re talking about, then I can output things. That work I was doing with everybody, that counts as coordination. The actual work to compute the answer is not coordination, the work to check that nobody knows anything that I need to know, that’s coordination.

Easy and Hard Questions

Again, the CALM theorem says that the monotonic programs are exactly the set of programs that can be computed coordination free with a consistent outcome. Which leads to some easy and hard questions that we can ask of you. The first question is, are any of you over 18? If any one of you sent me a text message or an email or whatever that said, I’m over 18. Then I’d have my answer and I’d be done. What about if I asked the question, who is the youngest person to watch this video? That one, first of all, suppose you are the youngest person who watches the video, the only way you’ll know that you’re the person who should send me a text is by asking everybody else, are you older than me? Worse than that, suppose somebody younger than you comes along tomorrow and watches the video, then you’re not the right answer after all. You shouldn’t have said you were the right answer. It’s non-monotonic. The more people who come along, the answer might change. We can’t stream outputs. We might have to retract outputs that we released early.

Let’s look at a rephrasing really of the right-hand question. All we’re going to do is we’re going to rephrase it to say, who is the person that nobody is younger than? I’ve done the thing where we do a double negation, where we do De Morgans Laws. Nobody is younger than, so let’s read it out. There exists an x such that there does not exist any y, where y is less than x. There is no person younger than this person. The reason I rewrote this is it’s really easy to look at this stuff in logic now and see what’s monotone and what’s not. The thing on the left there exists. The minute we see something that exists, we’re done. Anything more that comes along, only makes that thing more true. It was true, and it remains true no matter what happens. The thing on the right, it’s got that not sign in front of the exists. When you see not, it’s non-monotone, because we might start with there not existing a y. As we introduce more y’s, we might find one that does exist, and so we’ll start out by saying, Mary is the youngest person watching this video. There’s nobody younger than Mary until somebody younger than Mary comes along. That not exists is something we can’t test locally. We need to know a lot of everything to test not exists. That negation is the key to non-monotonicity. Very easy to distinguish monotone from non-monotone in these logic expressions.

The CALM Theorem

That gives you a flavor for the CALM theorem. Monotonicity is the bright line test for whether coordination is needed. You might be asking a pretty natural question at this point, which is like, why do you need to know this? Very nice. It was good to go to a talk and learn a little computer science, but like, what’s it for? I’m going to try to argue in four different ways that you should understand this, and you should ingest it and somehow use it in your work. We’ll do it first to look back at some familiar ideas in distributed computing to some of you, the CAP theorem. Then we’ll talk about right now, what can you walk away with from this talk as a design pattern for thinking about how to build systems that go real fast, and are monotone. Just over the horizon, stateful serverless computing, how can we have serverless functions, like we talked about at the beginning, that have state, that have data, and keep them correct? Then finally, we’ll transition to the last part of the talk, talk about languages and compilers that can help all of us program in ways that take advantage of monotonicity when we can. Then try to move coordination, as James Hamilton suggested, into the corner, into the background.

A CALM Look Back at CAP

The CAP theorem was a conjecture by my colleague, Eric Brewer at Berkeley. It says that you only get two out of the following three things, consistent outcomes, availability of your system, that is that it’s on and live and accepting updates, and partitioning in the network. Partitioning means that at least one computer can’t talk to at least one other computer. A way to think about this is if you have a partition, if there’s two computers that can’t talk to each other, you have two choices, either you can turn off the service, and then it’s not available. Or you can let the service run with these two different computers that can’t talk to each other, and you’ll end up with a lack of consistency. You’ll end up with split brain. You won’t be able to put it back together when the network heals. That’s the CAP theorem. For 15 years, people talked about the CAP theorem, it just doesn’t seem quite right. I bet I can beat it. There was a proof of the CAP theorem that came out of MIT, Gilbert and Lynch. An unspoken assumption of that proof, and in a lot of the discussion of the CAP theorem advocates, is that the kind of consistency we mean, is that low level I/O consistency that requires coordination. The definition of consistency is one that requires coordination in its definition.

As I said, that’s tired in today’s world. What the CALM theorem does, is it explains when and why we can beat the CAP theorem and get all three of consistency, availability, and partitioning. Remember that we can have consistency that’s coordination free in the green circle. In the green circle, we can have split brain for a while, while things are partitioned. When they come back together, because they’re monotone, they’ll just exchange messages and start outputting more good stuff. We can put the Humpty Dumpty back together, in a coordination-free environment. Coordination-free consistency is possible, even if we’re available under partitioning. What are the coordination-free consistency programs? The monotone programs, that’s the CALM theorem. CALM is explaining where the happy case is. It’s the coordination-free programs. We can get all three of the CAP things, and one of the sad cases. The CAP theorem is really a theorem about that outer red circle, or about a worst-case assumption. This definition of coordination is at the heart of the technical differences in the formal proofs of these theorems. What I can tell you is that while the theoreticians chose their formalisms, Eric and I are on the same page about how this relates to what you and I might want to build in the field. Eric and I are pretty connected to practical use cases, both of us involved in the industry.

Design Patterns from CALM

What I want to talk about now are things you can build in languages like C++ and Java that would benefit from the design patterns of the CALM theorem. What we did in my group is we applied it to the context of traditional software. We picked a little petri dish of a problem, which is a key-value store. We built a key-value store called Anna. I should mention on the upper right, this is Anna's hummingbird. It's a kind of hummingbird that was clocked as being the fastest animal for its size on the planet. There was research that was done at UC Berkeley by colleagues, so we like the name. The idea with Anna is that it's very lightweight, very fast. Just as fast as it needs to be for its size, because it's very serverless and autoscaling. The key things with Anna are twofold. First of all, it does give you many kinds of consistency guarantees that are coordination free, so things in the green circle. It'll give you causal consistency if you know what that is. It'll give you read committed transactions if you're familiar with transaction levels. It does this with no coordination, which means it is always running full tilt parallel, and it's able to autoscale. Here's a couple of different papers on the Anna system that you can go chase down. Great work by Chenggang Wu, the lead author. He won the ACM SIGMOD Dissertation Award for this work, so a landmark in database systems.

CALM Autoscaling: The Anna KVS

The key thing with Anna is not only is it fast, but it’s autoscaling. We started out with the goal of overcoming conventional wisdom on scaling from the first decade of the cloud, in essence, or of at least network services. Jeff Dean, around 2009, after about a decade at Google, was giving talks saying that when you build a service, you should design for 10x growth, but plan to rewrite it before you get to 100x growth. That’s what we’ve had to do at Google every time. That was good lessons for Google back then but it’s a terrible idea today. Serverless computing is all about my application might not be popular now, but maybe next month, it’ll take off like a hockey stick, and it’ll be 10,000 times more popular than it is today. I as a programmer don’t want to rewrite that application three times between today and tomorrow. I want to write it once. I want it to take up as much compute as it needs, up and down.

What does CALM tell us? What we’re going to do in Anna is we’re going to take the idea of coordination-freeness, and we’re going to use it to enforce certain consistency levels. We’re going to enforce levels like causal consistency. The way we’re going to do it is we’re going to use data structures that only grow. Data structures that are monotone, like sets that we union things into, or counters that only get bigger and bigger. These are what are called semilattices, or if you’ve heard of CRDTs, which I’ll talk about later. These are like simple CRDTs, but we’re composing them into hierarchies so you can have a set of counters of sets, and so on. When you put these things together, each one of them super easy to reason about, but they’re all data structures that just get bigger. We never remove or lower them. We only make them bigger over time. In doing that, we have monotonicity. The data structures themselves are just C libraries. They’re just sets and counters and things. Because we’re using them only with things that grow or methods that grow like union, and increment, then we’re cool.
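Here is a minimal Python sketch of the grow-only building blocks described here: a set that only grows through union, and a counter that only grows through increments, each with an order-insensitive merge. This is illustrative code, not Anna's actual C++ lattice library, and the replica names are made up.

# Minimal sketch of grow-only ("monotone") building blocks in the spirit of Anna.

class GrowOnlySet:
    def __init__(self, items=()):
        self.items = set(items)

    def add(self, x):                 # only grows
        self.items.add(x)

    def merge(self, other):           # set union: order and duplication don't matter
        self.items |= other.items


class GrowOnlyCounter:
    def __init__(self):
        self.counts = {}              # per-replica counts, each only increases

    def increment(self, replica_id, n=1):
        self.counts[replica_id] = self.counts.get(replica_id, 0) + n

    def merge(self, other):           # element-wise max never loses increments
        for r, c in other.counts.items():
            self.counts[r] = max(self.counts.get(r, 0), c)

    def value(self):
        return sum(self.counts.values())


# Two replicas update independently, then merge; the result is the same
# regardless of message order or duplication.
a, b = GrowOnlyCounter(), GrowOnlyCounter()
a.increment("replica-a", 3)
b.increment("replica-b", 5)
a.merge(b)
print(a.value())   # 8

s1, s2 = GrowOnlySet({"x"}), GrowOnlySet({"y"})
s1.merge(s2)
print(sorted(s1.items))   # ['x', 'y']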

What this is going to allow us to do is scale across threads without coordination. Here we have three objects in the Anna key-value store and one operating thread. We can scale it up, replicate the objects in certain ways. In this case, there’s two copies of every object, but each node has a different pair. This is good for fault tolerance. Any node can go down, and we still have all three objects. We can also scale up and down a memory hierarchy. Here, what we’re doing is we’re putting cold objects like blue, on disk drives. There’s no point allocating RAM in the cloud for those objects if no one’s accessing them right now. Hot objects like green and red will promote up into the memory tier where things are fast access. We can choose to have things that need to go low latency and fast in memory. Things that are not used very often and cold can be far away. This is the control we want over state. We want state to be close to us, if we need it to be close to us, and far away if it doesn’t need to be close to us. We also have the ability to scale back down when the system changes usage, and we have fewer users. Anna can do all this really easily.

CALM Performance

The performance we get out of Anna is phenomenal. What you’re seeing up here, the Anna architecture essentially shares no memory. Even individual threads just talk to each other with messages, and computers talk to each other with messages. What we’re seeing is as we add threads to this thing, which are the individual ticks on the x axis, it scales linearly in this very smooth way. When we saturate a machine with all its threads, so we have a 32-core machine here, we just go to the next machine, and it just keeps scaling, with almost no interruption in the smoothness of that. If you know anything about building parallel and distributed systems, you know that getting smooth scaling like this is extremely difficult. When you’re doing coordination-free computing, it’s often very easy. Anna has this beautiful scaling phenomena. More impressive, though, is that it is crazy fast even under contention. That’s the case where there’s a few objects that are very hot, everybody wants to write them and read them and write them, and lots of objects that are pretty cold. In most systems, those hot objects determine your throughput. In Anna, you can have lots of copies of those hot objects, lots of people can be updating them, and it’ll just keep going at maximum performance. What that gives you is ridiculous performance improvements over the competition.

Masstree was the fastest in-memory multi-threaded key-value store we could find at the time from research, came out of Harvard. Anna was 700 times faster in these experiments with contention. It was 10 times faster than Cassandra in a geo-distributed deployment with this contention. It was 350 times the cost performance from DynamoDB. It was 350 times faster for the same price than DynamoDB. The reason for this is very clear, it’s CALM and coordination free. That means that as we’re inserting things into this key-value store and reading them, we never have to do any waiting. We have no atomic instructions, no locks, no Paxos, no waiting ever. What that causes in performance when you do your performance breakdowns is that the competition on the lower lines of the chart on the right, TBB is Intel’s Thread Building Blocks library. It’s Intel’s fastest library for hash table. Spends 95% of its time in this workload retrying atomic instructions. These aren’t locks, so it’s lock-free TBB. It is atomic so it needs to successfully update the thing before somebody else peeks at it. It fails, and so 95% of its time, over and over, it’s trying to do atomics. Atomics are coordination. They’re just not locks, they’re a different kind of coordination. Anna, on the other hand spends 90-plus percent of its time just doing PUTs and GETs. This is what’s called good PUT. Anna is doing only good PUT. Ninety-percent of its time is spent doing good stuff. Anna is giving you consistency, this is not just the who knows key-value store, it’s guaranteeing you things like causal consistency, or read committed.

Lessons from Staying Monotonic

Some lessons from staying monotonic. If you can figure out how to have your data structures and what you do with them only get bigger over time, monotonicity, then you can update those things at any time, in any place have copies at will. The system will have maximum good PUT even under contention, and you have the ability to be really profligate about your replication, so you can do things like replicate horizontally to your peers for fault tolerance, or for load balancing, or for latency. You can have copies of the data close to your users geographically. You can also replicate vertically to faster caches. If you want data in your serverless functions, for example, you want them to be stateful. Or you can move the data to slow storage if it’s cold. You pay more dollars only for the things that are hot. That’s how you beat the heck out of something like DynamoDB.

Stateful Serverless Computing

We talked about what we can do, now let's talk a little bit about what's just over the horizon. We've already prototyped a couple years ago at Berkeley, the idea of Functions as a Service, but with state. The trick was to use the ideas of Anna, in our Functions as a Service environment, to do updates locally and remember them as long as the data types are monotone. One of the things that was really interesting about Ameloot's CALM theorem papers, he showed a third equivalence. I said consistency is logical monotonicity, CALM. He pointed out that the monotonic programs and the coordination-free programs are the same, and they're also the same programs that can be computed by oblivious actors. What's an oblivious actor? It's an actor that doesn't know who all the other actors in the system are. It doesn't even know its own identity. It's just one of an army of clones, not knowing who's in the army that just computes when it gets messages. It sends messages, gets messages, but it doesn't really know who's around. It's just doing its thing. Obliviousness is just the property we want for autoscaling. It means that if I don't need to know the population, I can add nodes at will, I can remove nodes at will, and the individual nodes just keep doing their thing. The programs you can execute in that environment are the monotonic, coordination-free programs. It's all the same. It is possible, and we've shown it in Cloudburst, to build a Functions as a Service environment. It's oblivious, so the functions don't know who all is running. It's stateful, and communicating, so it's removing those restrictions that we worried about early in the talk with serverless systems. It's got the freedom to scale up. We can add another node and start copying state to it at any time. Scale down. We can stop taking updates at a node, copying state elsewhere, and decommission it.

We’re using this architecture that Chenggang and Vikram, the leads on this work, have gone off and started a company called Aqueduct, which is doing serverless model serving, so take machine learning models. They need to do inference or prediction at scale, and you want those things to scale up and scale down with use. That’s what Aqueduct is all about. It’s an open source library. I encourage you to go check it out if you do machine learning or MLOps. What I was going to close with those, that the shortcuts for Functions as a Service, the statelessness and the lack of communication aren’t necessary if you commit to monotonic code. Monotonicity is the key property that was missing.

What More Could We Want?

So much for the CALM theorem. Now I want to talk about Hydro and what we're doing to make it easier to program the cloud. First, let's ask the question, what more could we want? I told you all these great things about monotone programming, but what I didn't focus on is the idea that non-monotone stuff happens. James Hamilton acknowledges that sometimes in the corner of the system, you need to use coordination. We really do want to have a programming environment for general-purpose computing, that has a mix of monotone and non-monotone stuff. We might want strong consistency. We might want transactions, sometimes. I/O guarantees, maybe we want that, we should be able to have it. Simple thing, I might want to know when my monotone program terminates. Let's go back to my question of, is anyone here over the age of 18 watching this talk? I'm sitting here recording the talk and I don't know the answer yet. No one's texted me. I don't know if the answer is no, nobody is over the age of 18 watching this talk, this talk is strictly for kids. Or just, I have to keep waiting. That idea of termination detection, even for monotone programs that haven't finished, it requires coordination. I could keep waiting, and somebody might come along and say I'm over 18, and that's good. Then I'm done. Until that happens, I just have to wait. I might like to know like, is there really nobody over 18? That's a non-monotone question. For all the people in the world, who are going to watch this, are they under 18? That's a non-monotone question. Termination detection requires coordination. Then there's some algorithms that just computational complexity tells us they require coordination. The monotone programs don't cover all of polynomial time, which is most of what we do in algorithms. They cover a large fragment of it, but there are programs you might want to compute whose algorithms require coordination, or that use coordination to get efficiency gains, so I want that option.

The second thing I want is I want a language, compiler, a debugger that’s going to help me with these problems. It’s going to address the real concerns of distributed computing. What are the concerns of distributed computing? Here’s one, is my program consistent? Even though maybe I’m not sure it’s monotone. Is it consistent? Can I take the state in my program and partition it across lots of machines by sharding it? Some of the state’s on one machine, some of the state’s on a second machine, and so on. If I do that, will my program still be correct? What about if I replicate the state particularly if it’s non-monotone, is it going to work out or not work out? Do I need to use coordination? Do I not need to use coordination? Where in the program should I put the coordination? What about failures? What failures can my program tolerate? What if a machine goes down? What if a network message gets dropped? How many failures can I tolerate before the system goes offline, or starts giving wrong answers? Then, what data is moving around in the system where? Who can see it? Is that ok? What if I wanted to control it, how would I control where the data goes? Take any one of these questions and go to your favorite traditional compiler, like the LLVM stack that we all use for compiled programs these days, and they’ll look at those questions and pretty much shrug their shoulders. Modern compilers were built to handle the problems you get on your PC, the problems you get on your phone. They were not built to answer the programming problems that arise in the cloud. I want a language, a compiler, a debugger that can answer these kinds of questions. Then, of course, locally on the individual machines we’ll call LLVM to compile the local code. That’s fine. It does a lot of good things. Hydro is going to try to address these kinds of challenges.

Inspiration is a Query Away: SQL

The inspiration for this is something we’re all familiar with. It’s just a query away. It’s SQL. SQL, the database query language is the single biggest success story in autoparallelized programming. Take an SQL query, run it on your laptop, take it to the cloud, run it on a million machines. It’s that simple. Not only that, we’ve been doing this since the 1980s. There’s a project at Wisconsin called Gamma, led by Dave DeWitt, who should win a Turing Award for this work, that showed that you could take SQL queries and parallelize them like crazy. He proved it by building it on a very early parallel computer that was incredibly hard to work with. Around the same time, a company started called Teradata to do the same thing. Teradata is still around, they’re still cooking. They can run stuff really fast across lots of machines. This has been known since the ’80s. A lot of it was reinvented in the big data craze in the 2000s. Honestly, most of the innovation has been around for SQL since the beginning.

Relational queries scale like crazy. Serializable transactions do not scale like crazy. As we take lessons away from databases, we should focus on the query languages, the query processors, the query optimizers, and not worry about these low-level I/Os things, like transactions. That’s a separate concern that we talk about separately.

I want to highlight a thing that goes back to the dawn of relational databases, and Ted Codd, who also won the Turing Award for this. The idea was that data moves around on disks over time, and queries stay the same. We need to build database systems, namely relational systems, that hide how data is laid out and hide how queries are executed. All you do is you say your query, and the system is in charge of worrying about, the data is laid out in a certain way, so I’ll run the query in a certain way. The cloud was invented to hide how computing resources are laid out, and how general-purpose computations are executed. The cloud in a lot of ways, is in the state that we were in with database systems in like 1969. It’s waiting for somebody to come by and invent a programming model that empowers us to hide this stuff. There’s lots to learn from database systems.

Our First Approach: Relational, Data-Centric Programming

The first approach we took in my group to solving this problem was a very relational, data-centric programming language that we called Bloom. Bloom is still available. You can play with it. It’s an embedded language inside of Ruby, actually, about 10 years old now. We took a theoretical language for relational queries called Datalog, we extended it to deal with time and space, so distributed systems problems. We call that Dedalus. Then, Bloom was the practical programming language we were showing to developers that took Dedalus into something that was programmable. We demonstrated many of the benefits that I’m alluding to in prototypes of the Bloom language that weren’t particularly fast implementations and they weren’t particularly easy to learn. There they are, and we wrote papers about them. Folks who are really interested in this area, I encourage you to have a look. What we had built in the Bloom era was a walled garden. We really didn’t focus on making this easy for people to learn, incrementally adoptable from existing languages. We didn’t worry about how it integrated with other languages, and we didn’t really worry about whether it was that fast, just that it was coordination-free and scalable.

Then the other thing I’ll point out about Bloom that we started fixing, and we’re continuing to fix in new projects, is that a lot of common constructs we want in computing are pretty clumsy to do with SQL or another relational language. The classic one is counters, which are monotonic and just be going up. They’re super commonly used in distributed systems, and we use them in a lot of our coordination-free tricks. They’re very hard to do in SQL or a relational language like Datalog. Another thing that’s hard to do with those languages is data that really wants to be ordered. I have a list of things, it’s ordered, and I want to preserve its order. That’s very clumsy to do in relational languages. It can be done, but it’s ugly. The programming model is not always friendly to the programmer in this environment.

Competing Approach: CRDTs

There was a competing approach around the same time called CRDTs. These are an object-oriented idea. The idea is that you have objects that have one method, the method is called merge. It’s an object class with a merge function that takes two CRDTs and merges them to make one big CRDT. Things merge together like amoebas, so these objects can get bigger. The only requirement is that your merge function can be anything you like, as long as it obeys the following rules. It is associative, which means that you can batch these merges up into batches of any size, and it still gives you the same answer. It’s commutative, so the order in which you merge things doesn’t matter to the outcome. It’s idempotent, so if you merge the same object in twice, it doesn’t change the outcome. In mathematics, there’s a name for a mathematical object that is a type with such a merge function that’s associative, commutative, and idempotent, it’s called a semilattice. This is an object-oriented interface to semilattices. Because we’re computer geeks, we give it a new acronym, we call it CRDTs.
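As a small sketch of those three merge laws, here they are checked on plain set union in Python; this is illustrative only and not taken from any particular CRDT library.

# Check the three CRDT merge laws on plain set union.
def merge(x, y):
    return x | y

a, b, c = {1}, {2}, {3}

assert merge(merge(a, b), c) == merge(a, merge(b, c))   # associative: batching doesn't matter
assert merge(a, b) == merge(b, a)                        # commutative: order doesn't matter
assert merge(a, a) == a                                  # idempotent: duplicates don't matter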

Unfortunately, CRDTs are broken. I like to call them the Hotel California of distributed state, because they only have a merge method: you can merge state anytime you like, but you can never read safely. If you observe the internal state of a CRDT, you freeze it in time, you say what's in there, you open the package and try to read from it, and that breaks the guarantee of eventual consistency they're giving you. Depending on when you read, what you see may be non-monotone and change over time. CRDTs sound like they're promising you correctness, but they promise correctness only if you never look at your data; they don't promise correctness if you do look at your data. Still, in the spirit of Anna, they're inspiring. They allow us to start thinking about data structures that only get bigger, and CRDTs are data structures that only get bigger in a formal sense. They're getting a little bit of adoption. People are starting to pay attention to CRDTs, and they're even beginning to have a little impact in some commercial products. Of course, they're being used by developers in ways that, hopefully, avoid the bad cases they don't guarantee to handle. I view them the way I view serverless functions or NoSQL: they're flawed, but they're a leading indicator that people want something, and that we could build something even better.
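To make that read hazard concrete, here is a hedged, hypothetical sketch reusing the GSet above (the cancelled-orders scenario is invented for illustration): merging replicas is always safe, but any decision based on a read of the internal state can be invalidated by the very next merge, which is exactly the non-monotone behavior eventual consistency does not cover.

/// Hypothetical scenario: replicas keep a grow-only set of cancelled order ids.
/// Merging replicas is always safe; acting on a read is not.
fn safe_to_ship(replica: &GSet, order_id: &str) -> bool {
    // Non-monotone observation: "order_id is NOT in the set".
    // It can be true now and become false after one more merge arrives,
    // so shipping based on this answer may contradict the eventual state.
    !replica.0.contains(order_id)
}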

Another Influence: Lessons from Compiler Infrastructure

I talked about relational languages. I talked about CRDTs. Another big influence on the work we're doing is just compiler infrastructure the way it's done today, which is basically to have stacks of languages. The canonical example is LLVM, which is a very influential piece of software. There are many key ideas in LLVM, but the one that matters most architecturally is that it has its own language in the middle, an intermediate representation, or IR as it's called in the programming languages community. You can have many languages at the front of LLVM, like C and FORTRAN and Haskell in this picture, and since then Rust and Swift and a bunch more. First, you translate those to LLVM's internal IR, which is a language; you could actually type it out. Then the LLVM optimizer works on that IR to generate an appropriate machine language for the backend, which could be one of many machine languages. It's really three languages: the input on the left, the IR in the middle, and the machine code on the right, all of which, in principle, you can type in text. If you're a performance-sensitive person, you can jump in at these different layers and hack. There are people who change their machine code, which is not recommended. There are people who will change the LLVM IR that's generated from the frontend, which might sometimes be worth it.

Hydro’s Inspiration

Hydro is a new programming language for the cloud. It's inspired by logic languages like SQL, because they give us the analytic power to look for correctness and optimization. If you remember back to our example with the questions, the exists and not-exists questions, the for-alls, those are logic languages. You saw those expressions where we could just look for the "not" side and be able to tell that something was non-monotone. Logic languages are like that, and SQL is like that. It's inspired by lattices, because lattices are really generalizations of what's good about relations to other data types. They allow us to have grow-only things like sets of tuples, which is what relations are, but they don't have to be sets of tuples: they could be things like counters, or maps of other kinds of lattices. Lattices are composable, and they're quite general; they generalize what we already knew from database systems, if we use them in sophisticated ways. We're inspired by functional dataflow, things like MapReduce, Spark, and Pandas. Actually, if you crack open any SQL database, there's a functional dataflow runtime inside of it, so it's no surprise that MapReduce, Spark, and Pandas look a lot like SQL systems, or even support SQL as Spark does, because really, they're just the inside of a database like Gamma. Then finally, compiler infrastructure stacks, as we talked about, things like LLVM, where we have multiple languages targeting both generalists and experts. We have a vision paper that we wrote last year, "New Directions in Cloud Programming," that lays out our agenda for what we want to do and why we think we can do it, given what's known in the field as of a year ago.
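As a small, hypothetical sketch of that composability, building on the Lattice trait and GSet from the earlier example: a map whose values are themselves lattices, merged pointwise, is itself a lattice. This is not Hydro's actual API, just an illustration of how lattices generalize relations.

use std::collections::HashMap;

/// A map lattice: keys take the union, values merge pointwise.
/// If the value type's merge is associative, commutative, and idempotent,
/// then so is this map's merge -- lattices compose.
#[derive(Clone, Default, Debug)]
struct MapLattice<V: Lattice + Clone + Default> {
    entries: HashMap<String, V>,
}

impl<V: Lattice + Clone + Default> Lattice for MapLattice<V> {
    fn merge(&mut self, other: &Self) {
        for (key, value) in &other.entries {
            self.entries.entry(key.clone()).or_default().merge(value);
        }
    }
}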

Hydro: A Programming Stack for the Cloud

This is Hydro in a block diagram. It's a programming stack for the cloud. At the top are many languages that you can program in, much like the left-hand side of the LLVM picture. In the middle is an IR that's declarative and hides the resources and the execution in the cloud, very much like an SQL-style language: it just says what you want, not how to do it. Our challenge is to compile into that declarative IR. Then we have a compiler that translates it into a dataflow program, and, in particular, into a dataflow program of lattices. In essence, we're taking the idea from CRDTs, but turning it into a composable, rich, low-level programming model that can build extensions to the ideas of things like Spark. That Hydroflow program then generates Rust code, and the Rust code goes into LLVM, which turns it into executables. Then the challenge is to deploy them and scale them up and down. That's the Hydro stack.

Initial Wins from Hydro

What I'll tell you is that we're starting to get some early wins in the stack after a year of work. The first one that's come out of the team as a paper is surprising to me, because it's the one I thought was the biggest science fiction: the idea of taking a traditional sequential program written in a language like C or C++ and translating it automatically into a language like HydroLogic. In particular, we simply translate it into a CRDT. All we're doing is taking sequential data structures and converting them into semilattices. This uses techniques called verified lifting that my colleague at Berkeley, Alvin Cheung, invented. The lead student on this, Shadaj Laddad, is just a rockstar, an amazing guy. He knocked this out so quickly, it was really impressive. I'm quite optimistic now that getting things into an IR like HydroLogic is going to be within our grasp in the next year or two.

Another piece of the puzzle is compiler optimizations for distributed programs. My student David Chu won an award for Best Student Research Pitch at the ACM SOSP competition last year when he pitched this problem. The idea is to take techniques like CALM analysis and apply them to protocols in distributed systems like Paxos. What you find is that even though Paxos is a coordination protocol, inside of it there are monotone components that can be scaled up. Even inside the classic protocol for coordination, there are things we can do underneath to scale it. The theme here is that, sure, not all programs are monotone, but most programs are mostly monotone, even programs like Paxos. The challenge is not only to replicate with CALM, but also to shard the program intelligently; he's using ideas essentially from SQL to do the sharding. Then the goal is to get compiler guarantees of correctness.

Then, finally, at the bottom of the system is this high-performance kernel for distributed execution, local data flow on local machines with network ports at the sides that communicate with low latency. In essence, it’s distributed flows of lattices. Then Rust is being used not only to generate really fast code, because it’s an LLVM language, but also to do the type checking for the distributed properties that we talked about in the previous slide. We’re beginning to make progress on the type system part. The Hydroflow runtime, however, is quite mature. I encourage you to have a look at it. It’s got a low-level language that can be used to program it in terms of data flows. We’ve got examples of programs that are written in Hydroflow today.
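The real Hydroflow surface syntax lives in the hydro-project repository; as a loose, hypothetical sketch of the underlying idea in plain Rust (explicitly not Hydroflow's API), a local flow can be pictured as folding incoming lattice values into local state and only emitting downstream when a merge actually grows that state.

use std::sync::mpsc::Receiver;

/// Hypothetical sketch, not Hydroflow's API: drain an inbox of lattice values,
/// merge each into local state, and notify downstream only on real growth.
/// Reuses the Lattice trait from the earlier example.
fn run_local_flow<L>(inbox: Receiver<L>, mut state: L, mut emit: impl FnMut(&L))
where
    L: Lattice + Clone + PartialEq,
{
    for incoming in inbox {
        let before = state.clone();
        state.merge(&incoming);
        if state != before {
            emit(&state); // e.g., push to a network port feeding other nodes
        }
    }
}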

Hydroflow Example

Here's a little example of a program that I wrote: a chat server. The slide shows the entire Hydroflow code for that server. The first six lines are really just about setting up the network ports, and all the logic of the program is in the last four lines. This model of simple dataflows, or simple lattice flows, at the bottom of the system is actually very expressive and compact. It's really cool that you can get programs like this to be so small.

Key Takeaways

Embrace state in serverless computing; don't settle for statelessness. Avoid coordination. You can do both, even though currently the vendors won't let you: we can embrace state and avoid coordination in the monotone case. Move to the 21st century: when we talk about consistency, let's stop talking about reads and writes or stores and accesses, and let's talk about the consistency of our application outputs. The centerpiece of this whole thing, foundationally, is the CALM theorem: if your program is monotone, then there is an implementation that's coordination-free and consistent; if your program is not monotone, no such implementation can exist. I showed you the example of how you can build a system where the data structures only grow, the monotone data structures, and that system will be coordination-free and will just go like crazy. That's the Anna programming model; I encourage you to look at that paper, because the way it was architected is super simple, the thing scales like crazy, and it's super-fast. Finally, the blue lollipop is the one that's not quite ready yet, but I can see it over the horizon: a new language stack for the cloud. The Hydro project at Berkeley is exploring this, and based on years of prior research, I expect we'll have really usable results coming up in the next few years. Programming the cloud, from my perspective, is a grand challenge for computing. I hope more folks get involved, kicking the tires, generating ideas, building competitors. Let's go solve this problem.

Resources

For more information, you can look at the Hydro website, hydro.run. All our code is on GitHub at hydro-project, particularly the Hydroflow kernel, which is quite mature at this point; it will probably be released as v0.1 towards the end of 2022. You can learn more about the Sky Lab at Berkeley at sky.cs.berkeley.edu. The Aqueduct serverless scalable machine learning platform is at aqueducthq.com. If you have to pick three papers, you want to read the serverless paper, "One Step Forward, Two Steps Back"; the six-page explainer paper in CACM on the CALM theorem; and the vision paper for the Hydro project. Those are the three to pay particular attention to.
