Radius & Kubernetes Interop

  • Status: SUPER pending
  • Author: Ryan Nowak (@rynowak)

Overview

To adopt Radius, we ask users to describe their application using Bicep and Radius' application model. Moving customer workloads to Radius' application model supports our strategy for the project: we're trying to evangelize a broadly-applicable cloud-native abstraction and then build multiple implementations of that abstraction. The end goal is that users who adopt Radius can seamlessly move between Kubernetes, on-premises platforms, and serverless hosting platforms provided by various clouds. Unfortunately, migrating existing applications is complicated and dangerous work. Worse, few customers actually need the kind of hosting-platform portability we're planning, and we're a long way off from building it. We believe these factors explain much of the hesitation we're seeing from customers who have experienced the problems Radius solves but don't become enthusiastic about using it.

Building a universal cloud-native abstraction is a bold and radical goal, and it results in users being shown a radically different experience compared to what they use today. However, without both the desire and the means to move applications between hosting platforms, this is not radically better for users, as evidenced by the low adoption we've seen.

This document proposes an alternative strategy focused on adoption of Radius and its concepts by 1) replacing less of the user's deployment tooling, and 2) providing quicker time-to-value by being more incremental to adopt. Users will be able to quickly benefit from features like Recipes and Connections without replacing or rewriting existing deployment assets.

While the proposal details a different experience for users, there is little proposed change to our architecture and investments in Radius other than documentation and samples. The proposal is that we delay the evangelism and messaging of a cloud-native abstraction until we have critical mass, and instead focus on messaging ease-of-adoption and time-to-value. In the meantime, it is our choice how much to continue investing in the abstraction and its associated tooling (Bicep). We would retain and continue to invest in UCP and the Radius resource providers, as these are the basis for features like Recipes, the Dashboard, and the Application Graph.

Terms and definitions

  • Application model: A syntax and schema for describing a cloud-native application that can be deployed and run by one or more hosting platforms.
  • Hosting platform: A platform for running cloud-native applications. This is more than just a container runtime; it also includes networking, storage, configuration systems, and potentially more. Examples include AWS ECS, Azure Container Apps, Kubernetes, and Google Cloud Run.
  • Cloud-native abstraction: An application model that is decoupled from any specific hosting platform. As a counter-example, the Kubernetes object model is not an abstraction: data types like Deployments, Pods, and Services are fundamental to Kubernetes itself.
  • Application graph: A feature of the Radius API that provides users with a graph describing the topology and dependencies of their deployed applications. Can be viewed at the application, environment, or whole-enterprise scope.
  • Golden templates: A design pattern where a central team maintains templates for common (blessed) architectural patterns. Application teams can use these templates with minimal knowledge of how to configure the underlying systems. A common example is a Helm chart written by platform engineers where the application team provides configuration like the container image.

Objectives

Goals

  • Users can adopt Radius trivially for existing Kubernetes applications.
  • Users can leverage existing investments in Kubernetes-specific toolchains like Helm, Flux, Argo, etc.
  • Users can benefit from valuable features like connections and recipes with minimal change and minimal effort.

Non goals

  • Changes in the overall Radius architecture.
  • Changes to existing developer experiences with Bicep and our resource types.
  • Abandoning the cloud-native abstraction or making choices that prevent us from building one in the future.

User scenarios (optional)

Developer: Onboarding an existing Kubernetes application to Radius

As an application developer at Contoso, I am responsible for the maintenance of a microservices application made up of 3 containerized services. My team is responsible for the application code (business logic & UI) as well as the Dockerfiles and Helm Charts used to deploy the containers to Kubernetes. None of us are experts on Helm and Kubernetes, but the things we're doing are simple so we can get by.

We're adopting Radius as a company so that we can automatically catalog all of our cloud assets. It's my job to onboard our services to Radius so that the architecture can be cataloged. To do this, I edit each Helm chart and add an annotation to each Kubernetes Deployment indicating that Radius should be enabled (radapp.io/enabled: 'true'). When I do this, I'm just changing metadata and no behaviors of my application change. When I merge the pull request, the metadata is updated in my deployments, which automatically triggers the creation of a Radius application and container resources that belong to that application. This updates the application graph for our whole company, and now my components are visible.

This was easy to do because it only required features I already use (labels and annotations) and did not change any behaviors of my application or the deployment tooling I use.
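
For illustration, the whole change is a single annotation in each Deployment's metadata. A minimal sketch (the name my-service is a placeholder; the complete walkthrough appears in the Example Scenario below):

apiVersion: apps/v1
kind: Deployment
metadata:
  name: my-service
  annotations:
    # Opt this Deployment in to Radius; no runtime behavior changes.
    radapp.io/enabled: 'true'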

Platform Engineer: Authoring golden templates for developers

As a platform engineer at Contoso, I am responsible for developing and documenting golden templates. Each of these templates is a Helm chart that provides a packaged version of the deployment assets for a common architecture like a website + PostgreSQL database. The application developer teams that use these templates only provide the configuration for the things they care about. When I write the templates I can ensure that operational requirements like metrics or health checks are configured correctly.

The templates that we've written so far only contain the compute, networking, and configuration resources for containers; we don't have a good way to include dependencies like databases because they aren't part of Kubernetes. Instead, we document pre-requisites: application teams pre-provision a database through our internal Backstage portal and then map the configuration for the database into a Kubernetes secret. The golden template ensures that the configuration from the secret will be mapped into the container when it gets deployed.

I learned about Radius recently, and I want to use recipes to remove this step from the workflow so it can be automated. I'm starting with the website + PostgreSQL template: instead of the manual pre-provisioning step, I want to use a recipe to create PostgreSQL and the secret. Since we're adopting Radius, I also want to make sure these templates work with the application graph.

First I ensure that all of the resources in the template have Radius enabled (radapp.io/enabled: 'true'). I can do this with annotations on the Kubernetes objects. Since my template will be used for multiple applications, I let the user configure the application name. That's all that was required to onboard the application. Next, I want to start using a recipe: I add a Recipe resource to the Helm chart and configure it to output the connection data to the same secret as before. Lastly, I add a connection between the website container and the Recipe so we can document the connection.
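
A rough sketch of what the template changes could look like. The radapp.io/application annotation and the PostgreSQL resource type shown here are illustrative assumptions (the walkthrough below only demonstrates Applications.Link/redisCaches); the connection annotation follows the recipe:<name> convention described later in this document:

apiVersion: apps/v1
kind: Deployment
metadata:
  name: website
  annotations:
    radapp.io/enabled: 'true'
    # Hypothetical annotation letting each application team choose the application name.
    radapp.io/application: {{ .Values.applicationName }}
    # Declares the connection between the website container and the recipe below.
    radapp.io/connection-db: 'recipe:db'
spec:
  # ... container spec provided by the golden template ...
---
apiVersion: radapp.io/v1
kind: Recipe
metadata:
  name: db
spec:
  # Hypothetical portable-resource type for PostgreSQL.
  type: Applications.Link/postgreSQLDatabases
  # Write the connection values to the same secret the template already maps into the container.
  secretName: 'postgresql-secret'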

Getting onboarded with Radius was very quick because it works with the tools that we're already using. This enabled us to easily remove a manual provisioning process and manual management of secrets.

Design

The proposal is to add a Kubernetes-centric configuration experience focused only on developer scenarios (the Bicep/API experience includes operator scenarios). This will enable trivial adoption of Radius for applications that use the Kubernetes application model (Deployment, Service, etc.) for configuration and common Kubernetes toolchains (Helm, Flux) for deployment. Developers who already live in Kubernetes will not need to learn Bicep, Terraform, or rad to use Radius.

Operators will continue to use the rad CLI and Bicep/Terraform for their work when interfacing with Radius. We believe that we provide exceptional value to operators by acting as a bridge between the cloud(s) and the many Kubernetes clusters they use.

The new capabilities will include configuring a Kubernetes Deployment to behave as an Applications.Core/containers resource, and creating Recipes of all types (including all portable resources and all cloud resources). The new configuration can be added to a Deployment using labels and annotations, similar to Dapr's configuration. Without an expensive migration, users can start using container features like automatic IAM, connections, and the application graph from a Kubernetes Deployment.

Users will be able to create recipes using a new CRD type (Recipe) that can be used for any type of resource that supports recipes. This will also support configuring a Kubernetes Secret as an output of the recipe, which makes interop with anything else in Kubernetes seamless.
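
A minimal sketch of the proposed Recipe resource, matching the shape used in the Example Scenario below:

apiVersion: radapp.io/v1
kind: Recipe
metadata:
  name: db
spec:
  # Any resource type that supports recipes can be used here.
  type: Applications.Link/redisCaches
  # Optional: write the recipe's connection values into a Kubernetes Secret so that
  # anything else in the cluster can consume them.
  secretName: 'redis-secret'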

Other types in the Radius application model are not important for this scenario and can be ignored for now (Volumes, SecretStores, Gateways). The Kubernetes application model already provides these features.


The new features are implemented using a Kubernetes controller that must be deployed to the cluster where applications are running. This new controller is responsible for reacting to changes to the Kubernetes objects and mirroring those changes to the Radius API. The controller also modifies the user's Deployment objects to inject connection data, similar to our current functionality.

The impact on the overall architecture is depicted in the following diagram, showing the existing Bicep flow side-by-side with the new Kubernetes flow.

flowchart LR
  u1[User 1]
  u2[User 2]
  k[Kubernetes Resources]
  c[Radius Controller]
  de[Deployment Engine]
  rp[Applications.* Resource Providers]
  u1--helm install-->k
  k--Kubernetes resources-->c
  c--Radius resources-->rp
  u2--rad deploy-->de
  de--Radius resources-->rp

Graphic: side-by-side depiction of `helm install` and `rad deploy`.

The new Radius Controller will be a client of our existing resource providers. Its function is to project its understanding of Kubernetes resources into Radius. As a result, the Kubernetes controller can inject data back into Kubernetes and also contribute to the application graph.


This architecture strongly supports the use case where Radius is hosted centrally, and a slim agent runs on each Kubernetes cluster in a federation model with the centralized control-plane.

flowchart LR
  d1[App Team 1]
  k1[Kubernetes Cluster 1]
  d2[App Team 2]
  k2[Kubernetes Cluster 2]
  d3[App Team 3]
  k3[Kubernetes Cluster 3]
  r[Radius]
  op[Operator]
  d1--deploy-->k1
  d2--deploy-->k2
  d3--deploy-->k3
  k1<--sync-->r
  k2<--sync-->r
  k3<--sync-->r
  op--uses-->dashboard
  dashboard<-->r

Example Scenario

This is a walkthrough of our existing demo application (tutorial) from the point of view of onboarding an existing Kubernetes application. The application is a TODO website that uses Redis as a database.

Starting Assumptions:

  • Redis is managed by the user to start with (probably without IaC).
  • The user manually (or with a script) copies the connection info into a Kubernetes Secret (see the sketch after this list).
  • The application is deployed using a Helm chart.
  • We can assume the Radius environment has already been configured using rad init.
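
For reference, the manually-managed secret looks something like this (the URL value is a placeholder):

apiVersion: v1
kind: Secret
metadata:
  name: redis-secret
type: Opaque
stringData:
  # Placeholder connection string, copied by hand from wherever Redis is hosted.
  url: redis://my-redis.example.com:6379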

Step 0: At the beginning

This YAML file describes the Kubernetes Deployment that will run the website. For brevity, we've omitted details of the chart that won't change in this walkthrough.

As mentioned earlier, Redis is not part of the Helm chart, just the website. Notice that the container image is configured using the embedded template language - versioning is one of the typical reasons users need Helm in the first place.

The application code reads the DB_CONNECTION environment variable. This deployment manifest expects that the Redis URL is present in the url key of the redis-secret object.

apiVersion: apps/v1
kind: Deployment
metadata:
  name: demo
spec:
  selector:
    matchLabels:
      app: demo
  template:
    metadata:
      labels:
        app: demo
    spec:
      containers:
      - name: demo
        image: {{ .Values.container.image }}
        env: 
        - name: DB_CONNECTION
          valueFrom:
            secretKeyRef:
              name: redis-secret
              key: url
        ports:
        - containerPort: 3000

Step 1: Onboard to Radius

In this step we're adding a single annotation: radapp.io/enabled: 'true'.

This makes the Deployment visible to Radius and will make it appear as part of the application graph. For this case we will infer the application name from the namespace, and the resource name from the Kubernetes object name (demo).

This is all that is needed to onboard. From this point on we're adding value and improving the user's workflow.

apiVersion: apps/v1
kind: Deployment
metadata:
  name: demo
  annotations:
    radapp.io/enabled: 'true'
spec:
  selector:
    matchLabels:
      app: demo
  template:
    metadata:
      labels:
        app: demo
    spec:
      containers:
      - name: demo
        image: {{ .Values.container.image }}
        env:
        - name: DB_CONNECTION
          valueFrom:
            secretKeyRef:
              name: redis-secret
              key: url
        ports:
        - containerPort: 3000

Step 2: Add Recipe

In this step we've added a Recipe Kubernetes object. This will use the recipe registered on the environment to provision a Redis cache, and then update the redis-secret Secret.

At this point we haven't made any changes to the application code, and only a few changes to the Helm chart, but we're now onboarded to recipes! We can remove the manual provisioning of Redis and the manual management of the secret.

apiVersion: apps/v1
kind: Deployment
metadata:
  name: demo
  annotations:
    radapp.io/enabled: 'true'
spec:
  selector:
    matchLabels:
      app: demo
  template:
    metadata:
      labels:
        app: demo
    spec:
      containers:
      - name: demo
        image: {{ .Values.container.image }}
        env: 
        - name: DB_CONNECTION
          valueFrom:
            secretKeyRef:
              name: redis-secret
              key: url
        ports:
        - containerPort: 3000
---
apiVersion: radapp.io/v1
kind: Recipe
metadata:
  name: db
spec:
  type: Applications.Link/redisCaches
  secretName: 'redis-secret'

Step 3: Using connections

In this step we're going to change the environment variable used for the Redis URL from DB_CONNECTION to CONNECTION_REDIS_URL (the Radius convention). By doing this, we can replace the manual management of the secret and its binding to environment variables with radapp.io/connection-redis: 'recipe:db', which declares a connection.

We can optionally remove radapp.io/enabled: 'true'. Declaring a connection allows us to infer that Radius should be enabled for the Deployment.

apiVersion: apps/v1
kind: Deployment
metadata:
  name: demo
  annotations:
    radapp.io/connection-redis: 'recipe:db'
spec:
  selector:
    matchLabels:
      app: demo
  template:
    metadata:
      labels:
        app: demo
    spec:
      containers:
      - name: demo
        image: {{ .Values.container.image }}
        ports:
        - containerPort: 3000
---
apiVersion: radapp.io/v1
kind: Recipe
metadata:
  name: db
spec:
  type: Applications.Link/redisCaches
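
With the connection declared, the Radius controller injects the connection data into the Deployment for us. Conceptually the result is equivalent to the mapping we previously maintained by hand, now using the Radius naming convention. A rough sketch of the injected environment (the secret name and the exact injection mechanism are implementation details of the controller):

      containers:
      - name: demo
        env:
        # Injected by the controller based on the radapp.io/connection-redis annotation.
        - name: CONNECTION_REDIS_URL
          valueFrom:
            secretKeyRef:
              # Illustrative name; this secret is managed by Radius via the 'db' recipe.
              name: db
              key: url
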

Design details

API design (if applicable)

Alternatives considered

Test plan

Security

Compatibility (optional)

Monitoring

Development plan

Open issues
