
@lauralorenz
Created April 21, 2022 03:22
cloud2!!!

Notes from cloud2 demo

  1. Install Prefect :-)
  2. Sign up for the Cloud 2.0 beta
    • prefect.io
    • Prefect 2.0 banner (or Product > Cloud 2.0)
    • Get Started
    • Sign up / Sign in
    • Make API key
  3. Initial CLI setup
    • prefect cloud login
  4. You can run a flow right now and the metadata will be stored in Cloud because of the CLI setup
  5. Configure storage
    • Create S3 bucket in your AWS account
    • prefect storage create with your AWS bucket name and API keys
    • prefect storage set-default {id} if it wasn't already your default (prefect storage ls to see all storage configs)
  6. Create a work queue
    • Make in the UI or with the CLI command: prefect work-queue create -fr KubernetesFlowRunner k8s-queue
  7. Create a deployment with prefect deployment create {yourfile}.py
  8. Deploy the agent using kubectl apply -f agent-manifest.yaml or a similar file with the name of your work queue
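Pulled together, the setup steps above amount to roughly this command sequence; everything in {braces} is a placeholder you fill in yourself:

```shell
# One-time Cloud 2.0 setup, summarized from the steps above.
prefect cloud login                       # authenticate the CLI with your API key
prefect storage create                    # point at your S3 bucket + AWS keys
prefect storage ls                        # list storage configs to find the new id
prefect storage set-default {id}          # make it the default if it isn't already
prefect work-queue create -fr KubernetesFlowRunner k8s-queue
prefect deployment create {yourfile}.py   # register the deployment
kubectl apply -f agent-manifest.yaml      # start the agent in your cluster
```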

Demo part 0

  1. Create deployment with prefect deployment create basicflow-subprocess.py
  2. Start a local agent that will pick up local jobs
    1. prefect agent start local-queue
  3. Run the deployment: prefect deployment run 'Demo/local-example'
  4. Edit the concurrency for the db tag:
    1. prefect concurrency-limit create db 1
    2. Run it again: prefect deployment run 'Demo/local-example'
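Prefect enforces the `db` concurrency limit server-side, but conceptually it behaves like a semaphore of size 1 keyed by the tag: a task holding the slot blocks every other `db`-tagged task until it finishes. A minimal pure-Python sketch of that idea (no Prefect required; all names here are illustrative):

```python
import asyncio

async def db_task(limit, active, peaks):
    # Acquiring the semaphore stands in for acquiring a 'db' concurrency slot.
    async with limit:
        active[0] += 1
        peaks.append(active[0])          # record how many tasks run at once
        await asyncio.sleep(0.01)        # stand-in for the task's real work
        active[0] -= 1

async def main():
    limit = asyncio.Semaphore(1)         # mirrors: prefect concurrency-limit create db 1
    active, peaks = [0], []
    await asyncio.gather(*(db_task(limit, active, peaks) for _ in range(5)))
    return max(peaks)                    # highest observed concurrency

print(asyncio.run(main()))  # → 1: tasks were serialized by the limit
```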

Demo part 1

  1. Create deployment with prefect deployment create basicflow-k8s.py configured with 'dev' tag.
  2. Consider two clusters’ node pools
    1. Dev - uses t1.micro node pools. Teeny tiny; really only good for one job at a time.
    2. Staging - uses something bigger (for the demo I used t3.medium). Can run multiple jobs at once.
  3. Create new work queues for ‘dev’ and ‘staging’ tags, still for the k8s flow runner
    1. Limit the ‘dev’ work queue to only one at a time
    2. Limit the ‘staging’ work queue to 10 jobs at a time
  4. Create new agents, one for each work queue in the appropriate cluster
  5. Start up multiple runs for the deployment; they will go to the dev agent.
  6. Edit the flow to change the deployment's tag to 'staging', then re-deploy and run it; it will now go to your staging agent.
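Step 6 works because each agent only pulls from its own work queue, and a queue only accepts runs matching its tag filter. A rough pure-Python sketch of that routing idea (the queue names and matching rule here are assumptions for illustration, not Prefect internals):

```python
# Hypothetical tag-filtered work queues mirroring the demo setup.
QUEUES = {
    "dev-queue": {"tags": {"dev"}, "concurrency_limit": 1},
    "staging-queue": {"tags": {"staging"}, "concurrency_limit": 10},
}

def route(run_tags):
    """Return the queues whose tag filter is satisfied by the run's tags."""
    return [name for name, q in QUEUES.items() if q["tags"] <= set(run_tags)]

print(route(["dev"]))      # a run tagged 'dev' lands on the dev agent's queue
print(route(["staging"]))  # after re-tagging, it lands on the staging queue
```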

Demo part 2

  1. Given a deployment tagged with staging from prefect deployment create basicflow-k8s.py.
  2. Set a concurrency limit on the db tag.
  3. Rerun as above.
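Condensed, Demo part 2 is three commands (the deployment name comes from the DeploymentSpec in basicflow-k8s.py):

```shell
prefect deployment create basicflow-k8s.py   # deployment is tagged 'staging'
prefect concurrency-limit create db 1        # at most one 'db'-tagged task at a time
prefect deployment run 'Demo/k8s-example'
```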
```yaml
# agent-job-access.yaml
# The SA used by your agent workload needs these k8s RBAC permissions
# in order to track the jobs it starts for KubernetesFlowRunner configured deployments
apiVersion: rbac.authorization.k8s.io/v1
kind: ClusterRole
metadata:
  name: orion-job-access-clusterrole
rules:
- apiGroups: [""]
  resources: ["pods", "pods/log"]
  verbs: ["get", "list", "watch"]
- apiGroups: ["batch", "extensions"]
  resources: ["jobs"]
  verbs: ["get", "list", "watch", "create", "update", "patch", "delete"]
---
apiVersion: rbac.authorization.k8s.io/v1
kind: ClusterRoleBinding
metadata:
  name: SA-use-orion
subjects:
- kind: ServiceAccount
  name: default # choose the k8s SAs your workloads will use
  namespace: default # choose the namespace your k8s SA is in
roleRef:
  kind: ClusterRole
  name: orion-job-access-clusterrole
  apiGroup: rbac.authorization.k8s.io
```
```yaml
# agent-manifest.yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: k8s-dev-queue
spec:
  selector:
    matchLabels:
      app: k8s-dev-queue
  replicas: 1
  template:
    metadata:
      labels:
        app: k8s-dev-queue
    spec:
      containers:
      - name: agent
        image: prefecthq/prefect:2.0b2-python3.8
        command: ["prefect", "agent", "start"]
        args: ["$(WORK_QUEUE)"]
        imagePullPolicy: "IfNotPresent"
        env:
        - name: WORK_QUEUE # you can configure this as a value directly instead if you like
          valueFrom:
            fieldRef:
              fieldPath: metadata.labels['app']
        - name: PREFECT_API_URL
          value: https://api-beta.prefect.io/api/accounts/{your account id}/workspaces/{your workspace id}
        - name: PREFECT_API_KEY
          value: {your api key}
```

AWS Deployment stuff

A few tips I collected along the way, from various docs, while setting up an EKS cluster for the cloud2 demo.

Create a cluster

There is a lot of setup for an EKS cluster, so it's easiest to use the provided eksctl utility, which automates some of it for you.

eksctl create cluster -f cluster.yaml

You can describe a cluster.yaml with modifications to the default behavior, like the following:

```yaml
# cluster.yaml
apiVersion: eksctl.io/v1alpha5
kind: ClusterConfig

metadata:
  name: dev
  region: us-west-2

nodeGroups:
  - name: dev-ng-1
    instanceType: t2.micro
    desiredCapacity: 2
```

Give yourself console access

If you want to be able to view workloads from the console, you need to give your user access. This eks-console-full-access.yaml is the most permissive of a few options provided in the AWS docs.

kubectl apply -f eks-console-full-access.yaml

kubectl edit -n kube-system configmap/aws-auth

```yaml
mapUsers: |
    - userarn: arn:aws:iam::{your account}:user/{your user}
      username: {k8s username}
      groups:
        - system:cluster-admin
        - eks-console-dashboard-restricted-access-group
```

Give your workloads permission

Your workloads need permissions to your storage location, to read flows and to write results; if you are using S3 as your storage location, you will configure this through AWS IAM.

See the AWS docs for more details about this.

Setup IAM OIDC

Check the OIDC issuer for your cluster:

aws eks describe-cluster --name {cluster} --query "cluster.identity.oidc.issuer" --output text

See if a provider has already been set:

aws iam list-open-id-connect-providers | grep {oidc issuer}

Add it if not:

eksctl utils associate-iam-oidc-provider --cluster {cluster} --approve

Bind IAM policy to k8s SA

Define a policy that grants the AWS IAM permissions your agent workload needs; as mentioned above, the policy needs at least read and write access to the storage bucket you configured against Cloud, so flows can be read and results written.

Then, you can use eksctl to create a k8s SA and bind that IAM policy to the k8s SA in one command.

eksctl create iamserviceaccount \
    --name default \
    --cluster dev \
    --role-name "orion-access" \
    --attach-policy-arn arn:aws:iam::{your account}:policy/{your policy} \
    --approve \
    --override-existing-serviceaccounts

Or annotate k8s SA

Or, you can annotate a k8s SA directly so it knows what IAM role it should use when contacting AWS services. My workloads are using the default/default SA, but if your agent workload is configured to use a different k8s SA, annotate that one.

kubectl annotate serviceaccount -n default default \
    eks.amazonaws.com/role-arn=arn:aws:iam::{your account}:role/orion-access

Give the Kubernetes flow runner other in-cluster RBAC for job runs

kubectl apply -f agent-job-access.yaml

```python
# basicflow-k8s.py
import os
from random import randint

from anyio import sleep
from prefect import flow, task, get_run_logger
from prefect.task_runners import ConcurrentTaskRunner
from prefect.deployments import DeploymentSpec
from prefect.flow_runners import KubernetesFlowRunner, KubernetesImagePullPolicy


@flow(name="Demo", task_runner=ConcurrentTaskRunner())
async def basic_flow():
    for i in range(10):
        await solo_task()
    for i in range(10):
        await concurrent_task()


@task()
async def concurrent_task():
    t = randint(1, 10)
    logger = get_run_logger()
    logger.warning(f"Sleeping for {t}")
    await sleep(t)


@task(tags=['db'])  # this tag is used to set a task concurrency limit
async def solo_task():
    t = randint(5, 15)
    logger = get_run_logger()
    logger.warning(f"Sleeping for {t}")
    await sleep(t)


DeploymentSpec(
    name="k8s-example",
    flow=basic_flow,
    tags=['staging'],  # this tag is used to filter by my work queues
    flow_runner=KubernetesFlowRunner(
        image="prefecthq/prefect:2.0b2-python3.8",
        image_pull_policy=KubernetesImagePullPolicy.ALWAYS,
        env={
            'PREFECT_API_URL': os.getenv('PREFECT_API_URL'),
            'PREFECT_API_KEY': os.getenv('PREFECT_API_KEY'),
        }
    )
)
```
```python
# basicflow-subprocess.py
from prefect import flow, task, get_run_logger
from prefect.deployments import DeploymentSpec
from prefect.flow_runners import SubprocessFlowRunner


@flow(name="Demo")
def basic_flow():
    for i in range(100):
        message()


@task(tags=['db'])  # this tag is used to set a task concurrency limit
def message():
    logger = get_run_logger()
    logger.warning("The fun is about to begin")


DeploymentSpec(
    name="local-example",
    flow=basic_flow,
    flow_runner=SubprocessFlowRunner()
)
```
```python
from prefect import flow, get_run_logger
from prefect.deployments import DeploymentSpec
from prefect.flow_runners import SubprocessFlowRunner


@flow(name="Demo")
def basic_flow():
    logger = get_run_logger()
    logger.warning("The fun is about to begin")


DeploymentSpec(
    name="basic-example",
    flow=basic_flow,
    flow_runner=SubprocessFlowRunner(),
)
```