Databricks read from CSV

%py
path = ""  # path to the CSV (left blank in the original note)
df = spark.read.csv(path, header=True)
df.cache()                                # keep the data in memory for reuse
df.createOrReplaceTempView("csv_data")    # expose the DataFrame to SQL
display(df)

Postgres enums

ERROR when adding a NOT NULL enum column to a table that already has rows: column "my_new_column" contains null values

CREATE TYPE my_enum AS ENUM ('value1', 'value2');
ALTER TABLE some_table ADD COLUMN my_new_column my_enum NOT NULL;
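
The existing rows have no value for the new column, so the NOT NULL constraint fails immediately. A workaround sketch (the default 'value1' is just an example):

ALTER TABLE some_table ADD COLUMN my_new_column my_enum NOT NULL DEFAULT 'value1';

-- or add it nullable, backfill, then tighten the constraint:
ALTER TABLE some_table ADD COLUMN my_new_column my_enum;
UPDATE some_table SET my_new_column = 'value1';
ALTER TABLE some_table ALTER COLUMN my_new_column SET NOT NULL;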

Databricks Errors

ExecutionException: org.apache.spark.SparkException: Exception thrown in awaitResult:
  • Reading the Delta log returned a 403 Forbidden - check permissions.
  • The same read worked from a different cluster.
  • The data was simply not permitted to be accessed from the failing cluster.

Running Airflow locally

SQLite: no such table: job

sqlite3.OperationalError: no such table: job

[SQL: INSERT INTO job (dag_id, state, job_type, start_date, end_date, latest_heartbeat, executor_class, hostname, unixname) VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?)]
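
This usually means the Airflow metadata database was never initialized. Assuming Airflow 2.x, a likely fix is:

airflow db init   # creates the metadata tables (job, dag, task_instance, ...)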

Spark DDL

comments

alter table some_database.some_table alter column reason comment 'some comment';

Unclear, but it seems the comment can't be added when the source data is already Delta and the table already exists - Delta fails with a "specified schema does not match" error.

Python context managers

Context managers are what the 'with' construct is built on:

with blah:            # blah must implement __enter__ and __exit__
    do_something()
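
A minimal sketch of writing one with contextlib (names are hypothetical):

from contextlib import contextmanager

@contextmanager
def managed_resource():
    resource = {"open": True}     # setup runs on entering the with block
    try:
        yield resource            # the value bound by "with ... as r"
    finally:
        resource["open"] = False  # teardown runs even if the block raises

with managed_resource() as r:
    print(r["open"])  # True inside the block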

Spark functions

dir(df)

['__class__',
 '__delattr__',
 '__dict__',
 '__dir__',
 '__doc__',
 ...]
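
The dunder attributes drown out the useful methods; a quick filter (a sketch):

methods = [m for m in dir(df) if not m.startswith("_")]
print(methods[:5])   # e.g. ['agg', 'alias', 'approxQuantile', 'cache', 'checkpoint']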

Spark on Databricks

A Databricks notebook comes with a Spark session already created:

print(dir())             # sc, spark, sql, sqlContext
print(type(spark))       # <class 'pyspark.sql.session.SparkSession'>
print(type(sc))          # <class 'dbruntime.spark_connection.RemoteContext'>
print(type(sql))         # <class 'method'>  Help on method sql in module pyspark.sql.context
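
So SQL can run straight away against the pre-created session, e.g. against the csv_data view registered earlier (a sketch):

result = spark.sql("SELECT COUNT(*) AS n FROM csv_data")
result.show()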

AWS Vault errors

$ aws-vault exec some_profile -- ./some_bash_script.sh
aws-vault: error: exec: exec format error

The script was missing a shebang; adding #!/bin/bash as the first line fixed it.
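
A sketch of the fixed script (contents hypothetical):

#!/bin/bash
# without the shebang above, exec can't determine the interpreter
# and fails with "exec format error"
echo "running under aws-vault credentials"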