Mehmet Ali "Mali" Akmanalp makmanalp

## README.md

      
Why "let's do force index on every query we have" might not be helpful

Created September 21, 2023 15:16

TLDR: well-intentioned but ultimately unhelpful IMHO. Here's why:

It's easy to make a judgement about bad query plans based on an extremely biased sample. To give you a sense of the variety of queries we have: as of today there are over 180k unique query fingerprints at HubSpot. Let's ignore the trivial ones: about 18k unique query fingerprints do > 1000 queries/sec. To be sure, query planner bugs are real, and I'm currently fairly sure we've hit one here (details later), but of the total only a minuscule fraction is /truly/ (more on this later) query planner silliness.
By contrast, humans can be quite bad at figuring out what index a query needs and will compare dismally to the above success rate if they start doing FORCE INDEX on everything manually. I mess it up often. I see smart, competent, experienced engineers mess it up quite literally every day. People have attempted to codify rules for this exhaustively - every time I scroll through that page I
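
As a rough illustration of the point about manual index hints, here is a minimal sketch in Python. It uses SQLite's `INDEXED BY` clause as a stand-in for MySQL's `FORCE INDEX`, only because SQLite ships with Python; the `events` table, its two indexes, and the data are all hypothetical. It prints the plan the planner chooses on its own next to the plan produced when a human pins the less selective index.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE events (id INTEGER PRIMARY KEY, account_id INTEGER, status TEXT)"
)
conn.execute("CREATE INDEX idx_account ON events (account_id)")
conn.execute("CREATE INDEX idx_status ON events (status)")

# Hypothetical data: account_id is selective (10 rows per value),
# status is not (only two distinct values).
conn.executemany(
    "INSERT INTO events (account_id, status) VALUES (?, ?)",
    [(i // 10, "open" if i % 2 else "closed") for i in range(5000)],
)
conn.execute("ANALYZE")  # collect statistics for the planner


def plan(sql, params):
    """Return the EXPLAIN QUERY PLAN detail text for a query."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql, params).fetchall()
    return " | ".join(row[-1] for row in rows)


where = "WHERE account_id = ? AND status = ?"
params = (42, "open")
planner_sql = f"SELECT id FROM events {where}"
# INDEXED BY is SQLite's hint; it plays the role FORCE INDEX plays in MySQL.
forced_sql = f"SELECT id FROM events INDEXED BY idx_status {where}"

planner_plan = plan(planner_sql, params)
forced_plan = plan(forced_sql, params)
planner_rows = sorted(conn.execute(planner_sql, params).fetchall())
forced_rows = sorted(conn.execute(forced_sql, params).fetchall())

print("planner:", planner_plan)
print("forced: ", forced_plan)
```

Both queries return the same rows, so a badly chosen hint does not show up as a correctness problem; it only shows up as extra work, which is part of why it is easy to get wrong.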


## gist:ddffd79bdbd75fbff5126c69eb07c1bb
ads-0-backup-1552296000-l2zs8 backup INFO 2019/03/11 19:22:01 tablet prod_iad-1360915300 still has decreasing replication lag of 208.710618394 seconds, will continue waiting
ads-0-backup-1552296000-l2zs8 backup INFO 2019/03/11 19:24:01 tablet prod_iad-1360915300 has caught up on replication
ads-0-backup-1552296000-l2zs8 backup INFO 2019/03/11 19:24:01 (prod_iad-1360915300) checking health
ads-0-backup-1552296000-l2zs8 backup INFO 2019/03/11 19:24:06 (prod_iad-1360915300) succeeded 1 of 3 healthchecks
ads-0-backup-1552296000-l2zs8 backup INFO 2019/03/11 19:24:11 (prod_iad-1360915300) succeeded 2 of 3 healthchecks
ads-0-backup-1552296000-l2zs8 backup INFO 2019/03/11 19:24:16 (prod_iad-1360915300) succeeded 3 of 3 healthchecks
ads-0-backup-1552296000-l2zs8 backup INFO 2019/03/11 19:24:16 getting replication status for replicas
ads-0-backup-1552296000-l2zs8 backup INFO 2019/03/11 19:24:16 getting replication status for master
ads-0-backup-1552296000-l2zs8 backup INFO 2019/03/11 19:24:16 comparing GTIDSets for err

## README.md

      
Orchestrator OOM investigation

Last active February 6, 2019 00:12

Summary:

When we run a rolling restart on our orchestrator statefulset, the node that was previously the master gets stuck in a crash loop.

Findings so far:

- The pod that gets stuck in a crash loop seems to be the node that used to be the master.
- Deleting the pod somehow gets it out of the crash loop.
- Using pprof, Leo tracked the crash to within the martini web framework (used by orchestrator; Profile map, problem region), while writing the response.
- More specifically, the X-Forwarded-For header in the response (which is supposed to contain the IP address of each proxy the request goes through) seems to [accumulate](https://github.com/golang/go/blob/b5be877ba4318422547068b85c673639cd843b7d/src/net/http/httputil/
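
To make the accumulation concrete, here is a toy model in Python. It is not orchestrator's or martini's code: `append_forwarded_for` is a hypothetical helper that mimics the common reverse-proxy behaviour of appending the client IP to any existing `X-Forwarded-For` value, and the loop assumes the same request keeps being forwarded again with its headers intact.

```python
def append_forwarded_for(headers, client_ip):
    """Return a copy of headers with client_ip appended to X-Forwarded-For.

    Hypothetical helper: models the usual "append, don't replace" rule
    that reverse proxies apply on each hop.
    """
    updated = dict(headers)
    prior = updated.get("X-Forwarded-For")
    updated["X-Forwarded-For"] = f"{prior}, {client_ip}" if prior else client_ip
    return updated


headers = {}
sizes = []
for hop in range(5):
    # Each pass stands in for one more trip through a proxy.
    headers = append_forwarded_for(headers, "10.0.0.1")
    sizes.append(len(headers["X-Forwarded-For"]))

print(headers["X-Forwarded-For"])
print(sizes)
```

The header grows by a fixed amount on every hop, so if something forwards requests back to itself without bound, the memory used for that one header grows without bound as well.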


## validate.py
from sqlalchemy import Column, Integer, String, DateTime, Boolean
from sqlalchemy.ext.declarative import declarative_base
from sqlalchemy import event
import datetime

Base = declarative_base()


def validate_int(instance, value, oldvalue, initiator):
    # Assigning a string to an Integer column will try to coerce it to the

## network.py
import pandas as pd
import json

def read_network(file_name, nodes_field="nodes", edges_field="edges"):
    network = None
    with open(file_name, "r") as f:
        network = json.loads(f.read())
    nodes = network[nodes_field]
    edges = network[edges_field]
    other_fields = {x:network[x] for x in network.keys()

## get_circleci_artifact.py
#!/usr/bin/env python

import requests

from ansible.module_utils.basic import AnsibleModule

import traceback
try:
    from urllib.parse import quote
except ImportError:

## data_store.py
"""
Simple filesystem organization scheme. You have:

    - Objects: A logical "thing", e.g. a document or a page, with unique IDs
    - Keys: A type of data that we're storing about the object, like the
    location of margins on a page, or the locations of each text box.
    - Files: For a specific object under a specific key, you can have multiple
    files, e.g. image files for each column in the page

Generally you might want to store data in a specific object's key:

## stata_dask.py
import dask.dataframe as dd
from dask.dataframe.utils import make_meta
from dask.delayed import delayed
import pandas as pd

from itertools import chain


def get_stata_dask_meta(file_name, meta_chunksize=10000, *args, **kwargs):
    """Load up first bit of the file for type metadata info. We have to resort

## .block
license: mit
scrolling: yes

## selfjoin.py
FourDigit = aliased(HSProduct)
TwoDigit = aliased(HSProduct)
Section = aliased(HSProduct)

product_data = db.session\
    .query(
        FourDigit.id.label("product_id"),
        FourDigit.code.label("product_code"),
        FourDigit.name_en.label("product_name"),
        Section.id.label("section_id"),