javi santana javisantana

## 1.md

      
              1 file
            
          
              0 forks
            
          
              1 comment
            
          
              0 stars
            
          
                javisantana
                / 1.md
            
            
              Created
              April 8, 2024 17:43
            
          
    aasdasd

  
## clickhouse_query_log_replicas.md

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                javisantana
                / clickhouse_query_log_replicas.md
            
            
              Last active
              February 25, 2021 10:11
            
          
    When you a clickhouse cluster and you run queries on all the replicas it's not easy to get all the queries ran. I use system.query_log all the time to check timings, errors and so on.
So what I do is create a global query_log:
:) create view query_log_all on cluster my_cluster as select * from remote('10.0.0.1,10.0.0.2', 'system.query_log')
So I can inspect queries in all the replicas with a single query:

  
## link
https://web.archive.org/web/20190312223043/http://the-witness.net/news/2012/05/the-depth-jam/

## til_clickhouse_replicas_status.md

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                javisantana
                / til_clickhouse_replicas_status.md
            
            
              Created
              December 3, 2020 10:33
            
          
    Clickhouse has a pretty good endpoint /replicas_status which gives information about the, guess what, replication status. When you are working on a cluster in which you use replication to increase the amount of QPS you usually have a load balancer before, something like this:
                                 +--------------+
                                 |              |
                       +-------->+  clickhouse  |
                       |         |              |
                       |         +--------------+
                       |


## generating_scripts_from_clickhouse.md

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              1 star
            
          
                javisantana
                / generating_scripts_from_clickhouse.md
            
            
              Last active
              February 8, 2021 13:28
            
          
    Sometimes you have to move data from one table to a different one. You usually use
insert into target select * from source
This works but have several problems:

materialized columns are not properly copied
it's slow


## union_all_and_push_down_in_clickhouse.md

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                javisantana
                / union_all_and_push_down_in_clickhouse.md
            
            
              Last active
              November 18, 2020 12:28
            
          
    Looks like filters are pushed down when filtering an "UNION ALL". Also an example on how to use EXPLAIN in clickhouse and a different view of seeing what is going on with the traces, this lines show how much data clickhouse is reading:
Selected 1 parts by date, 1 parts by key, 2 marks by primary key, 2 marks to read from 1 ranges
Reading approx. 16384 rows with 1 streams

The example


## til_dynamic_joinget.md

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                javisantana
                / til_dynamic_joinget.md
            
            
              Last active
              November 6, 2020 06:18
            
              
                how to generate a map data structure dynamically in clickhouse
              
          
    Clickhouse has a powerful feature, JOIN engines, that allows to prepare a table to be joined with better performance that a regular table (MergeTree, Log...). It also allows to use joinGet to get table values using a key.
Somtimes you don't have a JOIN table but you'd like to use something with the joinGet performance. Unfortunately you can't use joinGet with something created on the fly (well, you could create a temporally join table but you need several SQL queries).
So there is a way to do that, using transform:
with (
  select (groupArray(key), groupArray(value)) from my_table
) as key_value

  
## summing_merge_tree.sql
MacBook-Pro-de-javi.local :) create table multiple_keys (tmp Int32, testMap Nested (a Int32, bKey Int32, value Int32)) Engine=SummingMergeTree() order by (tmp);

CREATE TABLE multiple_keys
(
    `tmp` Int32,
    `testMap` Nested(
    a Int32,
    bKey Int32,
    value Int32)
)

## compile clickhouse on mojave.md

      
              2 files
            
          
              1 fork
            
          
              0 comments
            
          
              1 star
            
          
                javisantana
                / compile clickhouse on mojave.md
            
            
              Last active
              March 11, 2020 05:45
            
              
                compile clickhouse on mojave
              
          
    clickhouse on osx

These are the steps I followed to compile clickhouse on OSX mojave (10.4.3). These might not be the best way to make it compile on OSX,
my knowledge about C++ and build tooling is really limited but they do the work.
clickhouse sha-1: 2ad4df1d6a
compile command (apply the patch below before running cmake)

  
## res.sql
MacBook-Pro-de-javi.local :) select cityHash64(groupArray(cityHash64(*))) from A a asof inner join (select * from B  where ts<toDateTime('1970-01-01 02:00:00')) b on a.id=b.id and a.ts=b.ts where a.ts<toDateTime('1970-01-01 02:00:00');

SELECT cityHash64(groupArray(cityHash64(*)))
FROM A AS a
ASOF INNER JOIN
(
    SELECT *
    FROM B
    WHERE ts < toDateTime('1970-01-01 02:00:00')
) AS b ON (a.id = b.id) AND (a.ts = b.ts)
	MacBook-Pro-de-javi.local :) create table multiple_keys (tmp Int32, testMap Nested (a Int32, bKey Int32, value Int32)) Engine=SummingMergeTree() order by (tmp);

	CREATE TABLE multiple_keys
	(
	`tmp` Int32,
	`testMap` Nested(
	a Int32,
	bKey Int32,
	value Int32)
	)
	MacBook-Pro-de-javi.local :) select cityHash64(groupArray(cityHash64())) from A a asof inner join (select from B where ts<toDateTime('1970-01-01 02:00:00')) b on a.id=b.id and a.ts=b.ts where a.ts<toDateTime('1970-01-01 02:00:00');

	SELECT cityHash64(groupArray(cityHash64(*)))
	FROM A AS a
	ASOF INNER JOIN
	(
	SELECT *
	FROM B
	WHERE ts < toDateTime('1970-01-01 02:00:00')
	) AS b ON (a.id = b.id) AND (a.ts = b.ts)