Skip to content

Instantly share code, notes, and snippets.

View manish's full-sized avatar

Manish Sinha manish

View GitHub Profile
@manish
manish / manta-v1-limitations.md
Last active January 12, 2026 19:20
Manta V1 Limitations and Missing Features (from VLDB Paper)

Manta V1 Limitations and Missing Features

Updated: 2026-01-12

Architecture Limitations

The system operates as a "Single-Node Only" setup without replication, sharding, or distributed queries. Additionally, there is "No Query Admission Control" - meaning no concurrent query limits, memory budgets per query, or queueing mechanisms.

SQL Coverage Gaps

@manish
manish / cpu_hotspots.md
Created January 4, 2026 05:23
Manta Query CPU Hotspots - Profiling Results (2026-01-03)

Manta Query CPU Hotspots

Profiling run: 2026-01-03 Branch: feat/benchmark-messagepack-ingest Git SHA: 95efdca Data: 1M rows Events dataset (standard preset)


1. LOWER(UPPER(region)) GROUP BY (~106ms)

@manish
manish / benchmark_comparison.json
Last active December 30, 2025 16:29
Manta Query Optimization Benchmark Report
{
"benchmark_info": {
"tool": "ultimate_perf_test --quick",
"iterations_per_query": 20,
"metric": "TrimMean (ms)",
"dataset": {
"total_rows": 100000,
"total_columns": 106,
"core_columns": 12,
"sparse_metrics": 90
@manish
manish / benchmark_comparison.json
Created December 30, 2025 07:30
Manta Performance Benchmark: optimizations (e1e6505) vs main (c809a5f) - ~8.5% improvement
{
"benchmark_info": {
"tool": "ultimate_perf_test --quick",
"iterations_per_query": 20,
"metric": "TrimMean (ms)",
"dataset": {
"total_rows": 100000,
"total_columns": 106,
"core_columns": 12,
"sparse_metrics": 90
@manish
manish / benchmark_comparison.json
Created December 29, 2025 22:08
Manta Performance Benchmarks: main (c809a5f) vs optimizations (1ecc713) - 36.2% improvement
{
"benchmark_date": "2025-12-29T22:07:18.976341494+00:00",
"branches": {
"main": "c809a5f",
"optimizations": "1ecc713"
},
"dataset": {
"rows": 100000,
"columns": 106,
"min_cols_per_row": 16,
@manish
manish / breezy-pondering-ocean.md
Created December 29, 2025 09:06
Manta Analytics Database Optimization Plan - Research-backed recommendations with multi-model AI consensus

Manta Optimization Plan: Cutting-Edge Analytics Database

Executive Summary

This plan outlines research-backed optimizations to make Manta a cutting-edge analytics database. Based on extensive research of academic papers, industry blog posts, and multi-model AI consensus (Gemini-3-Pro + GPT-5.2), we recommend a prioritized roadmap focusing on "skip work" optimizations that outperform micro-optimizations.

Current State: Manta achieves 7-8M rows/sec with sparse columnar storage, time-partitioned slabs, HyperLogLog, T-Digest, and SIMD_LANES=4.

Target State: 2-10x performance improvement through zone maps, lazy materialization, and adaptive compression.

@manish
manish / swift-wobbling-peacock.md
Created December 24, 2025 12:25
Manta: In-Memory Analytics Database - Implementation Plan

Manta: In-Memory Analytics Database

Fast, graceful, glides through data — inspired by Facebook Scuba


EXECUTIVE SUMMARY

What We're Building

A Scuba-inspired, single-node, in-memory analytics database in Rust. Optimized for high-throughput event ingestion and fast aggregation queries.

@manish
manish / slack-app-registration.md
Created September 13, 2025 20:06
AskMarkAI - Slack App Registration Instructions

AskMarkAI - Slack App Registration Instructions

This guide walks through registering a Slack app for the AskMarkAI integration.

1. Create the Slack App

  • Go to https://api.slack.com/apps
  • Click "Create New App"
  • Choose "From scratch"
  • App Name: AskMarkAI (or your preferred name)
  • Pick a workspace to develop your app in
@manish
manish / askmarkai-upload-workflow-trace.md
Last active September 10, 2025 19:31
Complete Upload Workflow Trace Analysis for AskMarkAI - Text Content, File Attachments, and URL Processing

🔍 Complete Upload Workflow Trace Analysis - AskMarkAI

Analysis Date: $(date)
System: AskMarkAI Document Upload & Processing Pipeline
Scope: Text Content, File Attachments, and URL Processing workflows

Executive Summary

This comprehensive trace analysis reveals how AskMarkAI handles three distinct document upload types through a unified, event-driven architecture. All upload paths converge at a single document processor triggered by S3 events, ensuring consistent processing and vector embedding generation.

Deletion Workflow Analysis for AskMarkAI Document System

Overview

This analysis examines the deletion workflow for three content types in the AskMarkAI document system:

  1. Text Content - Direct text input
  2. File Attachments - PDF/DOC files uploaded via presigned URLs
  3. URL Processing - HTML content extracted from web URLs

Complete Deletion Flow