
# Longbow
Longbow is a distributed, high-performance vector cache built for modern AI/Agentic workloads. It leverages zero-copy data paths, SIMD optimizations, and advanced storage backends to deliver sub-millisecond latency.
## Key Features
- High Performance: Built on Apache Arrow for zero-copy data transfer.
- Distributed: Consistent hashing and gossip-based membership (SWIM protocol).
- Optimized Storage: Optional `io_uring` WAL backend for high-throughput ingestion.
- Hardware Aware: NUMA-aware memory allocation and SIMD vector distance calculations (see the sketch after this list).
- Smart Client: Resilient Go SDK that handles request routing transparently.
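To ground the "Hardware Aware" bullet, the scalar reference below shows the distance computation that SIMD kernels accelerate. It is an illustrative sketch, not Longbow's actual implementation:

```go
package main

import (
	"fmt"
	"math"
)

// l2Distance is a plain-Go reference implementation of Euclidean distance.
// This inner loop is the kind of hot path a SIMD implementation vectorizes.
func l2Distance(a, b []float32) float32 {
	var sum float32
	for i := range a {
		d := a[i] - b[i]
		sum += d * d
	}
	return float32(math.Sqrt(float64(sum)))
}

func main() {
	fmt.Println(l2Distance([]float32{1, 2, 3}, []float32{4, 6, 3})) // prints 5
}
```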
## Architecture
Longbow uses a shared-nothing architecture where every node is identical. Data is sharded across the cluster using consistent hashing.
See Architecture Guide for a deep dive.
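As a concrete sketch of the sharding model, the ring below hashes each node onto a circle of points and routes a key to the first point at or after the key's hash, wrapping around. Everything here (FNV hashing, the `Ring` type, 64 virtual nodes) is an illustrative assumption, not Longbow's actual implementation:

```go
package main

import (
	"fmt"
	"hash/fnv"
	"sort"
)

// Ring is a minimal consistent-hash ring for illustration only.
type Ring struct {
	points []uint32          // sorted hash positions on the ring
	owner  map[uint32]string // position -> owning node
}

func hash32(s string) uint32 {
	h := fnv.New32a()
	h.Write([]byte(s))
	return h.Sum32()
}

// NewRing places vnodes virtual points per node on the ring.
func NewRing(nodes []string, vnodes int) *Ring {
	r := &Ring{owner: make(map[uint32]string)}
	for _, n := range nodes {
		for v := 0; v < vnodes; v++ {
			p := hash32(fmt.Sprintf("%s#%d", n, v))
			r.points = append(r.points, p)
			r.owner[p] = n
		}
	}
	sort.Slice(r.points, func(i, j int) bool { return r.points[i] < r.points[j] })
	return r
}

// Locate returns the node owning key: the first ring point at or after
// the key's hash, wrapping to the start of the ring if necessary.
func (r *Ring) Locate(key string) string {
	h := hash32(key)
	i := sort.Search(len(r.points), func(i int) bool { return r.points[i] >= h })
	if i == len(r.points) {
		i = 0
	}
	return r.owner[r.points[i]]
}

func main() {
	ring := NewRing([]string{"node-a", "node-b", "node-c"}, 64)
	fmt.Println(ring.Locate("dataset/embeddings/shard-7"))
}
```

Virtual nodes smooth out the key distribution, so adding or removing a node only remaps the keys adjacent to its points rather than reshuffling the whole keyspace.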
## Getting Started
### Prerequisites
- Go 1.25+
- Linux (recommended for best performance) or macOS
### Installation
```bash
git clone https://github.com/23skdu/longbow.git
cd longbow
go build -o bin/longbow ./cmd/longbow
```
### Running a Local Cluster
```bash
./scripts/start_local_cluster.sh
```
### Running Benchmarks
Longbow includes a distributed benchmark tool:
```bash
go build -o bin/bench-tool ./cmd/bench-tool
./bin/bench-tool --mode=ingest --concurrency=4 --duration=10s
```
## Configuration
Longbow is configured via environment variables. See Configuration for details.
Notable flags:
- `STORAGE_USE_IOURING=true` (enable the new Linux storage engine)
- `LONGBOW_GOSSIP_ENABLED=true` (enable distributed discovery)
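In Go, consuming boolean flags like these usually reduces to a small helper around `os.LookupEnv`. The `envBool` function below is a hypothetical sketch, not Longbow's actual config loader:

```go
package main

import (
	"fmt"
	"os"
	"strconv"
)

// envBool reads a boolean environment variable, falling back to def when
// the variable is unset or unparsable. Illustrative helper only.
func envBool(key string, def bool) bool {
	v, ok := os.LookupEnv(key)
	if !ok {
		return def
	}
	b, err := strconv.ParseBool(v)
	if err != nil {
		return def
	}
	return b
}

func main() {
	useIOUring := envBool("STORAGE_USE_IOURING", false)
	gossip := envBool("LONGBOW_GOSSIP_ENABLED", false)
	fmt.Printf("io_uring=%v gossip=%v\n", useIOUring, gossip)
}
```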
## Features
- Protocol: Apache Arrow Flight (over gRPC/HTTP/2).
- Search: High-performance HNSW vector search with hybrid (dense + sparse) support.
- Filtering: Metadata-aware predicate filtering for searches and scans.
- Lifecycle: Support for vector deletion via tombstones.
- Durability: WAL with Apache Parquet-format snapshots.
- Storage: In-memory ephemeral storage for zero-copy, high-speed access.
- Observability: Structured JSON logging and 100+ Prometheus metrics.
## Architecture & Ports
To ensure high performance under load, Longbow splits traffic into two dedicated gRPC servers:
- Data Server (Port 3000): Handles heavy I/O operations (`DoGet`, `DoPut`, `DoExchange`).
- Meta Server (Port 3001): Handles lightweight metadata operations (`ListFlights`, `GetFlightInfo`, `DoAction`).
### Why?
Separating these concerns prevents long-running data transfer operations from blocking metadata requests. This ensures
that clients can always discover streams and check status even when the system is under heavy write/read load.
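In outline, the split is just two independent gRPC servers sharing a process, as in this simplified sketch (service registration elided; `serve` is an illustrative helper, not Longbow's actual wiring):

```go
package main

import (
	"log"
	"net"

	"google.golang.org/grpc"
)

// serve starts a gRPC server on addr and blocks until it exits.
func serve(addr string, register func(*grpc.Server)) {
	lis, err := net.Listen("tcp", addr)
	if err != nil {
		log.Fatal(err)
	}
	s := grpc.NewServer()
	register(s)
	log.Printf("listening on %s", addr)
	log.Fatal(s.Serve(lis))
}

func main() {
	// Data plane: heavy streaming RPCs (DoGet/DoPut/DoExchange).
	go serve(":3000", func(s *grpc.Server) { /* register Flight data service */ })
	// Meta plane: cheap control RPCs (ListFlights/GetFlightInfo/DoAction).
	serve(":3001", func(s *grpc.Server) { /* register Flight meta service */ })
}
```

Because each plane has its own listener, connection pool, and scheduler backlog, a saturated data stream cannot starve metadata discovery.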
## Observability & Metrics
Longbow exposes Prometheus metrics on a dedicated port to ensure observability without impacting the main Flight
service.
- Scrape Port: `9090`
- Scrape Path: `/metrics`
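Serving `/metrics` on its own port is the standard pattern with the Prometheus Go client; a minimal sketch using the stock `promhttp` handler looks like this (illustrative, not Longbow's actual exporter code):

```go
package main

import (
	"log"
	"net/http"

	"github.com/prometheus/client_golang/prometheus/promhttp"
)

func main() {
	// Serve /metrics on a dedicated port so scrapes never contend
	// with Flight traffic on ports 3000/3001.
	http.Handle("/metrics", promhttp.Handler())
	log.Fatal(http.ListenAndServe(":9090", nil))
}
```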
### Custom Metrics

#### Key Metrics
| Metric Name | Type | Description |
|-------------|------|-------------|
| `longbow_flight_operations_total` | Counter | Total number of Flight operations (DoGet, DoPut, etc.) |
| `longbow_flight_duration_seconds` | Histogram | Latency distribution of Flight operations |
| `longbow_flight_rows_processed_total` | Counter | Total rows processed in scans and searches |
| `longbow_vector_search_latency_seconds` | Histogram | Latency of k-NN search operations |
| `longbow_vector_index_size` | Gauge | Current number of vectors in the index |
| `longbow_tombstones_total` | Gauge | Number of active deleted-vector tombstones |
| `longbow_index_queue_depth` | Gauge | Depth of the asynchronous indexing queue |
| `longbow_memory_fragmentation_ratio` | Gauge | Ratio of system memory reserved vs. used |
| `longbow_wal_bytes_written_total` | Counter | Total bytes written to the WAL |
| `longbow_snapshot_duration_seconds` | Histogram | Duration of the Parquet snapshot process |
| `longbow_evictions_total` | Counter | Total number of evicted records (LRU) |
| `longbow_ipc_decode_errors_total` | Counter | Count of IPC decoding errors or panics |
For a detailed explanation of all 100+ metrics, see Metrics Documentation.
Standard Go runtime metrics are also exposed.
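To spot-check the exporter on a running node:

```bash
curl -s http://localhost:9090/metrics | grep longbow_
```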
## Usage
### Running Locally
```bash
go run ./cmd/longbow
```
### Docker
```bash
docker build -t longbow .
docker run -p 3000:3000 -p 3001:3001 -p 9090:9090 longbow
```
## Documentation