1 Round-Trip Per Job
All queue operations complete in a single FCALL via Valkey Server Functions. No Lua EVAL overhead, no multi-command scripts.
Rust NAPI core, 1-RTT Valkey Server Functions, cluster-native hash slots, workflows and DAGs. Plus built-in AI primitives - cost tracking, token streaming, human-in-the-loop, model failover, budget caps, and vector search.
Benchmarked on AWS ElastiCache Valkey 8.2 (r7g.large) with TLS:
| Concurrency | glide-mq | Leading Alternative | Delta |
|---|---|---|---|
| c=5 | 10,754 j/s | 9,866 j/s | +9% |
| c=10 | 18,218 j/s | 13,541 j/s | +35% |
| c=15 | 19,583 j/s | 14,162 j/s | +38% |
Every primitive AI orchestration needs - built into the queue, not a plugin or middleware.
| Primitive | API | What it does |
|---|---|---|
| Cost tracking | job.reportUsage() / queue.getFlowUsage() | Record model, tokens, cost, latency per job. Aggregate across entire flows. |
| Token streaming | job.stream() / queue.readStream() | Stream LLM output tokens to consumers in real time. SSE proxy endpoint included. |
| Human-in-the-loop | job.suspend() / queue.signal() | Pause for approval. Resume with a named signal and payload. Zero compute while suspended. |
| Budget caps | FlowProducer.add(flow, { budget }) | Cap total tokens or cost across all jobs in a flow. Pre-dispatch + post-completion enforcement. |
| Fallback chains | opts.fallbacks / job.currentFallback | Ordered model/provider alternatives tried automatically on failure. |
| Rate limiting | tokenLimiter + limiter | RPM (requests/min) + TPM (tokens/min) with per-queue and per-worker scopes. |
| Vector search | queue.createJobIndex() / queue.vectorSearch() | KNN similarity search over jobs via Valkey Search. Your jobs are your vector store. |
```ts
import { Worker } from 'glide-mq';

const worker = new Worker('ai', async (job) => {
  const result = await callLLM(job.data.prompt);
  // Record model, token counts, and cost for this job
  await job.reportUsage({ model: 'gpt-5.4', tokens: { input: 50, output: 200 }, costs: { total: 0.003 }, costUnit: 'usd' });
  // Push output to real-time stream consumers
  await job.stream({ type: 'token', content: result });
  return result;
}, {
  connection, // Valkey/Redis connection options, defined elsewhere
  tokenLimiter: { maxTokens: 100000, duration: 60000 }, // TPM cap: 100k tokens per 60s window
});
```

Read the full AI-Native guide | 18 runnable examples | Vercel AI SDK integration | LangChain integration
```sh
npm install glide-mq
```

Requires Node.js 20+ (also works with Bun and Deno) and Valkey 7.0+ or Redis 7.0+. For vector search, use valkey-bundle.
| Package | Description |
|---|---|
| glide-mq | Core queue library |
| @glidemq/hono | Hono middleware |
| @glidemq/fastify | Fastify plugin |
| @glidemq/nestjs | NestJS module |
| @glidemq/hapi | Hapi plugin |
| @glidemq/dashboard | Express web dashboard |