amawta
Products/EigenWeights
Inference optimization

Inference with lower latency.

EigenWeights simplifies transformer MLP layers to speed up inference while maintaining capacity.

30%faster

EigenWeights

MLP layers represent a significant portion of compute in transformers. EigenWeights finds more efficient representations that accelerate inference without retraining.

1

30% Faster

Significantly reduces inference latency.

2

Plug & Play

Direct replacement compatible with standard transformer architectures.

3

No Retraining

Applicable to existing pre-trained models.

Live Demo

Technical demo

Try compression on demo embeddings or upload your own file.

Runnable demo with visible metrics.

Demo Note: This demo uses EigenDB vector compression technology. The results shown are specific to vector embedding compression. For model weight compression, the principles are similar but applied to different data structures.

Click to analyze 1,000 random embeddings and see compression results

Applications

Use Cases

1

High-frequency APIs

2

On-premise models

3

Real-time applications

Benchmarks

Measured results

Internal benchmarks with documented methodology.

EigenWeights Performance

Model Size
70B
8.7B
Inference Speed
3.2x
Accuracy Retention
97.5%
Head-to-Head

EigenDB compared with alternatives

Benchmarks on 384-dimensional embeddings (sentence-transformers)

24x
Compression
384D → 16D
100%
Recall@10
Precision retained
96%
Cost Savings
$600 → $24/mes
MetricFAISSVerifiedChromaVerifiedElasticsearchVerifiedWeaviateVerifiedPineconeEigenDBVerified
Compression1x1x1x1x1x24xBest result
Recall@10100%100%100%100%95%+100%
Storage Cost100%100%100%100%100%4%
Search Latency1.39ms0.56ms5.86ms1.09ms26-60ms0.04ms
Index Build0.16ms40.5ms861ms1298msmanaged0.019ms

Dataset: 500 embeddings, 384D (sentence-transformers/all-MiniLM-L12-v2). Benchmarks run on local hardware.

FAISS, Chroma, Elasticsearch, Weaviate: our benchmarks. Pinecone: official documentation data.

The main difference is compression.

In this benchmark, alternatives store all dimensions. EigenDB reduces dimensions and preserves measured recall.

Research Line

Core Research

Alongside products, we maintain theoretical and experimental work on intelligent systems.

Neural Ontology

Evaluated with EEG, mouse-neuron, and cognitive-task data.

Experimental batteries

Results recorded in internal and external tests where appropriate.

KAIROS Framework

Conceptual framework for organizing hypotheses about intelligent systems.

We combine practical tools with documented theoretical research.