Hardware / Cloud Requirements
Requirements for a single instance/VM
For simplicity and cost-effectiveness, you can run FormX.ai on a single server/instance; However, a single server cannot train new extractors; it can only run pre-built extractors or extractors trained on the FormX.ai SaaS platform.
Minimum Specification:
- 16 Core Intel CPU (each core must be at least 2.6Ghz or faster)
 - 32GB RAM
 - 120GB SSD
 - GPU is not required
 
Requirements for a cluster
FormX.ai deploys on the Kubernetes cluster by default. Here are the minimum specifications:
Purposes  | Number of Instances  | Minimal Specification  | 
|---|---|---|
API / Extraction Workers  | 3 VMs (minimal for Kubernetes)  | 8 vCPU  | 
Database (PostgreSQL) (Using managed PostgreSQL is recommended)  | 
  | 4 vCPU  | 
Self-Hosted OCR 
  | 
  | 8 vCPU  | 
Self-Hosted and Fine-Tune LLM Model 
  | 
  | 4 vCPU GPUs, one of: 
  | 
ML Workers for dataset generation 
  | 3 VMs  | 8 vCPU  | 
ML Trainer for model training 
  | 1 VM (more for parallel training)  | 8 vCPU  | 
Storage 
  | 10GBs+ (depends on image size)  | 
Cloud Resources Inventories
For a typical Cloud Deployment, here are the list of Cloud Resources Required:
Inventory  | Purposes  | Related Cloud Products  | 
|---|---|---|
Kubernetes  | Run the applications, workers, trainers  | GCP GKE  | 
Database  | Store the configs, audit logs, temporarily result for async requests  | GCP Cloud SQL for PostgreSQL  | 
Image Storage  | Storage of the images for training (optional)  | Google Cloud Storage  | 
OCR  | OCR  | Google Vision API  | 
Other Software Components  | Redis: Cache authentication tokens  | Using some pods on the k8s cluster  | 
Updated about 2 months ago