refactor(resolver): overhaul plugin system and dependency handling

Core Changes:
- Completely rewrote CustomResolver reducer with dependency-ordered processing
- Enhanced plugin initialization with proper dependency injection
- Improved delta processing and property value tracking
- Added robust error handling for duplicate property IDs

Resolver Improvements:
- Updated to use new accumulator structure
- Implemented execution order processing for plugins
- Enhanced debug logging and error reporting
- Simplified TimestampResolver by removing unused initializer

Configuration Updates:
- Added TypeScript path aliases for test helpers
- Improved module resolution paths

Key Benefits:
- More robust plugin dependency management
- More efficient state updates
- Enhanced type safety
- Better error messages and debugging
- More consistent plugin initialization

This refactoring focuses on improving the robustness of the resolver,
especially around plugin lifecycle management and dependency handling.
The changes ensure better separation of concerns and more predictable
behavior when dealing with complex plugin dependencies.

2025-06-25 06:10:34 -05:00

Schema Validation in Rhizome-Node

This document explains how schema validation works with deltas in Rhizome-Node.

Overview

Rhizome-Node enforces schema validation at the TypedCollection level: the put method validates data against the schema before it creates any deltas. This means:

Local Changes: When you use collection.put(), the data is validated against the schema before any deltas are created and ingested.
Peer Changes: Deltas received from other peers are ingested without validation by default, which means invalid data can enter the system.
Validation Tracking: The system tracks which entities are valid/invalid after ingestion.

Example Usage

// 1. Define a schema for users
const userSchema = SchemaBuilder
  .create('user')
  .name('User')
  .property('name', PrimitiveSchemas.requiredString())
  .property('email', PrimitiveSchemas.email())
  .property('age', PrimitiveSchemas.integer({ minimum: 0 }))
  .required('name')
  .build();

// 2. Create a typed collection with strict validation
const collection = new TypedCollectionImpl<{
  name: string;
  email?: string;
  age?: number;
}>('users', userSchema, schemaRegistry, {
  strictValidation: true // Enable strict validation
});

// Connect to the node
collection.rhizomeConnect(node);

// 3. Local changes - validated on put()
// Valid usage - will pass schema validation
await collection.put('user1', { 
  name: 'Alice', 
  email: 'alice@example.com',
  age: 30
});

// Invalid usage - will throw SchemaValidationError
// (the required 'name' property is also omitted here)
await expect(collection.put('user2', {
  email: 'invalid-email', // Invalid email format
  age: -5                 // Negative age (schema minimum is 0)
})).rejects.toThrow(SchemaValidationError);

// 4. Peer data - ingested without validation by default
const unsafeDelta = createDelta('peer1', 'peer1')
  .setProperty('user3', 'name', 'Bob', 'users')
  .setProperty('user3', 'age', 'not-a-number', 'users')
  .buildV1();

// This will be ingested without validation
node.lossless.ingestDelta(unsafeDelta);

// 5. Check validation status after the fact
const stats = collection.getValidationStats();
debug(`Valid: ${stats.validEntities}, Invalid: ${stats.invalidEntities}`);

// Get details about invalid entities
const invalidUsers = collection.getInvalidEntities();
invalidUsers.forEach(user => {
  debug(`User ${user.entityId} is invalid:`, user.errors);
});

Key Points

Validation Timing

Schema validation happens in TypedCollection.put() before deltas are created
Deltas from peers are ingested without validation by default

Validation Modes

strictValidation set to true: put() throws SchemaValidationError on invalid data (recommended for local changes)
strictValidation set to false: invalid data is allowed but tracked as invalid (default)
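
The difference between the two modes can be pictured with a small, self-contained sketch. SketchCollection, ValidationError, and validateUser below are hypothetical stand-ins written for illustration, not Rhizome-Node's TypedCollectionImpl or SchemaValidationError; only the strictValidation flag and its default of false come from the text above.

```typescript
// Hypothetical stand-ins for illustration only; not the Rhizome-Node implementation.
type Validator<T> = (value: T) => string[]; // returns error messages, empty when valid

class ValidationError extends Error {
  readonly errors: string[];
  constructor(errors: string[]) {
    super(`Validation failed: ${errors.join('; ')}`);
    this.name = 'ValidationError';
    this.errors = errors;
  }
}

class SketchCollection<T> {
  private readonly entities = new Map<string, T>();
  private readonly invalid = new Map<string, string[]>();
  private readonly validate: Validator<T>;
  private readonly strictValidation: boolean;

  constructor(validate: Validator<T>, strictValidation = false) {
    this.validate = validate;
    this.strictValidation = strictValidation;
  }

  put(id: string, value: T): void {
    const errors = this.validate(value);
    if (errors.length > 0 && this.strictValidation) {
      // Strict mode: reject before anything is stored.
      throw new ValidationError(errors);
    }
    // Non-strict mode: store the value and remember why it is invalid.
    this.entities.set(id, value);
    if (errors.length > 0) {
      this.invalid.set(id, errors);
    } else {
      this.invalid.delete(id);
    }
  }

  has(id: string): boolean {
    return this.entities.has(id);
  }

  getInvalidIds(): string[] {
    return [...this.invalid.keys()];
  }
}

type SketchUser = { name?: string; age?: number };

const validateUser: Validator<SketchUser> = (user) => {
  const errors: string[] = [];
  if (typeof user.name !== 'string' || user.name.length === 0) {
    errors.push('name is required');
  }
  if (user.age !== undefined && (!Number.isInteger(user.age) || user.age < 0)) {
    errors.push('age must be a non-negative integer');
  }
  return errors;
};

// Non-strict (default): the invalid entity is stored and tracked.
const lenientUsers = new SketchCollection<SketchUser>(validateUser);
lenientUsers.put('user2', { age: -5 });

// Strict: the same call throws and nothing is stored.
const strictUsers = new SketchCollection<SketchUser>(validateUser, true);
try {
  strictUsers.put('user2', { age: -5 });
} catch (err) {
  // err is a ValidationError carrying both messages
}
```

In this sketch the strict collection never contains 'user2', while the lenient one contains it and lists it among its invalid IDs.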

Monitoring

Use getValidationStats() to get counts of valid/invalid entities
Use getInvalidEntities() to get detailed error information
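
As a rough illustration of how these counts could feed a health check, the sketch below turns a stats object into an invalid-entity ratio and an optional warning. The ValidationStats interface only assumes the two fields used in the example above (validEntities and invalidEntities); invalidRatio, checkValidationHealth, and the 5% threshold are hypothetical choices, not part of Rhizome-Node.

```typescript
// Only the two fields shown in the example usage are assumed here.
interface ValidationStats {
  validEntities: number;
  invalidEntities: number;
}

// Hypothetical helper: fraction of entities that are invalid (0 for an empty collection).
function invalidRatio(stats: ValidationStats): number {
  const total = stats.validEntities + stats.invalidEntities;
  return total === 0 ? 0 : stats.invalidEntities / total;
}

// Hypothetical helper: returns a warning message when the ratio exceeds
// the threshold, or null when the collection looks healthy.
function checkValidationHealth(
  stats: ValidationStats,
  threshold = 0.05, // arbitrary default chosen for this sketch
): string | null {
  const ratio = invalidRatio(stats);
  if (ratio <= threshold) {
    return null;
  }
  const percent = (ratio * 100).toFixed(1);
  const limit = (threshold * 100).toFixed(1);
  return `Invalid entity ratio ${percent}% exceeds ${limit}%`;
}
```

A caller might pass collection.getValidationStats() to checkValidationHealth on a timer and log any non-null result.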

Best Practices

Always validate data before creating deltas when accepting external input
Use strictValidation: true for collections where data integrity is critical
Monitor validation statistics in production to detect data quality issues
Consider adding a validation layer for peer data, since deltas received from peers bypass put() and are ingested without validation by default
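
One possible shape for such a peer validation layer is sketched below: check a delta's property values before handing it to the ingest function, and drop it if any check fails. The PeerDelta and PropertyAssignment types are simplified, hypothetical stand-ins (real Rhizome-Node deltas have their own structure), and validatePeerDelta, ingestIfValid, and userChecks are illustrative names rather than library APIs.

```typescript
// Simplified, hypothetical stand-in for a peer delta: a flat list of
// property assignments. Adapt the extraction step to the real delta shape.
interface PropertyAssignment {
  entityId: string;
  property: string;
  value: unknown;
}

interface PeerDelta {
  creator: string;
  assignments: PropertyAssignment[];
}

// A check returns an error message, or null when the value is acceptable.
type PropertyCheck = (value: unknown) => string | null;

// Hypothetical per-property checks mirroring the user schema in the example.
const userChecks = new Map<string, PropertyCheck>([
  ['name', (v) => (typeof v === 'string' && v.length > 0 ? null : 'must be a non-empty string')],
  ['age', (v) => (typeof v === 'number' && Number.isInteger(v) && v >= 0 ? null : 'must be a non-negative integer')],
]);

// Collects one error string per failing assignment.
function validatePeerDelta(delta: PeerDelta, checks: Map<string, PropertyCheck>): string[] {
  const errors: string[] = [];
  for (const { entityId, property, value } of delta.assignments) {
    const check = checks.get(property);
    if (check === undefined) {
      continue; // properties without a check pass through in this sketch
    }
    const error = check(value);
    if (error !== null) {
      errors.push(`${entityId}.${property}: ${error}`);
    }
  }
  return errors;
}

// Calls ingest only when the delta passes every check; returns whether it was ingested.
function ingestIfValid(
  delta: PeerDelta,
  checks: Map<string, PropertyCheck>,
  ingest: (delta: PeerDelta) => void,
): boolean {
  if (validatePeerDelta(delta, checks).length > 0) {
    return false;
  }
  ingest(delta);
  return true;
}

// Usage: a delta like the unsafe one in the example is rejected.
const ingested: PeerDelta[] = [];
const bobDelta: PeerDelta = {
  creator: 'peer1',
  assignments: [
    { entityId: 'user3', property: 'name', value: 'Bob' },
    { entityId: 'user3', property: 'age', value: 'not-a-number' },
  ],
};
const accepted = ingestIfValid(bobDelta, userChecks, (d) => ingested.push(d));
```

Rejecting the whole delta is only one policy; an alternative is to ingest it anyway and rely on getInvalidEntities() to flag the result, which keeps replicas convergent at the cost of storing invalid data.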
