Schema on Write

Summary

Schema on Write is a data validation approach where the structure and format of incoming data are strictly enforced during the ingestion process, before the data is written to storage systems. This methodology is essential for maintaining data integrity in industrial data historians and time-series databases, ensuring that only conforming data reaches critical systems used in Model Based Design and predictive maintenance applications.

Core Fundamentals

Schema on Write operates on the principle of early validation, where data must conform to a predefined schema before being accepted into the system. This approach contrasts sharply with Schema on Read, where validation occurs during data retrieval rather than ingestion.

The validation process involves several key steps:

  1. Schema Definition: Engineers define the expected structure, data types, and constraints for incoming data
  2. Ingestion Validation: Each incoming data record is checked against the schema
  3. Rejection Handling: Non-conforming data is rejected or directed to error handling systems
  4. Storage: Only validated data is written to the primary storage system
Diagram

Applications in Industrial Systems

Industrial Data Collection

In manufacturing environments, Schema on Write ensures that sensor readings from PLCs, SCADA systems, and IoT devices conform to expected formats. This is crucial for maintaining data quality in systems that monitor equipment performance, environmental conditions, and production metrics.

Process Control Systems

Control systems rely on consistent data formats for real-time decision making. Schema on Write validation prevents malformed data from corrupting control algorithms and ensures that time-series analysis functions receive properly structured inputs.

Compliance and Auditing

Regulatory requirements in industries like pharmaceuticals and aerospace demand strict data governance. Schema on Write provides the necessary validation layer to ensure data meets compliance standards from the moment of collection.

Implementation Considerations

When implementing Schema on Write in industrial systems, engineers should consider:

  1. Performance Impact: Validation adds latency to the ingestion process, which may affect high-frequency data collection scenarios
  2. Schema Evolution: Industrial systems evolve over time, requiring carefully planned schema migration strategies
  3. Error Handling: Robust mechanisms for handling validation failures, including data quarantine and alerting systems
  4. Batch vs. Stream Processing: Different validation strategies may be needed for batch uploads versus real-time streaming data

Performance Trade-offs

Schema on Write optimizes for data quality and consistency at the expense of ingestion flexibility and speed. Key considerations include:

- Write Performance: Additional validation steps increase ingestion latency

- Read Performance: Pre-validated data enables faster query execution

- Storage Efficiency: Consistent schemas allow for better compression and indexing

- Maintenance Overhead: Schema changes require careful coordination across data pipelines

Best Practices for Industrial Applications

  1. Design for Evolution: Create schemas that accommodate future sensor additions and data format changes
  2. Implement Graceful Degradation: Ensure systems can handle validation failures without complete service interruption
  3. Monitor Validation Metrics: Track rejection rates and validation performance to identify data quality issues
  4. Document Schema Changes: Maintain comprehensive documentation for schema evolution to support system maintenance
  5. Test Validation Logic: Thoroughly test validation rules with representative industrial data samples

Schema on Write represents a foundational approach to data quality management in industrial systems, providing the reliability and consistency required for mission-critical applications while requiring careful consideration of performance and flexibility trade-offs.

What’s a Rich Text element?

The rich text element allows you to create and format headings, paragraphs, blockquotes, images, and video all in one place instead of having to add and format them individually. Just double-click and easily create content.

Static and dynamic content editing

A rich text element can be used with static or dynamic content. For static content, just drop it into any page and begin editing. For dynamic content, add a rich text field to any collection and then connect a rich text element to that field in the settings panel. Voila!

How to customize formatting for each rich text

Headings, paragraphs, blockquotes, figures, images, and figure captions can all be styled after a class is added to the rich text element using the "When inside of" nested selector system.