Key Concepts
Time Series Prediction
RavaPush analyzes data points arranged in chronological order (time series) and identifies patterns within them:
- Trends — Long-term directional changes
- Cyclicality — Repeating patterns
- Seasonality — Changes at fixed time intervals
- Anomalies — Data points deviating from normal patterns
Based on these patterns, the system generates probability distributions of future values and provides confidence intervals.
Zero-Shot Prediction
Due to pre-training on large-scale multi-domain data, RavaPush can be immediately applied to new time series:
- No need to train a model for each new dataset
- Can recognize common temporal patterns across domains
- Automatically improves as data accumulates
For specific applications, you can choose to fine-tune the model on your data to improve accuracy (typically 20-40% improvement).
Proactive Push
RavaPush doesn't just predict — it also proactively pushes alerts in the following situations:
- Deviation from historical patterns detected
- Predictions indicate future risks
- Preset thresholds reached
- Anomalous patterns match known risk signatures
The push system learns your response patterns to avoid excessive notifications.
Typical Applications
Tracking and Monitoring
Personal Habits
"You usually exercise twice a week; no exercise logged this week"
Resource Usage
"Server CPU usage has been rising over the past 3 days, predicted to reach threshold within 48 hours"
Customer Behavior
"Customer A order frequency dropped from 2/month to 0.5/month, churn probability 65%"
Progress and Planning
Project Estimation
"At current pace, completion requires 30 days, but deadline is 25 days. Suggest adjusting scope or extending timeline"
Capacity Planning
"At user growth rate (15% monthly average), 40% capacity expansion needed in 6 months"
Inventory Management
"At current sales rate, SKU-A123 will be sold out within 5 days"
Anomaly Detection
Quality Monitoring
"Error rate rose from 0.3% to 1.2%, exceeding normal range by 4 standard deviations"
Security Threats
"Login failure rate rose from 0.5% to 8%, possible brute force attack"
Supply Chain Risk
"Supplier A delivery delay increased from 2 days to 5 days, persisting for 3 weeks"
Data Requirements
Minimum Requirements
At least 30 data points, 100+ recommended
Relatively regular time intervals
Missing values less than 10%
Numerical univariate series
Best Conditions
Equal-interval time points
Consistent measurement standards
Clear patterns present
Minimal external interference
Not Suitable For
Fewer than 20 data points
Highly irregular or sparse
Frequently changing measurement methods
Purely random data with no patterns
Accuracy and Limitations
Factors Affecting Accuracy
Data Volume and Quality
More data = better pattern recognition. Regular data is more predictable than irregular data.
Prediction Range
Short-term (1-7 days) is usually most accurate; long-term (30+ days) has significantly increased uncertainty.
Pattern Stability
Stable patterns are easy to predict; frequently changing patterns are difficult to predict.
Main Limitations
Cannot Predict Unprecedented Events
The system is based on historical patterns. If something has never happened before, or a fundamental change occurs, predictions will fail.
If you always walk to work, the system cannot predict your commute change after buying a car.
Correlation, Not Causation
The system finds statistical correlations but doesn't understand causal relationships. Changing underlying factors may invalidate predictions.
The system knows "Task A usually takes 3 days," but doesn't know why.
Requires Sufficient Historical Data
| Data Volume | Prediction Capability |
|---|---|
| Fewer than 30 points | Unreliable |
| 30-100 points | Basic trends |
| 100+ points | Accurate pattern recognition |
| 500+ points | Fine-tuning possible |
Sensitive to Data Quality
Missing values, inconsistent intervals, and measurement method changes all reduce accuracy.
Technical Details
Model Architecture
RavaPush uses a decoder-only Transformer architecture:
- Autoregressive Generation — Predicts the next value step by step, similar to language models
- Patch-based — Processes sequence in segments (32 time points)
- Causal Attention — Only looks at past information, preventing data leakage
- Maximum Context — 2048 time points
Pre-training
The model is pre-trained on over 100 billion real-world time series data points, covering multiple industries and application scenarios, including various time granularities (hourly to quarterly), identifying common patterns across domains. This enables the model to:
- Perform zero-shot prediction on new time series
- Be more robust to noise and outliers
- Quickly adapt to new scenarios
Confidence Intervals
Each prediction includes confidence intervals (10th to 90th percentile):
- Narrow interval — High confidence, clear patterns
- Wide interval — Low confidence, high uncertainty
Confidence intervals reflect statistical uncertainty, not a guarantee of the future.
Fine-tuning (Optional)
For scenarios with sufficient data (500+ points) and special patterns, you can fine-tune the model:
- Fine-tune on top of the foundation model
- Retain generalization ability while enhancing sensitivity to specific patterns
- Accuracy typically improves by 20-40%
- Fine-tuning time: hours to one day
Best Practices
Data Collection
- Maintain consistent measurement methods and time intervals
- Record reasons for anomalous events
- Avoid long data gaps
- Accumulate as much historical data as possible
Interpreting Predictions
- Focus on confidence intervals, not just point predictions
- Compare predictions with actual results to understand system reliability
- Understand that predictions are suggestions; the final decision is yours
- Regularly update data to improve predictions
When to Trust Predictions
High Confidence
Stable historical patterns
100+ data points
Short-term prediction (1-7 days)
Low Confidence
Highly volatile data
30-50 data points
Long-term prediction (30+ days)
Comparison with Other Methods
| Strengths | Weaknesses | Best For | |
|---|---|---|---|
| vs Statistical Models (ARIMA, Prophet) | Better cross-scenario generalization No manual tuning Longer context | Weaker interpretability | Automated prediction across multiple data types |
| vs LLM | More efficient Better numerical accuracy Lower compute cost | Cannot incorporate text context | Pure numerical time series prediction |
Role in RavaAI
RavaPush is one of three core components of RavaAI:
RavaCore
Analyze current state
"What is the situation now"
RavaTimes
Project multiple scenarios
"What if we choose A/B/C"
RavaPush
Predict future trends
"What might happen next"
Synergy Example
Considering whether to accelerate project schedule:
1. RavaCore analyzes current state (remaining tasks, team configuration, historical velocity)
2. RavaPush predicts: "At current pace, 30 days to complete; deadline is 25 days; 70% delay probability"
3. RavaTimes projects options: maintain status quo, increase work hours, add staff
FAQ
How much data is needed?
Minimum 30 points to start predicting, 100+ points for reliable results, 500+ points for fine-tuning.
How far ahead can it predict?
Technically any length, but accuracy decreases over time. Short-term (1-7 days) is most accurate; long-term (30+ days) has high uncertainty.
What types of data does it handle?
Numerical univariate time series with regular intervals. Does not support text, categorical data, or multivariate prediction.
How does it handle missing data?
Forward fill, interpolation, or exclusion. Missing data exceeding 10% reduces reliability.
Do predictions update automatically?
Yes. After new data is recorded, the model recalculates patterns and updates predictions.
How is push frequency controlled?
The system uses confidence thresholds, importance assessment, frequency control, and learning your response patterns to avoid excessive notifications. Adjustable in settings.
How long does fine-tuning take?
500-1000 points takes a few hours; 1000+ points may take half a day to one day.
Privacy
- Your data is used to generate personal predictions
- Fine-tuned models serve only you
- Not shared with other users
- Can be deleted at any time