A universal framework for training creative agents in symbolic domains
RfC (Reinforcement for Creativity) is a learning framework designed to train agents that creatively explore and generate novel solutions rather than simply predict outputs. Unlike traditional neural networks that learn input-output mappings through supervised training, RfC encourages constructive creativity through structured exploration and composite rewards.
| Learns to predict outputs | Learns to explore creatively |
| Supervised training | Training with creative incentives |
| Minimizes prediction error | Maximizes validity and novelty |
| Memorizes patterns | Discovers new patterns |
- Environment: Defines the domain and provides context for exploration
- Generator: Parameterized agent that generates candidate solutions
- Flexible Coach: Evaluator that assesses validity and novelty
- RfC Trainer: Orchestrates the creative training loop
For installation, you need to open the .ipynb file in Colab.
Traditional approach:
"Provide conjecture data and train a model to predict new ones."
RfC approach:
"Explore the space of mathematical conjectures creatively; the Coach evaluates validity and novelty to guide the agent toward maximizing both."
- Generates novel number theory conjectures
- Verifies validity using symbolic proofs
- Identifies non-trivial mathematical patterns
- Synthesizes efficient algorithms creatively
- Explores iterative, recursive, and formula-based approaches
- Evaluates correctness and computational efficiency
- Discovers new inference rules in formal logic
- Generates valid inference patterns
- Verifies logical soundness and identifies novel proof strategies
RfC is fully modular and extensible. Here's how to define your own creative domain:
Where:
- v = validity score
- η = novelty score
- C(a) = complexity penalty
RfC excels in domains where:
- ✅ Validity rules can be clearly defined
- ✅ Creative exploration is desired
- ✅ Novelty is as important as correctness
- ✅ The search space is combinatorial or symbolic
- Mathematics: Theorem and conjecture discovery
- Programming: Algorithm synthesis, code optimization
- Formal Logic: New inference rules, proof tactics
- Game Design: Novel mechanics, rule balancing
- Molecular Design: Valid structures with desired properties
- Music Composition: Novel harmonic progressions
- Architecture: Creative structural designs
No. RfC is specifically designed for constructive creativity with key differences:
- Separates domain knowledge from the agent
- Uses a deterministic Flexible Coach with structured feedback
- Explicitly incentivizes novelty, not only reward
- Action space is typically symbolic/combinatorial
Not necessarily:
- Small domains: CPU sufficient
- Simple generators: MLPs, rules, templates work well on CPU
- Large domains: GPU beneficial for Transformers/LLMs
Yes! Any generative model can be integrated:
- Small spaces (< 10⁶ states): Excellent
- Medium spaces (10⁶–10⁹): Good with efficient generators
- Very large spaces: Use hierarchies and modularity
The references are from my paper titled RfC (Reinforcement for Creativity): Universal Architecture for Adaptive Creative Agents on OSF: https://osf.io/74dxz/overview
Contributions are welcome! Areas of interest:
- New application domains
- Improvements to Coaches
- More sophisticated Generators
- Performance optimizations
Please open an issue or submit a pull request.
RfC: Not just machine learning. Automated creativity.
Made with 🧠 for creative AI
.png)
![Chinese Takeout Menu [video]](https://www.youtube.com/img/desktop/supported_browsers/chrome.png)

