Files

Connor 211b590b30 phase 1

2025-12-30 20:14:31 +09:00

12 KiB

Raw Blame History

Cursebreaker Unity Parser - [ ] Implementation Roadmap

Overview

This roadmap breaks down the development into 5 phases, each building on the previous. Each phase has clear deliverables and success criteria.

Phase 1: Project Foundation & YAML Parsing ✅ COMPLETED

Goal: Set up project structure and implement basic YAML parsing for Unity files

Tasks

Project Setup
- Initialize Cargo project with workspace structure
- Add core dependencies (yaml parser, serde, thiserror)
- Set up basic module structure (lib.rs, parser/, model/, error.rs)
- Configure Cargo.toml with metadata and feature flags
Error Handling
- Define error types (ParseError, ReferenceError, etc.)
- Implement Display and Error traits
- Set up Result type aliases
YAML Document Parser
- Implement Unity YAML document reader
- Parse YAML 1.1 header and Unity tags
- Split multi-document YAML files into individual documents
- Handle %TAG !u! tag:unity3d.com,2011: directive
Unity Tag Parser
- Parse Unity type tags (!u!1, !u!224, etc.)
- Extract type ID from tag
- Handle anchor IDs (&12345)
Basic Testing
- Set up test infrastructure
- Create minimal test YAML files
- Unit tests for YAML splitting and tag parsing
- Integration test: parse simple Unity file

Deliverables

✓ Working Cargo project structure
✓ YAML documents successfully split from Unity files
✓ Unity type IDs and file IDs extracted
✓ Basic error handling in place
✓ Tests passing

Success Criteria

Can read Scene01MainMenu.unity and split into individual documents
Each document has correct type ID and file ID
No panics on malformed input (returns errors)

Implementation Notes:

Created comprehensive error handling with thiserror
Implemented regex-based Unity tag parser with caching
Built YAML document splitter that handles multi-document files
Created model with UnityFile and UnityDocument structs
Added 23 passing tests (12 unit, 7 integration, 4 doc tests)
Successfully parses real Unity files from PiratePanic sample project

Phase 2: Data Model & Property Parsing

Goal: Build the core data model and parse Unity properties into structured data

Tasks

Core Data Structures
- Implement UnityDocument struct
- Implement UnityFile struct
- Create property storage (PropertyMap using IndexMap)
- Define FileID and LocalID types
Property Value Types
- Implement PropertyValue enum (Integer, Float, String, Boolean, etc.)
- Add Vector3, Color, Quaternion value types
- Add Array and nested Object support
- Implement Debug and Display for PropertyValue
Property Parser
- Parse YAML mappings into PropertyMap
- Handle nested properties (paths like m_Component[0].component)
- Parse Unity-specific formats:
  - {fileID: N} references
  - {x: 0, y: 0, z: 0} vectors
  - {r: 1, g: 1, b: 1, a: 1} colors
  - {guid: ..., type: N} external references
GameObject & Component Models
- Create specialized GameObject struct
- Create base Component trait/struct
- Add common component types (Transform, RectTransform, etc.)
- Helper methods for accessing common properties
Testing
- Unit tests for property parsing
- Test all PropertyValue variants
- Integration test: parse GameObject with components
- Snapshot tests using sample Unity files

Deliverables

✓ Complete data model implemented
✓ Properties parsed into type-safe structures
✓ GameObject and Component abstractions working
✓ All property types handled correctly

Success Criteria

Parse entire CardGrabber.prefab correctly
Extract all GameObject properties (name, components list)
Extract all Component properties with correct types
Can access nested properties programmatically

Phase 3: Reference Resolution & Unity Type System

Goal: Resolve references between objects and implement Unity's type system

Tasks

Reference Types
- Implement FileReference struct (fileID + optional GUID)
- Implement LocalReference (within-file references)
- Implement ExternalReference (cross-file GUID references)
- Add reference equality and comparison
Type ID Mapping
- Create Unity type ID → class name mapping
- Common types: GameObject(1), Transform(4), MonoBehaviour(114), etc.
- Load type mappings from data file or hardcode common ones
- Support unknown type IDs gracefully
Reference Resolution
- Implement within-file reference resolution
- Cache resolved references for performance
- Handle cyclic references safely
- Detect and report broken references
UnityProject Multi-File Support
- Implement UnityProject struct
- Load multiple Unity files into project
- Build file ID → document index
- Cross-file reference resolution (GUID-based)
Query Helpers
- Find object by file ID
- Find objects by type
- Find objects by name
- Get component from GameObject
- Follow reference chains
Testing
- Test reference resolution within single file
- Test cross-file references (scene → prefab)
- Test broken reference handling
- Test circular reference detection

Deliverables

✓ All references within files resolved correctly
✓ Type ID system working with common Unity types
✓ UnityProject can load and query multiple files
✓ Query API functional

Success Criteria

Load entire PiratePanic/Scenes/ directory
Resolve all GameObject → Component references
Resolve prefab references from scenes
Find objects by name across entire project
Handle missing references gracefully

Phase 4: Optimization & Robustness

Goal: Optimize performance and handle edge cases

Tasks

Performance Optimization
- Profile parsing performance on large files
- Implement string interning for common property names
- Optimize property access paths (cache lookups)
- Consider zero-copy parsing where possible
- Add lazy loading for large projects
Memory Optimization
- Measure memory usage on large projects
- Use Cow<str> where appropriate
- Pool allocations for common types
- Implement Drop for cleanup
- Add memory usage benchmarks
Parallel Processing
- Add optional rayon dependency
- Parallel file loading
- Parallel document parsing within files
- Thread-safe caching
Error Recovery
- Graceful degradation on parse errors
- Partial file parsing (skip invalid documents)
- Better error messages with context
- Error recovery suggestions
Edge Cases
- Handle very large files (>100MB scenes)
- Handle deeply nested properties
- Handle unusual property types
- Handle legacy Unity versions (different YAML formats)
- Handle corrupted files
Comprehensive Testing
- Parse entire PiratePanic project
- Parse various Unity project versions
- Stress tests with large files
- Fuzz testing setup (optional)
- Property-based tests

Deliverables

✓ Optimized parsing (<100ms for 10MB file)
✓ Low memory footprint (linear scaling)
✓ Parallel parsing support
✓ Robust error handling
✓ Comprehensive test suite

Success Criteria

Parse 10MB scene file in <100ms
Parse entire PiratePanic project in <1s
Memory usage < 2x file size
100% of PiratePanic files parse successfully
No panics on malformed input

Phase 5: API Polish & Documentation

Goal: Finalize public API and create excellent documentation

Tasks

API Review & Refinement
- Review all public APIs for consistency
- Add convenience methods based on common use cases
- Ensure ergonomic API design
- Add builder patterns where appropriate
- Minimize unsafe code, document when necessary
Type Safety Improvements
- Add type-safe component access methods
- Strongly-typed property getters
- Generic query API improvements
- Consider proc macros for component definitions (optional)
Documentation
- Write comprehensive rustdoc for all public items
- Add code examples to every public function
- Create module-level documentation
- Write getting started guide
- Create cookbook with common tasks
Examples
- Basic parsing example
- Query API example
- Reference resolution example
- Multi-file project example
- Performance tips example
README & Guides
- Professional README.md
- Architecture documentation
- Contributing guide
- Changelog template
- License file (Apache 2.0 or MIT)
CI/CD Setup
- GitHub Actions workflow
- Run tests on PR
- Clippy lints
- Format checking
- Code coverage reporting
- Benchmark tracking
Benchmarks
- Benchmark suite for common operations
- Track performance over time
- Document performance characteristics
- Comparison with other parsers (if any exist)

Deliverables

✓ Clean, documented public API
✓ Comprehensive rustdoc with examples
✓ README and getting started guide
✓ Working examples
✓ CI/CD pipeline

Success Criteria

Every public item has rustdoc
At least 3 working examples
CI passes on all commits
README clearly explains usage
Someone new can use library from docs alone

Phase 6: Future Enhancements (Post-v1.0)

These are potential features for future versions:

Advanced Querying

XPath-like query language for Unity objects
Filter DSL for complex searches
Object graph traversal API
Dependency analysis tools

Write Support

Modify Unity files programmatically
Create new Unity objects
Safe YAML serialization
Preserve formatting and comments

Additional Formats

.meta file parsing
TextMesh Pro asset files
Unity package manifest parsing
C# script analysis integration

Tooling

CLI tool built on library
Web service for Unity file analysis
VS Code extension for Unity file viewing
Unity Editor plugin for exporting metadata

Performance

Binary format support (legacy Unity)
Streaming API for huge files
Incremental parsing (watch mode)
Serialization/deserialization optimizations

Development Guidelines

Code Quality

Follow Rust API guidelines
Use clippy with strict lints
Maintain >80% test coverage
No unsafe unless absolutely necessary
All public APIs must be documented

Testing Philosophy

Unit test every parser component
Integration tests for full workflows
Use real Unity files from PiratePanic
Add regression tests for bugs
Benchmark critical paths

Version Strategy

Semantic versioning (SemVer)
0.x.x during development
1.0.0 when API is stable
Changelog for all versions
No breaking changes in minor versions after 1.0

Dependencies

Minimize dependency count
Use well-maintained crates only
Avoid nightly features
Keep MSRV (Minimum Supported Rust Version) reasonable
Document all feature flags

Estimated Milestones

These are rough estimates for a single developer working part-time:

Phase 1: 1-2 weeks
Phase 2: 2-3 weeks
Phase 3: 2-3 weeks
Phase 4: 1-2 weeks
Phase 5: 1-2 weeks

Total: 7-12 weeks to v1.0

Phases can overlap and tasks can be parallelized. Testing happens continuously throughout all phases.

Getting Started

To begin implementation:

Start with Phase 1, Task 1 (Project Setup)
Work through tasks sequentially within each phase
Complete all deliverables before moving to next phase
Use PiratePanic sample project for testing throughout
Iterate based on what you learn from the Unity files

Remember: Start simple, make it work, then make it fast. Focus on correctness and API design in early phases, optimization comes later.

12 KiB Raw Blame History

Cursebreaker Unity Parser - [ ] Implementation Roadmap

Overview

Phase 1: Project Foundation & YAML Parsing ✅ COMPLETED

Tasks

Deliverables

Success Criteria

Phase 2: Data Model & Property Parsing

Tasks

Deliverables

Success Criteria

Phase 3: Reference Resolution & Unity Type System

Tasks

Deliverables

Success Criteria

Phase 4: Optimization & Robustness

Tasks

Deliverables

Success Criteria

Phase 5: API Polish & Documentation

Tasks

Deliverables

Success Criteria

Phase 6: Future Enhancements (Post-v1.0)

Advanced Querying

Write Support

Additional Formats

Tooling

Performance

Development Guidelines

Code Quality

Testing Philosophy

Version Strategy

Dependencies

Estimated Milestones

Getting Started

12 KiB

Raw Blame History