(Pre S ig?: This is a Sia contained project, since there were doubts about the cross-chain integration, and I was told there would be a meeting next week)
Introduction
Project Name:
ProofChain: Verifiable AI Training with Sia
Name of the organization or individual submitting the proposal:
Hridyansh (Independent Researcher & Builder)
Describe your project.
ProofChain is a cryptographic provenance framework that enables AI developers to prove their models were trained on specific datasets. The system uses dataset hashing, Merkle commitments, and decentralized storage on Sia to anchor training data in a tamper‑proof, user‑owned environment. Training attestations (dataset hash, code hash, model weights hash) are then published as verifiable certificates, ensuring transparency and compliance without exposing sensitive data.
The MVP will demonstrate:
- Dataset commitment and storage on Sia.
- Training attestation generation.
- A public registry of provenance certificates stored on Sia.
How does the projected outcome serve the Foundation’s mission of user-owned data?
This project directly advances the mission of user‑owned data by:
- Ensuring datasets remain under the control of their owners, encrypted and sharded across Sia’s decentralized network.
- Preventing centralized platforms from monopolizing AI provenance by anchoring proofs in a user‑owned, decentralized storage layer.
- Empowering developers and organizations to prove compliance and transparency without ceding control of their data to third parties.
By making Sia the backbone of AI provenance, we extend its role from decentralized storage to decentralized trust infrastructure.
Are you a resident of any jurisdiction on that list?
No
Will your payment bank account be located in any jurisdiction on that list?
No
Grant Specifics
Amount of money requested: $9,500
Budget Breakdown
| Category | Amount (USD) | Description & Justification | Month |
|---|---|---|---|
| Development & Infrastructure | $5,000 | Covers developer time (coding, integration, documentation) and purchase of a dedicated server. The server will handle dataset hashing, Merkle tree generation, and computationally heavy attestations, while also serving as a long‑term asset for future Sia‑integrated products. | 1st |
| Security & Code Review | $2,000 | Lightweight audit of cryptographic routines and Sia integration to ensure correctness, reliability, and credibility of the MVP. | 2nd |
| Documentation & Demo Materials | $1,500 | Preparation of technical documentation, open‑source repository setup, and creation of demo materials (e.g., walkthrough video, usage guides) to support adoption and transparency. | 3rd |
| Infrastructure & Miscellaneous | $1,000 | Sia storage contracts, bandwidth, test datasets, and incidental project costs. | 3rd |
| Total | $9,500 | — |
Goals & Timeline
General Timeline for Completion (3 months):
- Month 1
- Implement dataset hashing + Merkle root generation
- Integrate Sia API for dataset commitment storage
- Test retrieval and verification
- Month 2
- Build training attestation module
- Dataset hash
- Code hash
- Model weights hash
- Store attestations on Sia as provenance certificates
- Build training attestation module
- Month 3
- Develop lightweight verification tool (CLI)
- Prepare documentation and open-source release
- Deliver demo materials (walkthrough video, usage guide)
Potential Risks
- Performance constraints: Large datasets may increase hashing/storage costs.
- Adoption barrier: Enterprises may hesitate to adopt new provenance standards.
- Technical complexity: Integrating cryptographic proofs with ML workflows requires careful design.
Mitigation: Start with lightweight commitments + Sia anchoring, then expand to advanced proofs (ZKPs, TEEs) in later phases.
Development Information
Will all of your project’s code be open-source?
Yes. All code will be released under an MIT or Apache 2.0 license.
Link where code will be accessible for review:
Gravity-3d/ProofChain: A cryptographic proof provider for AI companies
Do you agree to submit monthly progress reports?
Yes.
Cloud vs. Dedicated Server Cost Comparison
| Option | Monthly Cost | 12-Month Cost | Notes |
|---|---|---|---|
| Cloud (1× A100 GPU instance) | ~$2,500 | ~$30,000 | Cheapest “serious” GPU instance, but still 5× more expensive than owning hardware. |
| Cloud (8× A100 GPU cluster) | ~$24,000 | ~$288,000 | Typical for large AI training; far beyond grant scope. |
| Dedicated Server | ~$100 (power) | ~$6,200 | One‑time ~$5k purchase + electricity. Fully owned, reusable asset. |
Conclusion:
A $5,000 dedicated server saves ~$25,000 over a year compared to the cheapest cloud GPU option, while aligning with Sia’s mission of user‑owned infrastructure.
Contact Info
Email: [email protected]
Other preferred contact methods: Discord: @x73d