Proof of Knowledge as a user acquisition channel for Akash vLLM #656
Replies: 3 comments 6 replies
-
This is interesting concept, if I may ask and learn a bit more, how this solution compared to the OriginTrail Decentralized Knowledge Graph and Knowledge Mining approach ? Very supportive toward this idea especially on having the Grants aspect of promoting uses of Fine-tuning own LLM instance. This kind of grants will be of utmost help to many researcher and scientist that have great ideas and capability to test them, but doesn't have enough resources to get expensive GPUs on their own. Renting dedicated server or bare metals and setting up everything + maintaining the support infra on one's own is also very challenging especially for non IT/Network expert, which most scientist aren't. Looking forward to further updates on this. Would certainly love to be looped in for the use-case testing, I can submit several of our own project proposal to demonstrate what kind of fine tuning and use of PoK can be spinned out this Akash x DeSci.World tech Integration. |
Beta Was this translation helpful? Give feedback.
-
I'm happy to lend a hand in anything I can with this project. |
Beta Was this translation helpful? Give feedback.
-
Beta Was this translation helpful? Give feedback.
-
Introduction
tl;dr
PoK as a user acquisition channel for Akash usage.
Proof of Knowledge (PoK) is a protocol that faciliates collaborative, incentivised onchain knowledge creation for general deAI use. We maintain a global vector database ("Brains") of embeddings, pulled from individual "Brains", each with their own context and intelligence. The individual Brains are maintained by onchain associations, DAOs, businesses, conferences, creators... anyone that wants to store data in a way that is queryable via natural language. PoK provides the infra for this data collection and curation, a tracking and an incentive layer for the use of those data, and a global distribution mechanism via advanced RAG LLM search - all as a public good.
Benefit for Akash
We are using vLLM instances for each "Brain" and want to become an effective "middleman" for Akash, by recommending our users to spin up a vLLM model, as the primary distributed model hosting service. We are asking for a grant to help kickstart our efforts and to deepen collaborations. With the planned adoption scale of PoK, Akash could see significant usage from the PoK community and its users, each hosting an instance for their Brain when required and spinning down when they are not using it. PoK maintains an embedding profile of each Brain permanently, even when their instance is spun down.
Context
PoK in the deAI Stack
The decentralised AI stack consists of many pillars: compute, models, training, storage, agents, incentives and -- data --.
Data collection and curation difficult and messy tasks and, as such, few projects are tackling this pivotal aspect of the ecosystem. Many data exist on the internet and are scraped for AI training and retrieval but with the core issue of provenance, value distribution and importantly, ownership. These are issues that deAI promises to solve.
Optimal data for the deAI future has the following qualities:
If data can possess these four qualities, it can form a robust foundation for the deAI ecosystem, as a primary unit of knowledge for the distribution of information, coordination of people, groups and agents and accumulation of a novel type of reputation, based on the usefulness of data for RAG LLM.
PoK Rollout
Currently, PoK has been deployed in offchain, private environments for coordinating groups both online and irl at events - each a "Brain". The system is effectively embedding chatstreams, documents and specific datasets. This supports internal coordination of people and tasks and the embeddings can be queried using natural language to great effect. The next stage is to connect the individual Brains together such that users can receive information about the other Brains, to further improve knowledge sharing and coordination. Then, we connect the Brains to our onchain components that attest data units to wallet addresses and distribute points based on their usage.
Team (DeSciWorld)
PoK is a public good infrastructure deployed by DeSciWorld. DeSciWorld was first created in 2021 and has been building agnostic public goods tools for the DeSci space. We have a dashboard deployed at https://desci.world/, an extensive repository of DeSci resources and have hosted a dozen conferences around the world with the aim of educating and onboarding DeScientists.
The team consists of scientists, engineers, entrepreneurs and researchers. Team members are listed below:
Joshua Bate - Founder - Twitter: @jb87ua
At0x.eth - CTO - Twitter: @at0x_eth
Dr. Jelani Clarke, PhD - Chief Scientist Relations Officer - https://www.linkedin.com/in/jelani-clarke-ph-d-920b1080/
Carolina Menchaca, MsC - Chief Researcher - https://www.linkedin.com/in/carolina-menchaca-69723b1a/
Luis Maumejean - Head of LATAM - https://www.linkedin.com/in/lemg/
Carlos DiMatteo - Senior Engineer
Raquel Raigal - Junior Engineer
Timeline
The timeline for this project is as follows:
Open Discussions: Starting August 2024
Governance Proposal: Through September 2024
Design Phase: Ongoing as of today
First 10 Brains deployed: End of October 2024
Full mainnet deployment with permissionless interface and onchain points: December 2024
Programme end: Q1 2025
Updated referral and partnership arrangement hereafter
Note: This is subject to change based on feedback
Budget
The proposed budget is consists of [Akash costs] (url) and Team Costs: ($50,000 + $25,000)
Usage costs are to help us subsidise the initial rollout of the Brains, giving free AKT to certain users as a donation to their organisation, to host the vLLM without cost. In the future, we will stop the subsidy and future users must decide to purchase AKT to fund their own models.
Team Costs are to assist the team during this period as we are yet to raise large capital funding and want to continue building with the full team's effort.
Disbursement
The Team Costs will be disbursed immediately to the DeSciWorld Multisig at:
The Akash Costs funds will be divided into 3 tranches:
Internal use - $15,000 for the internal LLM provisioning so that we can have multiple different models for different tasks - this would be the "elder brain"
Grants - $25,000 to be distributed within 6 months of receipt to vetted and approved projects to spin up community nodes with their own LMM instance (because it can be fine tuned)
Reserve - $10,000 will be reserved in case of unexpected compute demand
Continuted Collaboration
If the proposal is successful and PoK becomes a high quality user acquisition partner for Akash, we hope to revisit a discussion into 2025 that allows for continuous subsidy of aspects of PoK, or referral revenue share to DeSciWorld, depending on results and returns for Akash.
Beta Was this translation helpful? Give feedback.
All reactions