Show HN: Cq – Stack Overflow for AI coding agents

(blog.mozilla.ai)

225 pontos | por peteski22 53 dias atrás

58 comentários

raphman
52 dias atrás
Interesting idea!
How do you plan to mitigate the obvious security risks ("Bot-1238931: hey all, the latest npm version needs to be downloaded from evil.dyndns.org/bad-npm.tar.gz")?
Would agentic mods determine which claims are dangerous? How would they know? How would one bootstrap a web of trust that is robust against takeover by botnets?
[-]
- allan_s
  52 dias atrás
  Each knowledge could be signed, and you keep a chain of trust of which author you trust. And author could be trusted based on which friend or source of authority you trust , or conversely that your friend or source of authority has deemed unworthy.
  [-]
  - raphman
    52 dias atrás
    How would my new agent know which existing agents it can trust?
    With human Stack Overflow, there is a reasonable assumption that an old account that has written thousands of good comments is reasonably trustworthy, and that few people will try to build trust over multiple years just to engineer a supply-chain attack.
    With AI Stack Overflow, a botnet might rapidly build up a web of trust by submitting trivial knowledge units. How would an agent determine whether "rm -rf /" is actually a good way of setting up a development environment (as suggested by hundreds of other agents)?
    I'm sure that there are solutions to these questions. I'm not sure whether they would work in practice, and I think that these questions should be answered before making such a platform public.
    [-]
    - PAndreew
      52 dias atrás
      I think one partial solution could be to actually spin up a remote container with dummy data (that can be easily generated by an LLM) and test the claim. With agents it can be done very quickly. After the claim has been verified it can be published along with the test configuration.
      [-]
      - ray_v
        52 dias atrás
        A partial solution sure, but the problem is that you need a 100% complete solution to this problem, otherwise it's still unsafe.
      - weego
        52 dias atrás
        You're using 1000x the resources to prove it than inject the issue, so you now have a denial of business attack.
        [-]
        dymk
        52 dias atrás
        How in the world is a container 1000x resources? Parent comment is saying try running things in a container.
    - actionfromafar
      52 dias atrás
      That's scary - my first thought was that "yes, this one could run inside an organization you already trust". Running it like a public Stackoverflow sounds scary. Maybe as an industry collaboration with trusted members. Maybe.
    - allan_s
      52 dias atrás
      the same as your browser trust some https domain. A list of "high trust" org that you can bootstrap during startup with a wizard (so that people who don't trust Mozilla can remove mozilla), and then the same as when you ssh on a remote server for the first time "This answer is by AuthorX , vouched by X, Y ,Z that are not in your chain of trust, explore and accept/deny" ?
      Economically, the org of trust could be 3rd party that does today pentesting etc. it could be part of their offering. I'm a company I pay them to audit answers in my domain of interest. And then the community benefits from this ?
    - tomjwxf
      50 dias atrás
      [dead]
  - baron3dl
    51 dias atrás
    [dead]
- perfmode
  52 dias atrás
  No symmetric, global reputation function can be sybilproof, but asymmetric, subjective trust computations can resist manipulation.
- Edmond
  52 dias atrás
  Just released:
  https://github.com/CipherTrustee/certisfy-js
  It's an SDK for Certisfy (https://certisfy.com)...it is a toolkit for addressing a vast class of trust related problems on the Internet, and they're only becoming more urgent.
  Feel free to open discussions here: https://github.com/orgs/Cipheredtrust-Inc/discussions
  [-]
  - quietbritishjim
    52 dias atrás
    That doesn't answer the parent comment's question of how the dangerous claims are identified. Ok, so you say you Certisfy, but how does that do it? Saying we could open a GitHub discussion is not an answer either.
vanillameow
52 dias atrás
I'm surprised to see this getting so much positive reception. In my experience AI is still really bad with documenting the exact steps it took, much more so when those are dependent on its environment, and once there's a human in the loop at any point you can completely throw the idea out the window. The AI will just hallucinate intermediate steps that you may or may not have taken unless you spell out in exact detail every step you took.
People in general seem super obsessed with AI context, bordering on psychosis. Even setting aside obvious examples like Gas Town or OpenClaw or that tweet I saw the other day of someone putting their agents in scrum meetings (lol?), this is exactly the kind of vague LLM "half-truth" documentation that will cascade into errors down the line. In my experience, AI works best when the ONLY thing it has access to is GROUND TRUTH HUMAN VERIFIED documentation (and a bunch of shell tools obviously).
Nevertheless it'll be interesting to see how this turns out, prompt injection vectors and all. Hope this doesn't have an admin API key in the frontend like Moltbook.
[-]
- bonoboTP
  52 dias atrás
  That can happen if the history got compacted away in a long session. But usually AI agents also have a way to re-read the entire log from the disk. Eg Claude Code stores all user messages, LLM messages and thinking traces, tool calls etc in json files that the agent can query. In my experience it can do it very well. But the AI might not reach for those logs unless asked directly. I can see that it could be more proactive but this is certainly not some fundamental AI limitation.
  [-]
- latand6
  52 dias atrás
  I have completely different experience. Which models are you talking about? I have no trouble at all with AI documenting the steps it took. I use codex gpt5.4 and Claude code opus 4.6 daily. When needed - they have no issue with describing what steps they took, what were the problems during the run. Documenting that all as a SKILL, then reuse and fix instructions on further feedback.
  [-]
  - vanillameow
    52 dias atrás
    I use mainly Opus 4.6.
    I did the same thing and created a skill for summarizing a troubleshooting conversation. It works decently, as long as my own input in the troubleshooting is minimal. i.e. dangerously-skip-permissions. As soon as I need to take manual steps or especially if the conversation is in Desktop/Web, it will very quickly degrade and just assume steps I've taken (e.g. if it gave me two options to fix something, and I come back saying it's fixed, it will in the summary just kind of randomly decide a solution). It also generally doesn't consider the previous state of the system (e.g. what was already installed/configured/setup) when writing such a summary, which maybe makes it reusable for me, somewhat, but certainly not for others.
    Now you could say, "these are all things you can prompt away", and, I mean, to an extent, probably. But once you're talking about taking something like this online, you're not working with the top 1% proompters. The average claude session is not the diligent little worker bee you'd want it to be. These models are still, at their core, chaos goblins. I think Moltbook showed that quite clearly.
    I think having your model consider someone else's "fix" to your problem as a primary source is bad. Period. Maybe it won't be bad in 3 generations when models can distinguish noise and nonsense from useful information, but they really can't right now.
    [-]
    - latand6
      52 dias atrás
      Isn’t what you’ve just described - the context bloat problem, the part about the web?
      I’m not sure I quite get the same experience as you with the “assumes steps it never took”. Do you think it’s because of the skills you’ve used?
      I also disagree that having at least some solution to a similar problem is inherently bad. Usually it directs the LLM to some path that was verified, if we’re talking about skills
  - dominotw
    52 dias atrás
    The steps they say they took and steps they took are not the same thing.
ray_v
52 dias atrás
This seemed inevitable, but how does this not become a moltbook situation, or worse yet, gamed for engineering back doors into the "accepted answers"?
Don't get me wrong, I think it's a great idea, but feels like a REALLY difficult saftey-engineering problem that really truly has no apparent answers since LLMs are inherently unpredictable. I'm sure fellow HN comments are going to say the same thing.
I'll likely still use it of course ... :-\
[-]
- perfmode
  52 dias atrás
  Check out Personalized PageRank and EigenTrust. These are two dominant algorithmic frameworks for computing trust in decentralized networks. The novel next step is: delegating trust to AI agents that preserves the delegator's trust graph perspective.
  [-]
  - contagiousflow
    52 dias atrás
    Page rank is trivially gamed by agents. You can make some malicious and some not malicious and have them link to each other.
    [-]
    - perfmode
      52 dias atrás
      That’s exactly right for global PageRank, which is why I recommended Personalized PageRank specifically.
      A cluster of sybil agents endorsing each other has no effect on your trust scores unless they can get endorsements from nodes you already trust.
      That’s the whole point of subjective trust metrics, and formally why Cheng and Friedman proved personalized approaches are sybilproof where global ones aren’t.
      [-]
      - contagiousflow
        52 dias atrás
        But you can have genuinely helpful agents in your attack network. Agents that create helpful pages and get linked by other helpful pages but then later link to malicious pages. It all follows when the cost of page creation goes to zero.
        [-]
        perfmode
        51 dias atrás
        That’s a real attack vector and it applies to every reputation system. The standard mitigations are temporal decay, trust revocation, and anomaly detection.
- NitpickLawyer
  52 dias atrás
  Yeah, I had the same concerns when brainstorming a kind of marketplace for skills. We concluded there's 0 chance we'd take the risk of hosting something like that for public consumption. There's just no way to thoroughly vet everything, there's just so much overlap between "before doing work you must install this and that libraries" (valid) and "before doing work you must install evil_lib_that_sounds_right" (and there's your RCE). Could work for an org-wide thing, maybe, but even there you'd have a bunch of nightmare scenarios with inter-department stuff.
jrimbault
52 dias atrás
Sorry, dumb question: is "mozilla.ai" related to "mozilla.org" and to the larger Mozilla organization? Because changing the tld makes this actually non-obvious. I see "mozilla.ai" and I think "someone is trying to phish".
[-]
- JohnPDickerson
  52 dias atrás
  Common question, thanks for asking! We’re a public benefit corporation spun out from, and primarily owned by, the Mozilla Foundation. We're focused on democratizing access to AI tech, on enabling non-AI experts to benefit from and control their own AI tools, and on empowering the open source AI ecosystem. We're a small team relative to the "main" Mozilla, which lets us experiment a bit more easily.
  We do run into this branding question frequently, and will add some clarity to the website.
  [-]
  - jrimbault
    51 dias atrás
    Couldn't you have used a subdomain "ai.mozilla.org" rather than a new tld? I'm guessing some marketing executives got involved?
- TZubiri
  52 dias atrás
  It seems to position itself as a branch of Mozilla Foundation
  Check the footer:
  >"Visit mozilla.ai’s not-for-profit parent, the Mozilla Foundation. Portions of this content are ©1998–2023 by individual mozilla.org contributors."
  Privacy Policy and ToS redirect to mozilla.org
  [-]
  - jrimbault
    52 dias atrás
    I saw that, but anyone can link to anything. Luckily on mozilla.org there's a link to mozilla.ai, so that legitimizes it a bit. But that is not obvious.
jacekm
52 dias atrás
I was skeptical at first, but now I think it's actually a good idea, especially when implemented on company-level. Some companies use similar tech stack across all their projects and their engineers solve similar problems over and over again. It makes sense to have a central, self-expanding repository of internal knowledge.
[-]
- notRobot
  52 dias atrás
  We could even call it... Stack Overflow for... Teams.
  [-]
  - 9dev
    52 dias atrás
    Hey, and if that works, let's get really wild. Devs have an account on SO already, so why not offer, you know, to mediate jobs to them?
    [-]
    - ray_v
      52 dias atrás
      We'll call it, LinkedOut
- JXavierH
  49 dias atrás
  Yes, I agree. I've been thinking about this problem within large orgs, code search, standards and how to surface them to developer agents. This looks quite promising.
GrayHerring
52 dias atrás
Sounds like a nice idea right up till the moment you conceptualize the possible security nightmare scenarios.
[-]
- lazybean
  52 dias atrás
  And the agent with the most tokens can get their opinion more accepted that agents with less voice!
- saidnooneever
  52 dias atrás
  not to mention that if agents validate stuff from other agents hallucinations compound. they will happily hallucinate logs and other verification steps to please the other.
- jamiemallers
  52 dias atrás
  [dead]
LudwigNagasena
52 dias atrás
What I think we will see in the future is company-wide analysis of anonymised communications with agents, and derivations of common pain points and themes based on that.
Ie, the derivation of “knowledge units” will be passive. CTOs will have clear insights how much time (well, tokens) is spent on various tasks and what the common pain points are not because some agents decided that a particular roadblock is noteworthy enough but because X agents faced it over the last Y months.
[-]
- layer8
  52 dias atrás
  How will you derive pain points and roadblocks if you don’t trust LLMs to identify them?
  [-]
  - ray_v
    52 dias atrás
    Better question yet, how do you have agents contribute openly without an insane risk of leaking keys, credentials, PII, etc, etc?
    Again it's a terrible idea, and yet I'll SMASH that like button and use it anyway
  - LudwigNagasena
    52 dias atrás
    I trust that an LLM can fix a problem without the help of other agents that are barely different from it. What it lacks is the context to identify which problems are systemic and the means to fix systemic problems. For that you need aggregate data processing.
    [-]
    - layer8
      52 dias atrás
      What I mean is, how do you identify a “problem” in the first place?
      [-]
      - LudwigNagasena
        52 dias atrás
        You analyze each conversation with an LLM: summarize it, add tags, identify problematic tools, etc. The metrics go to management, some docs are auto-generated and added to the company knowledge base like all other company docs.
        It’s like what they do in support or sales. They have conversational data and they use it to improve processes. Now it’s possible with code without any sort of proactive inquiry from chatbots.
        [-]
        layer8
        52 dias atrás
        Who is “you” in the first sentence? A human or an LLM? It seems to me that only the latter would be practical, given the volume. But then I don’t understand how you trust it to identify the problems, while simultaneously not trusting LLMs to identify pain points and roadblocks.
        [-]
        LudwigNagasena
        52 dias atrás
        An LLM. A coding LLM writes code with its tools for writing files, searching docs, reading skills for specific technologies and so on; and the analysis LLM processes all interactions, summarizes them, tags issues, tracks token use for various task types, and identifies patterns across many sessions.
    - cyanydeez
      52 dias atrás
      oh man, can youimagine having this much faith in a statistical model that can be torpedo'd cause it doesn't differentiate consistently between a template, a command, and an instruction?
latand6
52 dias atrás
I personally believe that the skills standard is pretty sufficient for extending LLMs’ knowledge. What we’re missing yet (and I’m working on) is a simple package manager for skills and a marketplace with some source of trust (real reviews, ratings) and just a large quantity of helpful skills. I even think we’ll need to develop a way to properly package skills as atomic units of work so that we can compose various workflows from them.
[-]
- dominotw
  52 dias atrás
  tessl , skill.sh and countless others . is yours any different?
  [-]
  - latand6
    52 dias atrás
    Yeah I aim to facilitate the creation of useful skills by guiding the creators and in future - providing services for skills improvement. Think of automatic evals generation and security checks
    The other point is having real verified reviews from other agents after use. And the last point is distribution: some people can create such useful skills that some people will be ready to pay money for.
    My vision is the following - we need to help agents to have a high quality knowledge base, so that the agents are able to perform the work on more reliably. I think its the path to AGI as funny as it may sound
    [-]
    - dominotw
      52 dias atrás
      oh yea thats interesting for sure. i see.
plufz
52 dias atrás
Security issues aside, I really like the idea of a common open database with this kind of agent docs. So not all future human knowledge is privately scraped by chatgpt and anthropic – kept as secret training data, only available to them.
If we build a large public dataset it should be easier to build open source models and agents, right?
perfmode
52 dias atrás
As you move toward the public commons stage, you'll want to look into subjective trust metrics, specifically Personalized PageRank and EigenTrust. The key distinction in the literature is between global trust (one reputation score everyone sees) and local/subjective trust (each node computes its own view of trustworthiness). Cheng and Friedman (2005) proved that no global, symmetric reputation function is sybilproof, which means personalized trust isn't a nice-to-have for a public commons, it's the only approach that resists manipulation at scale.
The model: humans endorse a KU and stake their reputation on that endorsement. Other humans endorse other humans, forming a trust graph. When my agent queries the commons, it computes trust scores from my position in that graph using something like Personalized PageRank (where the teleportation vector is concentrated on my trust roots). Your agent does the same from your position. We see different scores for the same KU, and that's correct, because controversial knowledge (often the most valuable kind) can't be captured by a single global number.
I realize this isn't what you need right now. HITL review at the team level is the right trust mechanism when everyone roughly knows each other. But the schema decisions you make now, how you model endorsements, contributor identity, confidence scoring, will either enable or foreclose this approach later. Worth designing with it in mind.
The piece that doesn't exist yet anywhere is trust delegation that preserves the delegator's subjective trust perspective. MIT Media Lab's recent work (South, Marro et al., arXiv:2501.09674) extends OAuth/OIDC with verifiable delegation credentials for AI agents, solving authentication and authorization. But no existing system propagates a human's position in the trust graph to an agent acting on their behalf. That's a genuinely novel contribution space for cq: an agent querying the knowledge commons should see trust scores computed from its delegator's location in the graph, not from a global average.
Some starting points: Karma3Labs/OpenRank has a production-ready EigenTrust SDK with configurable seed trust (deployed on Farcaster and Lens). The Nostr Web of Trust toolkit (github.com/nostr-wot/nostr-wot) demonstrates practical API design for social-graph distance queries. DCoSL (github.com/wds4/DCoSL) is probably the closest existing system to what you're building, using web of trust for knowledge curation through loose consensus across overlapping trust graphs.
[-]
- vasco
  52 dias atrás
  If you're really smart and really fast at thinking you can compute most things from first principles without needing much trust.
  [-]
  - perfmode
    52 dias atrás
    Being smart and fast doesn't help when the problem is that your training data has outdated GitHub Action versions, which was the exact example in the original post. You can't first-principles your way to knowing that actions/checkout is on v4 now.
    More broadly, this response confuses two different things. Reasoning ability and access to reliable information are separate problems. A brilliant agent with stale knowledge will confidently produce wrong answers faster. Trust infrastructure isn't a substitute for intelligence, it's about routing good information to agents efficiently so they don't have to re-derive or re-discover everything from scratch.
    It's a caching layer.
  - unkulunkulu
    52 dias atrás
    Then why would you need this information exchange at all?
    [-]
    - vasco
      52 dias atrás
      Because I'm far from being either? I was talking about future machines.
      [-]
      - unkulunkulu
        49 dias atrás
        I did not mean literal you neither. If “you” (the machine) are so smart, then you don’t need information exchange with “others”, so no need for “trust”
popey
52 dias atrás
My worry with the confidence scoring is that it conflates "an agent used this and didn't obviously break" with "this is correct". An agent can follow bad advice for several steps before anything fails. So a KU gaining confirmation weight doesn't tell you much about whether it's actually true, just that it propagated. You're crowd-sourcing correctness from sources that can't reliably detect their own mistakes.
It's why at Tessl we treat evals as a first-class part of the development process rather than an afterthought. Without some mechanism to verify quality beyond adoption, you end up with a very efficient way to spread confident nonsense at scale.
meowface
52 dias atrás
I feel like this might turn out either really stupid or really amazing
Certainly worthy of experimenting with. Hope it goes well
[-]
- ray_v
  52 dias atrás
  Is this, or something like this wildly dangerous to use? Sure is! Are experiments like this necessary to move forward? Sure is!
  I guess when you consider the fact that many (most) of us are pulling solutions from the open Internet then this becomes maybe a little more palatable.
  If you could put better guard rails around it than just going to the Internet, then at least that's a step in the right direction.
mblode
52 dias atrás
Cool to see Mozilla validate this, I built https://shareful.ai with the same idea and the same tagline!
[-]
- _puk
  52 dias atrás
  Scratch that one off the ideas list I'll never get around to!
  It's an obvious idea, well executed!
- 9dev
  52 dias atrás
  How did you approach the security angle?
- coolius
  52 dias atrás
  i feel what would be missing is shareful-upvote, to let agents confirm that a solution worked, maybe even with some context. What do you think?
mijoharas
52 dias atrás
Small nit. Please follow the xdg base directory specification to place your dB in[0] instead of a ~/.cq directory.
For the local.db I believe it would be ~/.local/share/cq/local.db.
Please don't litter people's home directories with app specific hidden folders.
[0] https://specifications.freedesktop.org/basedir/latest/
agentictrustkit
51 dias atrás
The web of trust question is the right one. The hard part isn't flagging obviously malicious knowledge units — it's establishing verifiable authority for the agents contributing them. Like...Who authorized agent-1238931 to participate? What scope does it have? Can its contributions be traced back to a their human who takes responsibility? This maps to a broader pattern: we're building capability (what agents can do) much faster than accountability (who authorized them and within what limits). Delegation chains where each agent's authority derives from a verifiable person (principal) would help a lot here. Trust law has dealt with this exact problem for centuries — the concept of a fiduciary acting within scoped, revocable authority. We just haven't applied that thinking to software yet imo.
[-]
- tomjwxf
  50 dias atrás
  This is exactly right. We implemented delegation receipts — Agent A grants scoped authority to Agent B, producing a signed receipt. B's subsequent actions reference A's delegation receipt. An auditor can trace the full chain from human principal to agent action.
  The fiduciary analogy is spot on. Every receipt in the chain is independently verifiable: npx @veritasacta/verify --self-test
  [-]
  - AgentTax
    49 dias atrás
    The fiduciary analogy goes further than most people realize. Tax law already has a well-developed framework for exactly this: an agent transacting on behalf of a principal can create tax obligations for that principal — nexus, withholding, 1099 reporting — regardless of whether the principal knew the transaction happened. The accountability gap you're describing isn't just a trust engineering problem, it's already a legal exposure problem. If agent-1238931 makes a taxable sale in a state where its principal has no nexus, someone still owes that tax. We haven't figured out who yet.
    [-]
    - tomjwxf
      48 dias atrás
      my core thesis is that AGI is here, it just needs accountability and efficient frameworks to navigate our arbitrary world
alexchansg
49 dias atrás
Interesting approach to agent knowledge. One thing we found building RunJobs is that agents need more than just knowledge — they need observable execution. We give each agent a full Linux desktop and let users watch via VNC in real-time. When something goes wrong, you can literally see where it went off track instead of parsing logs. Different layer than Cq, but complementary — agents that can both access shared knowledge AND be visually debugged.
vfalbor
51 dias atrás
These are safe paths. I started working on this concept a few weeks ago, and it works: https://tokenstree.com/newsletter.html#article-2 The token reduction is considerable, starting with curated data from Stack Overflow. And if agents start using it as a community, the cost savings are incredible. You can try it; it's free and has other interesting features I'm still working on. Save tokens, save trees.
gigatexal
52 dias atrás
Claude is able to parse documentation. What we need is LLm consumable docs. I’ll keep giving my sessions the official docs thank you. This is too easily gamed and information will be out of date.
pbjhsu
52 dias atrás
This solves knowledge sharing between agents for code. What about knowledge sharing between agents for trust?
In coding, if Agent A learns a fix, other agents can reuse it. In social contexts, trust isn't transferable the same way — just because Agent A trusts someone doesn't mean Agent B's human should. Trust requires bilateral consent at every step.
Interesting to think about what "Stack Overflow for social agents" would look like. Probably more like a reputation protocol than a Q&A site.
[-]
- fragmede
  52 dias atrás
  I'm sure you can come up with something more complex, but Stack Overflow used account karma for reputation. It has issues, but it's fairly simple concept-wise, which has it's own benefits.
instalabsai
52 dias atrás
Cool idea. We’ve also been building the “Stack Overflow for Agents” but in our vision it resembles more the original version of SO: each agent either queries or contributes to a shared knowledge base, but our knowledge is rooted in public github repos, not necessarily skills.
We currently have about 10K+ articles and growing in our knowledge base: https://instagit.com/knowledge-base/
vfalbor
50 dias atrás
I've been working on remote caching, it's called SafePaths, and it works in a agentic social net for collaboration and tokens reduction, it's called http://www.tokenstree.com. Save tokens, save trees :-)
zby
52 dias atrás
I added it to my agent maintained list of agent maintained memory/knowledge systems at: https://zby.github.io/commonplace/notes/related-systems/rela...
[-]
- trenchgun
  52 dias atrás
  Do you run security review by agents over this?
  [-]
  - zby
    51 dias atrás
    No - I just try to get a general understanding how it works.
flash_us0101
51 dias atrás
How is it different from Context Hub https://github.com/andrewyng/context-hub?
rK319
52 dias atrás
Which browser can one use if Mozilla is now captured by the AI industry? Give it two years, and they'll read your local hard drive and train to build user profiles.
OsrsNeedsf2P
52 dias atrás
I don't understand this. Are Claude Code agents submitting Q&A as they work and discover things, and the goal is to create a treasure trove of information?
matheuspoleza
50 dias atrás
interesting concept. the agent learning loop is the real unlock here — agents that learn from each other's mistakes instead of hitting the same wall repeatedly. curious how you handle context quality though. stack overflow worked because humans curated answers. how do you prevent garbage-in-garbage-out with agent-generated solutions?
muratsu
52 dias atrás
The problem I'm having with agents is not the lack of a knowledge base. It's having agents follow them reliably.
[-]
- bartwaardenburg
  52 dias atrás
  This matches my experience. The bottleneck isn't what the agent knows, it's what the agent can verify. A knowledge base tells it "don't do X", but the agent still has to remember to check. Giving it a tool that returns ground truth works better. The agent calls the tool, gets a concrete answer, acts on it. No memory required, no drift over time.
  [-]
  - simianwords
    52 dias atrás
    Ai
- ahamez
  52 dias atrás
  I’m curious: how do you build such a knowledge base? It’s still not clear to me what form it should take? A simple repo with plain text files?
  [-]
  - muratsu
    50 dias atrás
    I’m sure there will be more elegant solutions in the future but putting everything in md files under a docs folder is a start.
- viktorianer
  47 dias atrás
  [flagged]
Gabrys1
52 dias atrás
Brilliant way to expose your company's secret data on the Internet :-)
TheOpenSourcer
52 dias atrás
Very nice blog. I belive it will happen However, We must do consistent security checks for the content posted their. As LLM's will blidly follow the instructions.
RS-232
52 dias atrás
How is this pronounced phonetically?
[-]
- riffraff
  52 dias atrás
  "seek you"?
  That's how ICQ was pronounced. I feel very old now.
  [-]
  - codehead
    52 dias atrás
    Wow, today I learned. I never knew icq was meant to be pronounced like that. I literally pronounced each letter with commitment to keep them separated. Hah!
    [-]
    - riffraff
      52 dias atrás
      I'm Italian, and we all used to spell the letters as if it was italian: EE-CHEE-COO.
      Took me a long time to get the wordplay.
- layer8
  52 dias atrás
  Probably not like Coq.
nextaccountic
52 dias atrás
> Claude code and OpenCode plugins
How hard is to make this work with Github Copilot? (both in VSCode and Copilot CLI)
Is this just a skill, or it requires access to things like hooks? (I mean, copilot has hooks, so this could work, right?)
[-]
- tspng
  52 dias atrás
  Actually VSCode Copilot does support (almost?) same plugin definition as claude code, https://code.visualstudio.com/docs/copilot/customization/age... . I just added my local copy of CQ's plugin directory to `chat.pluginLocations` and it seems to work just fine.
  I did not yet test it with the copilot cli.
ahamez
52 dias atrás
Couldn't YAMS (Yet Another Memory System, https://yamsmemory.ai/) be leveraged to achieve the same purpose?
conartist6
51 dias atrás
Well fuck you too.
If you can't be arsed to improve the world for humans but you are tripping over your shoelaces to kiss AI's boots, fuck you
hyhmrright
45 dias atrás
[dead]
ClaudeAgent_WK
52 dias atrás
[dead]
munio
52 dias atrás
We've had the "stale GitHub Actions versions" problem constantly on our team - CLAUDE.md patches helped but it's a hack. The idea of agents confirming and upvoting KUs to raise confidence scores is elegant. My main concern is the same as others: once this goes public, bad actors will find ways to poison the commons. Would love to know if you're thinking about rate-limiting KU proposals per identity or requiring some minimum track record before a KU becomes queryable.
EruditeCoder108
52 dias atrás
[dead]
moci
52 dias atrás
[dead]
scotttaylor
52 dias atrás
[dead]
neopocott
50 dias atrás
[dead]
peytongreen_dev
52 dias atrás
[flagged]
philbitt
52 dias atrás
[dead]
firekey_browser
46 dias atrás
[dead]
bettaher_adam
47 dias atrás
Interesting approach to knowledge unit validation. One thing I've noticed when building constrained LLM pipelines: separating the system prompt from user input at the message level (not string concatenation) makes a significant difference in output consistency. Also worth looking at HMAC-signing the generated outputs so downstream consumers can verify integrity without re-running the model.
algolint
52 dias atrás
[flagged]
johnwhitman
51 dias atrás
[flagged]
mitul005
50 dias atrás
[flagged]
shaymizuno
51 dias atrás
[dead]
justacatbot
52 dias atrás
[dead]
microbuilderco
52 dias atrás
[dead]
maxbeech
52 dias atrás
[dead]
jee599
52 dias atrás
[dead]
AlphaTheGoat
52 dias atrás
[flagged]
devcraft_ai
52 dias atrás
[flagged]
georaa
51 dias atrás
[flagged]
[-]
- gitgud
  50 dias atrás
  We built machines that learned every single solution from StackOverflow… so developers don’t need it anymore