« BackZclaw – The 888 KiB Assistantzclaw.devSubmitted by kristianpaul 4 days ago
  • tehsauce 2 days ago

    “888 KiB Assistant” but the assistant itself is a multi terabyte rental-only model stored in some mysterious data center.

    • seertaak a day ago

      The whole point is that this fits on an ESP32, which has wifi. We're not quite at the point where it makes sense to run the whole thing locally - if you do try it, it will need a fan, and be loud etc.

      For my part, I installed Nanoclaw on my Arch derived OS (I love Arch!), and it worked fine until the next day some update decided to revert the power management settings, and now my glorious assistant is dead.

      There's something to be said for a barebones OS. No bullshit, no updates.

      Also, playing with hardware watchdog timers and GPIOs and DACs can be so much fun.

      • amelius 2 days ago

        I'm getting "serverless" flashbacks.

        • pgt 19 hours ago

          modelless

          • stuaxo 19 hours ago

            This thread reminds me how Javas heavy GUI written in Java itself was called "lightweight" when in fact it did not feel lightweight at all on the hardware of the time.

        • kristianpaul 2 days ago

          My model is at home... just 16Gb still a lot but just FYI

          • Rebelgecko 2 days ago

            It seems to support connecting to your own LLM on the same LAN

            • croes a day ago

              The point is the agent is still the LLM. No LLM, no agent.

              • otterley a day ago

                LLMs are not agents. LLMs are language models that simply respond to a text prompt with a textual response. Agents are middleware that take input from the user and then use LLMs to drive tools.

                • croes a day ago

                  They are just a to-do list. The real work is done by the LLMs

                  • otterley a day ago

                    An LLM has no motive power, like a script without an a cast, or a program without a computer to execute it.

              • dheera a day ago

                I tried connecting OpenClaw to ollama with a V100 running qwen3.5:35b but it was really, really, really slow (despite ollama itself feeling fairly fast).

                These "claw" agents really multiply the tokens used by an obscenely huge factor for the same request.

                • jcgrillo a day ago

                  i recently decided to get into this ocean boiling game too, the 32GB V100 seems like a pretty good VRAM/$. if i may ask, do you make any special accommodations for cooling? i've never dealt with a passively cooled card before and i'm curious whether my workstation fans (HP Z840) will be sufficient. i'm going to try 2 cards at first but i think i might be able to squeeze a third in there

                  • dheera a day ago

                    No. I have a Titan V CEO edition, which is basically a 32GB V100 but has full active fan cooling which I'm finding works just fine.

                    • jcgrillo a day ago

                      Oh very cool. Some folks are printing shrouds for dual 40mm fans so I'll probably try that if the stock case fans don't do it

            • 0xbadcafebee 2 days ago

              For people who don't get this: it's a Home Assistant type thing. You don't do inference on it, you send it a message on Telegram and it does things with physical things through GPIO. You could use a $140 Raspberry Pi with 8GB RAM and host a local model on it plugged into 30W AC power... or you could use a $10 ESP32 which can run for weeks on a tiny battery, and your existing Wifi connection with a cheap cloud model (cloud models are as cheap as $0.02/1MTokens). This makes it easier to ramp up on new ESP32 projects. You can just tell it to do things / give you info, rather than having to write code.

              • Bridged7756 a day ago

                I wouldn't trust LLMs with anything, even low stakes things like handling my notes. I fail to see how people willingly give keys to the kingdom, why, asides from just FOMO/trend chasing.

                • newswasboring a day ago

                  The model can be taken care of in cloud but hardware also depends upon what we want to do right? If we want to run some lightweight python scripts etc, we cant use ESP32 right?

                  • 0xbadcafebee a day ago

                    The point is not to write python scripts at all. You just tell it to do something and it does it, no programming. It comes with tools the AI can use. https://zclaw.dev/use-cases.html

                    (you can write MicroPython for an ESP32, but that's not what this project is for)

                  • jcgrillo a day ago

                    dope so i can hook it up to my appliances, hvac, etc? maybe it'll do rad things with my smart doorlocks, remote car starter. can it send my self-driving waymobile to pick up my family at the airport too? such wow

                  • amelius 2 days ago

                    Me: "GPIO 5 can be active for a maximum of 100ms, then it needs to cool down for at least 1s. Otherwise the MOSFET is fried."

                    Zclaw: "GPIO 5 is active now, however the server is not responding so I'm awaiting further instructions."

                    • gas9S9zw3P9c 2 days ago

                      I fail to understand why 888 KiB matters if it's just a wrapper around a cloud api.

                      • ramon156 2 days ago

                        Have you seen OpenClaw's codebase? 680.000 LOC.

                        I care how big it is.

                        • Rebelgecko 2 days ago

                          A lot of the *claws emphasize binary size and lines of code. I think for better or worse people treat codebase size as a proxy for "how much of the project is unsupervised, unmaintainable, buggy AI slop?"

                          • mihaelm 2 days ago

                            Because of resource-constrained environments, the primary deployment target seem to be microcontrollers. You can get ESP32 boards for pretty cheap.

                            • boznz a day ago

                              8 is lucky number in China

                              • renewiltord 2 days ago

                                Because it means you can run it on an ESP32 which is a low power microprocessor package.

                                • undefined 2 days ago
                                  [deleted]
                              • Retr0id 2 days ago

                                888KiB is quite large, but I see they're including the whole rest of the firmware in that number, fair enough. Their actual application code weighs only 35,742 bytes, compiled.

                                • wartywhoa23 2 days ago

                                  Still lightyears from a one-bit AI assistant. Send 1 to save the humankind, 0 to exterminate. And hurry up because it's in undefined state right now!

                                  • cwoodyard 2 days ago

                                    [dead]

                                  • hidelooktropic 2 days ago

                                    There are many concerns and areas for improvement with open claw and other similar projects (continuous loop script with broad OS access that manages your agents and interfaces with a standard messaging app)

                                    However, file size I have never seen on that list. I would rather offer for something that is even bigger in file size so it afford certain functionality like better security tighter permissions however it would do that.

                                    • mihaelm 2 days ago

                                      File size is a legit property to keep in mind if your goal is to create an agent that runs on ESP32 boards. They don't expect you to run Zclaw on Mac Mini.

                                      • allthetime 2 days ago

                                        What's the use case for running this on a tiny board? Isn't the whole point that it can use your computer for you?

                                        • undefined a day ago
                                          [deleted]
                                          • mihaelm 2 days ago

                                            For something like OpenClaw yes, but not for Zclaw. I think the naming is more about riding the current wave of Claw-related interest rather than positioning it as competition or replacement for other clawies.

                                            Zclaw is about running an agent in your embedded system.

                                            • markstos 2 days ago

                                              The examples seem to suggest it would be chatting with your home automation in natural language.

                                              • fc417fc802 2 days ago

                                                Before you know it your smart thermostat will be blogging. The joke is on everyone who thought IoT couldn't get any worse. Just imagine the new landscape of security vulnerabilities this opens up.

                                                • markstos 2 days ago

                                                  My "smart" gas stove can be turned on over the internet (if I allow it to connect)—perfect appliance to put an LLM in charge of.

                                          • stavros 2 days ago

                                            I made a secure one:

                                            https://github.com/skorokithakis/stavrobot

                                            Everything runs in containers (I run it on a server along with everything else), plugins have a permission system so eg the AI can read emails but not delete or send, etc.

                                            I really like it, I run it as my main agent and it has been extremely helpful.

                                            • croes a day ago

                                              Part of the usefulness is based on the same thing that makes it so dangerous.

                                              If it can only read but not act, it’s safer but less useful.

                                              • stavros a day ago

                                                I can't restrict OpenClaw if I don't need the extra capabilities. I can restrict this.

                                                • croes 17 hours ago

                                                  You restrict OpenClaw by not providing it certain credentials.

                                                  • stavros 17 hours ago

                                                    Again, with my design you can give it fine-grained access to parts of services, which OpenClaw itself cannot do. This is just a fact.

                                          • jesse_dot_id 2 days ago

                                            Clicked on this expecting to see a crontab file.

                                            • giancarlostoro 2 days ago

                                              The domain crashed and burned or something, hopefully this link is correct:

                                              https://github.com/tnm/zclaw

                                              • jFriedensreich 2 days ago

                                                There is the same divide starting to form that NFTs had back in the day. Tech bros instantly like if something has claw in the name, the rest of us will dismiss anything with that naming and philosophy as toxic slop culture. will be interesting to see how far this one will go.

                                              • karolcodes 2 days ago

                                                1. why

                                                • human_hack3r a day ago

                                                  Exactly, So many *-Claw releases, with only distinction is either language or memory footprint.

                                                  • downboots 18 hours ago

                                                    Crypto

                                                    • dpe82 2 days ago

                                                      2. fun

                                                      • rvz 2 days ago

                                                        2. hype