Hacker News Clone

Comments

kelvinjps10Mar 27, 2026, 2:44 PM

I feel like a lot of people and companies wanted to automate the web, but most website's operators wouldn't let you and would block you. Now you put the name AI into and now you're allowed to do It.

simianwordsMar 27, 2026, 6:55 AM

I remember when I tried to set something up with the ChatGPT equivalent like "notify me only if there are traffic disruptions in my route every morning at 8am" and it would notify me every morning even if there was no disruption.

theredbeardMar 27, 2026, 6:58 AM

This is because for some reason all agentic systems think that slapping cron on it is enough, but that completely ignores decades of knowledge about prospective memory. Take a look at https://theredbeard.io/blog/the-missing-memory-type/ for a write-up on exactly that.

primer42Mar 27, 2026, 11:34 AM

“A programmer is going to the store and his wife tells him to buy a gallon of milk, and if there are eggs, buy a dozen. So the programmer goes shopping, does as she says, and returns home to show his wife what he bought. But she gets angry and asks, ‘Why’d you buy 13 gallons of milk?’ The programmer replies, ‘There were eggs!’”

You need to write a clearer prompt.

devsdaMar 27, 2026, 3:54 PM

"I need to fly to NY next weekend, make the necessary arrangement".

Your AI assistant orders an experimental jetpack from a random startup lab. Would you have honestly guessed that the prompt was "ambiguous" before you knew how the AI was going to act on it ?

jeremyjhMar 27, 2026, 12:50 PM

Did GP edit their comment? Or did you read the prompt they used somewhere else?

alexhansMar 27, 2026, 9:37 AM

Why not set your own evals and something like pi-mono for that? https://github.com/badlogic/pi-mono/

You'll define exactly what good looks like.

scottmcdotMar 27, 2026, 7:11 AM

Me too. It doesn't have ability to alert only on true positive. I has to also alert on true negative. So dumb

worldsayshiMar 27, 2026, 7:43 AM

This doesn't seem to hard to solve except for the ever so recurring llm output validation problem. If the true positive is rare you don't know if the earthquake alert system works until there's an earthquake.

g3f32rMar 27, 2026, 6:23 PM

... just force the data into a structured format, then use "hard code" on the structure.

"Generate the following JSON formatted object array representing the interruptions in my daily traffic. If no results, emit []. Send this at 8am every morning. {some schema}. Then run jsonreporter.py"

Then just let jsonreporter.py discriminate however it likes. Keep the LLMs doing what they are good at, and keep hard code doing what it's good at.

georaaMar 27, 2026, 5:08 PM

Scheduling is easy. The hard part is everything between "started" and "done" - task needs human approval at step 3, fails at step 5 (retry from 4 or from scratch?), takes 6 hours and something restarts. How do they handle tasks that span multiple inference calls? Is there checkpointing or does it start over?

javiercrMar 27, 2026, 8:13 AM

I've recently switched from GitHub Copilot Pro to Claude Code Max (20x). While Claude is clearly superior in many aspects, one area where it falls short is remote/cloud agents.

Yesterday, I spent the entire day trying to set up "Claude on the web" for an Elixir project and eventually had to give up. Their network firewall kept killing Hex/rebar3 dependency resolution, even after I selected "full" network access.

The environment setup for "on the web" is just a bash script. And when something goes wrong, you only see the tail of the log. There is currently no way to view the full log for the setup script. It's really a pain to debug.

The Copilot equivalent to "Claude on the web" is "GitHub Copilot Coding Agents," which leverages GitHub Actions infrastructure and conventions (YAML files with defined steps). Despite some of the known flaws of GitHub Actions, it felt significantly more robust.

"Schedule task on the web" is based on the same infrastructure and conventions as "Claude on the web", so I'm afraid I'm gonna have the same troubles if I want to use this.

iBelieveMar 27, 2026, 5:18 AM

Looks like I'm limited to only 3 cloud scheduled tasks. And I'm on the Max 20x plan, too :(

"Your plan gets 3 daily cloud scheduled sessions. Disable or delete an existing schedule to continue."

But otherwise, this looks really cool. I've tried using local scheduled tasks in both Claude Code Desktop and the Codex desktop app, and very quickly got annoyed with permissions prompts, so it'll be nice to be able to run scheduled tasks in the cloud sandbox.

Here are the three tasks I'll be trying:

Every Monday morning: Run `pnpm audit` and research any security issues to see if they might affect our project. Run `pnpm outdated` and research into any packages with minor or major upgrades available. Also research if packages have been abandoned or haven't been updated in a long time, and see if there are new alternatives that are recommended instead. Put together a brief report highlighting your findings and recommendations.

Every weekday morning: Take at Sentry errors, logs, and metrics for the past few days. See if there's any new issues that have popped up, and investigate them. Take a look at logs and metrics, and see if anything seems out of the ordinary, and investigate as appropriate. Put together a report summarizing any findings.

Every weekday morning: Please look at the commits on the `develop` branch from the previous day, look carefully at each commit, and see if there are any newly introduced bugs, sloppy code, missed functionality, poor security, missing documentation, etc. If a commit references GitHub issues, look up the issue, and review the issue to see if the commit correctly implements the ticket (fully or partially). Also do a sweep through the codebase, looking for low-hanging fruit that might be good tasks to recommend delegating to an AI agent: obvious bugs, poor or incorrect documentation, TODO comments, messy code, small improvements, etc.

I ran all of these as one-off tasks just now, and they put together useful reports; it'll be nice getting these on a daily/weekly basis. Claude Code has a Sentry connector that works in their cloud/web environment. That's cool; it accurately identified an issue I've been working on this week.

I might eventually try having these tasks open issues or even automatically address issues and open PRs, but we'll start with just reports for now.

NuclearPMMar 27, 2026, 5:36 AM

0 7 * * 1-5 ANTHROPIC_API_KEY=sk-... /path/to/claude-cron.sh /path/to/repo >> ~/claude-reports.md 2>&1

Seems trivial.

esperentMar 27, 2026, 6:46 AM

A trivial way to rack up hundreds of dollars in API costs, sure.

But you can set up a claude -p call via a cronjob without too much hassle and that can use subscriptions.

maccardMar 27, 2026, 1:30 PM

Sure, now what happens if my laptop is asleep at 7am? Or if our scheduled build took an extra 30 minutes because of contention?

chopete3Mar 27, 2026, 6:19 AM

Claude is moving fast.

https://grok.com/tasks

Grok has had this feature for some time now. I was wondering why others haven't done it yet.

This feature increases user stickiness. They give 10 concurrent tasks free.

I have had to extract specific news first thing in the morning across multiple sources.

sarpdagMar 27, 2026, 3:26 PM

I can't pick the effort for the tasks run on Claude Web. I have a feeling Claude is using low or medium effort on those tasks, and I observe clear quality differences with the task ran on my local claude code, which uses high effort.

cestivanMar 27, 2026, 3:29 PM

[dead]

mkageniusMar 27, 2026, 6:00 AM

This is a bit restrictive, doesn't take screenshots. So you can't "say take screenshots of my homepage and send it to me via email"

It doesnt allow egress curl, apart from few hardcoded domains.

I have created Cronbox in the cloud which has a better utility than above. Did a "Show HN: Cronbox – Schedule AI Agents" a few days back.

https://cronbox.sh

and a pelican riding a bicycle job -

https://cronbox.sh/jobs/pelican-rides-a-bicycle?variant=term...

0898Mar 27, 2026, 1:33 PM

One interesting restriction is that it won’t do anything with people’s faces.

I run conferences and I like to have photos of delegates on the page so you can see who else is attending.

I wanted to automate this by having Claude go to the person’s LinkedIn profile and save the image to the website.

But it seems it won’t do that because it’s been instructed not to.

wslhMar 27, 2026, 2:58 PM

LinkedIn already employs anti-scraping measures, so I'd expect a lot of users to get flagged.

That's not unique to LinkedIn but what is somewhat unique is the strong linkage to real world identities, which raises the cost of Sybil attacks on personal networks with high trust.

throwatdem12311Mar 27, 2026, 12:20 PM

So this is basically just Anthropic’s version of Open Claw that they manage for you and you pay them.

arjieMar 27, 2026, 5:05 AM

What's the per-unit-time compute cost (independent of tokens)? Compute deadline etc.? They don't charge for the Cloud Environment https://code.claude.com/docs/en/claude-code-on-the-web#cloud... currently running?

lucgaganMar 27, 2026, 5:22 AM

Here goes my project.

rhubarbtreeMar 27, 2026, 9:57 AM

Better idea. Watch online feedback on this feature. Then implement things users want. Go niche. Join the forum and help them use Claude to its limits. Then be the next step for power users.

hydroweaver87Mar 27, 2026, 6:24 AM

What were you working on?

pxtailMar 27, 2026, 11:10 AM

Welcome to Amazon playbook replayed again, most useful, profitable and popular use-cases will implemented by platform - and they will do it ruthlessly and quickly as money needs to be recouped.

dbvnMar 27, 2026, 1:29 PM

it would be easier to use claude to write a cronjob that does the same thing for you but accurately

qzncMar 27, 2026, 1:38 PM

And yet it probably covers 90% of what people use OpenClaw for.

pastel8739Mar 27, 2026, 5:02 AM

Is this free? I don’t see pricing info. I guess just a way to make you forget that you’re spending money on tokens?

weird-eye-issueMar 27, 2026, 5:25 AM

You don't spend money on tokens. It is a subscription.

mememememememoMar 27, 2026, 10:33 AM

The PHP script from a cron tab is back!

j1000Mar 27, 2026, 10:36 AM

lmao

PeterStuerMar 27, 2026, 7:18 AM

Is only Github supported as a repository?

SteinmarkMar 27, 2026, 5:16 PM

[dead]

maxbeechMar 27, 2026, 11:47 AM

[dead]

commers148Mar 27, 2026, 12:40 PM

[dead]

MeetRickAIMar 27, 2026, 6:49 AM

[dead]

jngiam1Mar 27, 2026, 5:38 AM

This is powerful. Combined with MCPs, you can pretty much automate a ton of work.

esperentMar 27, 2026, 6:46 AM

Can you give some examples?

adobrawyMar 27, 2026, 8:51 AM

That feature was silent launched about week ago for me.

I use it to:

- perform review of latest changes of code to update my documentation (security policies, user documentation etc.)

- perform review to latest changes of code, triage them, deduplicate and improve code - I review them, close them with comments for over-engoneering / add review for auto-fix

- perform review of open GitHub issue with label, select the one with highest impact, comment with rationale, implement it and make pull request - I wake up and I have a few pull request to fix issues that I can approve /finish in existing Claude Code thread

I want also use it to: - review recent Sentry issues, make GitHub issues for the one with highest priority, make pull request with proposed fix - I can just wake up and see that some crash is ready to be resolved

Limit of 3 scheduled jobs is pretty impactful, but playing with it give me a nice idea on how I can reduce my manual work.

Schedule tasks on the web

Comments