Hasnain Reads

Hasnain says:

Just came across this and bookmarking for future re reading, for.. reasons

“I think there is a tremendous opportunity for the open source community to really push the state of the art in coding evaluations. There's only so many benchmarks that I, personally, can create, but if everyone is making benchmarks I could eventually imagine a universe of benchmarks where you could curate the problems that are relevant to your work and quickly and cheaply judge models in this way: a Wikipedia of Coding Benchmarks.”

Posted on 2025-05-19T06:40:26+0000