
He didn't give a great explanation. He just kept dismissing it as a bad idea and suggesting other paths.

I don't see why using the Davinci outputs verbatim as training data would be a problem in certain situations. The goal is just to get a cheaper fine-tuned model (like Curie) closer to Davinci's performance in some narrow problem domain. Of course it's never going to be as good or as broad as Davinci with this approach, but the lower cost may outweigh that. I'm just surprised more people haven't tried and benchmarked this approach... but I'm no expert here, so there is probably a good reason.
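
To make the idea concrete, here's a rough sketch of the data-prep step: collect the bigger model's outputs verbatim and write them out as prompt/completion pairs for fine-tuning the cheaper model. Everything named here is my own placeholder, not a real API: `build_distillation_records`, `to_jsonl`, and `fake_teacher` are hypothetical, and `fake_teacher` is a stub standing in for an actual call to Davinci. I'm also assuming the fine-tuning data is JSON Lines with `prompt` and `completion` keys, so check the current docs before relying on that layout.

```python
import json
from typing import Callable, Iterable, List


def build_distillation_records(
    prompts: Iterable[str],
    teacher: Callable[[str], str],
) -> List[dict]:
    """Pair each prompt with the teacher model's verbatim output."""
    records = []
    for prompt in prompts:
        completion = teacher(prompt)
        # Skip blank outputs so the student is not trained on empty targets.
        if completion.strip():
            records.append({"prompt": prompt, "completion": completion})
    return records


def to_jsonl(records: List[dict]) -> str:
    """Serialize records as JSON Lines, one training example per line."""
    return "\n".join(json.dumps(r) for r in records)


def fake_teacher(prompt: str) -> str:
    # Hypothetical stand-in for a call to the larger model (e.g. Davinci).
    return " " + prompt.upper()


if __name__ == "__main__":
    prompts = ["classify: great product ->", "classify: broke in a day ->"]
    print(to_jsonl(build_distillation_records(prompts, fake_teacher)))
```

The benchmarking part would then be holding out some prompts from the same narrow domain and comparing the fine-tuned cheaper model against Davinci on them, which this sketch doesn't cover.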


