Discussion about this post

User's avatar
Daniel Popescu / ⧉ Pluralisk's avatar

This article comes at the perfect time. Building on your prior work on efficient inference, are you exploring methods to furhter compress these models?

Expand full comment

No posts

Ready for more?