M Almenea

malmenea
·
  • almenea

AI & ML interests

None yet

Recent Activity

upvoted a paper 22 days ago
Executable Code Actions Elicit Better LLM Agents
liked a Space 4 months ago
nanotron/ultrascale-playbook
reacted to loubnabnl's post with 🤗 over 1 year ago
We've just published a detailed blog post on the creation of Cosmopedia dataset. We hope this will provide insights about generating synthetic data at scale for pre-training. https://huggingface.co/blog/cosmopedia
Here are some key takeaways:
🎯 Prompt curation is crucial: we want to cover many topics with few duplicates.
📚 You can leverage various resources for diversity: using different seed data, generation formats, and target audiences.
⚙️ The importance of a good technical stack: for scalable generations with tools like llm-swarm and fast model training and evaluation.
Have a good read!

Organizations

Hugging Face MCP Course

models 0

None public yet

datasets 0

None public yet