Yann Chene PRO
Yann-CV
AI & ML interests
computer vision - anomaly detection - data orchestration - data efficient training & monitoring - smart sampling - smart labeling - smart balancing
Recent Activity
posted an update 2 days ago
Goldener feature: Semantics-aware sampling for better models
Goldener provides smart data sampling out of the box by combining 2 different GoldDoers (classes orchestrating data actions):
1. GoldDescriptor: Unlocks access to data semantics through embeddings computed by foundation models.
2. GoldSelector: Selects samples automatically by exploiting those semantics with coreset algorithms.
Both the foundation model and the coreset algorithm are fully customizable, so the selection goals can be reached in a few lines of Python code.
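To make the idea of coreset selection over embeddings concrete, here is a minimal sketch of one common coreset heuristic, k-center greedy (farthest-point sampling). It does not use Goldener's API, and the post does not say which coreset algorithm Goldener runs by default. The function name `k_center_greedy` and the toy 2-D "embeddings" are illustrative assumptions; in practice the vectors would come from a foundation model.

```python
import math
import random


def k_center_greedy(embeddings, k, first_index=0):
    """Pick k indices so that every sample lies close to a selected one.

    Greedy farthest-point heuristic for the k-center coreset problem.
    `embeddings` is a list of equal-length float vectors.
    """
    if not 0 < k <= len(embeddings):
        raise ValueError("k must be between 1 and the number of samples")

    selected = [first_index]
    # Distance from every sample to its nearest selected sample so far.
    min_dist = [math.dist(e, embeddings[first_index]) for e in embeddings]

    while len(selected) < k:
        # The sample farthest from the current selection adds the most coverage.
        next_index = max(range(len(embeddings)), key=min_dist.__getitem__)
        selected.append(next_index)
        for i, e in enumerate(embeddings):
            min_dist[i] = min(min_dist[i], math.dist(e, embeddings[next_index]))

    return selected


# Toy stand-in for foundation-model embeddings (assumption, for illustration):
# two tight 2-D clusters plus a single outlier.
rng = random.Random(0)
cluster_a = [[rng.gauss(0.0, 0.1), rng.gauss(0.0, 0.1)] for _ in range(50)]
cluster_b = [[rng.gauss(5.0, 0.1), rng.gauss(5.0, 0.1)] for _ in range(50)]
outlier = [[10.0, -10.0]]
toy_embeddings = cluster_a + cluster_b + outlier

picked = k_center_greedy(toy_embeddings, k=3)
```

Unlike random selection, which would most likely draw all three samples from the two large clusters, this heuristic covers both clusters and the outlier, which is the kind of semantic coverage the post describes.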
The result? Goldener can replace the usual random selection and help release better models, faster!
More details: https://huggingface.co/blog/Yann-CV/goldener-smart-sampling
Give it a try: pip install goldener
published an article 29 days ago
GOLDENER: Smart batching from embeddings and clustering
published an article about 1 month ago
GOLDENER: Smart sampling from embeddings and coreset selection