How many layers to fine-tune? Model fine-tuning allows you — @neural_network_engineering

@neural_network_engineering2.5K подп.

8.5Kпросмотров

31 августа 2022 г.

questionScore: 9.4K

How many layers to fine-tune? Model fine-tuning allows you to improve the quality of the pre-trained models with just a fraction of the resources spent on training the original model. But there is a trade-off between the number of layers you tune and the precision you get. Using fewer layers allows for faster training with a larger batch size, while more layers increase the model's capacity. We've done experiments so you can make more educated choices. Highlights: - Training only the head of a model (5% of weights) gives x2 boost on metrics, while full training gives only x3. - Training only a head layer allows using larger models with bigger batch sizes, compensating for the precision. - If you only have a small dataset, full model tuning will give a more negligible effect

8.5K

просмотров

796

символов

Нет

эмодзи

Нет

медиа

Другие посты @neural_network_engineering

One of the main features of the framework is caching. It allows you to infer large models only once👁 7.8K Triplet loss - Advanced Intro Loss functions in metric learning are all chasing the same goal - t👁 7.3K Similarity Learning lacks a framework. So we built one. Many general-purpose frameworks allow you t👁 7.0K Metric Learning for Anomaly Detection Anomaly detection is one of those tasks to which it is challe👁 6.4K For those who reacted with 🦀 on a previous post. I wrote a Twitter thread on how I am building Qdran👁 6.2K

Все посты канала →

Аналитика канала База постов