Sitemap

A list of all the posts and pages found on the site. For you robots out there, there is an XML version available for digesting as well.

Pages

Posts

portfolio

Student Capacity Moderates Knowledge Distillation Effectiveness

Systematic study of knowledge distillation across ResNet teacher-student pairs on CIFAR-10. Identifies student capacity — not teacher-student accuracy gap — as the key moderating factor in KD effectiveness. Accompanied by a peer-reviewed arXiv publication.

publications

Student Capacity Moderates Knowledge Distillation Effectiveness: A Systematic Study Across ResNet Teacher-Student Pairs on CIFAR-10

Published in arXiv, 2026

Systematic ablation study comparing Logit-KD and Feature-KD across ResNet teacher-student pairs on CIFAR-10. Key findings: student capacity — not teacher-student accuracy gap — is the primary moderator of KD effectiveness; and implementation correctness critically affects Feature-KD performance.

Recommended citation: Yaşar, U. O. (2026). Student Capacity Moderates Knowledge Distillation Effectiveness. arXiv:2605.31191.
Download Paper

talks

teaching