The implementation is extremely simple. For feature extraction part, I add three lines of code to the IDT (https://lear.inrialpes.fr/people/wang/improved_trajectories) to skip k frames for scale k. (while scale==k, read k frames, end). For coding and classification part, I use Vlfeat (http://www.vlfeat.org/applications/apps.html).