“…While traditionally this would have required N S 2 N T times of multiply-and-accumulation (MAC) operations per pixel per frame ( N S and N T are filter sizes along space and time dimensions, respectively), a separable implementation can be much more efficient. Given the fact that the horizontal or vertical components of the optical flow can be computed independently from the separate horizontal or vertical motion energy channels, the 3D spatiotemporal filter can be decomposed into cascaded spatial and temporal filters [14,25]. This way, the horizontal and vertical motion energy feature maps ME X and ME Y for different spatiotemporal tuning frequencies ( f X/Y , f T ) are extracted as:ITfalse(x,y,t;fTfalse)=Ifalse(x,y,tfalse)∗Gaborfalse(t;fTfalse),MEXfalse(x,y,t;fS,fTfalse)=|ITfalse(x,y,t;fTfalse)∗Gaussfalse(yfalse)∗Gaborfalse(x;fXfalse)|2,MEYfalse(x,y,t;fS,fTfalse)=|ITfalse(x,y,t;fTfalse)∗Gaussfalse(xfalse)∗G…”