首页>
外国专利>
CNN 3x3 1x3 3x1 CNN LOW OVERHEAD IMPLEMENTATION OF WINOGRAD FOR CNN WITH 3x3 1x3 AND 3x1 FILTERS ON WEIGHT STATION DOT-PRODUCT BASED CNN ACCELERATORS
CNN 3x3 1x3 3x1 CNN LOW OVERHEAD IMPLEMENTATION OF WINOGRAD FOR CNN WITH 3x3 1x3 AND 3x1 FILTERS ON WEIGHT STATION DOT-PRODUCT BASED CNN ACCELERATORS
A system and method for forming an output feature map (OFM) is disclosed. Activation values of the input feature map (IFM) are selected and transformed on-the-fly into the Winograd domain. Elements of the Winograd filter each corresponding to the transformed activation values are selected. The transformed activation value is multiplied by the corresponding element of the Winograd filter to form the corresponding product value in the Winograd domain. Activation values are iteratively selected, transformed and multiplied by the corresponding element of the Winograd filter to form the corresponding product value in the Winograd domain until all activation values of the IFM are transformed and multiplied by the corresponding element. The product values are summed in the Winograd domain to form an element of the feature map in the Winograd domain. Elements of the feature map in the Winograd domain are transformed on the fly inverse-winograd to form an OFM.
展开▼