A Scatter-and-Gather Spiking Convolutional Neural Network on a Reconfigurable Neuromorphic Hardware

doi:10.60692/6ja1d-0qn91

Published November 16, 2021 | Version v1

Publication Open

A Scatter-and-Gather Spiking Convolutional Neural Network on a Reconfigurable Neuromorphic Hardware

1. Peking University
2. Institute of Microelectronics

Artificial neural networks (ANNs), like convolutional neural networks (CNNs), have achieved the state-of-the-art results for many machine learning tasks. However, inference with large-scale full-precision CNNs must cause substantial energy consumption and memory occupation, which seriously hinders their deployment on mobile and embedded systems. Highly inspired from biological brain, spiking neural networks (SNNs) are emerging as new solutions because of natural superiority in brain-like learning and great energy efficiency with event-driven communication and computation. Nevertheless, training a deep SNN remains a main challenge and there is usually a big accuracy gap between ANNs and SNNs. In this paper, we introduce a hardware-friendly conversion algorithm called "scatter-and-gather" to convert quantized ANNs to lossless SNNs, where neurons are connected with ternary {-1,0,1} synaptic weights. Each spiking neuron is stateless and more like original McCulloch and Pitts model, because it fires at most one spike and need be reset at each time step. Furthermore, we develop an incremental mapping framework to demonstrate efficient network deployments on a reconfigurable neuromorphic chip. Experimental results show our spiking LeNet on MNIST and VGG-Net on CIFAR-10 datasetobtain 99.37% and 91.91% classification accuracy, respectively. Besides, the presented mapping algorithm manages network deployment on our neuromorphic chip with maximum resource efficiency and excellent flexibility. Our four-spike LeNet and VGG-Net on chip can achieve respective real-time inference speed of 0.38 ms/image, 3.24 ms/image, and an average power consumption of 0.28 mJ/image and 2.3 mJ/image at 0.9 V, 252 MHz, which is nearly two orders of magnitude more efficient than traditional GPUs.

Translated Descriptions

This is an automatic machine translation with an accuracy of 90-95%

Translated Description (Arabic)

حققت الشبكات العصبية الاصطناعية (ANNs)، مثل الشبكات العصبية الالتفافية (CNNs)، أحدث النتائج للعديد من مهام التعلم الآلي. ومع ذلك، يجب أن يتسبب الاستدلال على شبكات CNN كاملة الدقة واسعة النطاق في استهلاك كبير للطاقة واحتلال الذاكرة، مما يعوق بشكل خطير نشرها على الأنظمة المحمولة والمدمجة. مستوحاة بشكل كبير من الدماغ البيولوجي، تظهر الشبكات العصبية المتصاعدة (SNNs) كحلول جديدة بسبب التفوق الطبيعي في التعلم الشبيه بالدماغ وكفاءة الطاقة الكبيرة مع التواصل والحساب القائم على الأحداث. ومع ذلك، لا يزال تدريب SNN العميق يمثل تحديًا رئيسيًا وعادة ما تكون هناك فجوة كبيرة في الدقة بين ANNs و SNNs. في هذه الورقة، نقدم خوارزمية تحويل صديقة للأجهزة تسمى "التشتت والجمع" لتحويل ANNs الكمية إلى SNNs عديمة الخسارة، حيث ترتبط الخلايا العصبية بالأوزان المتشابكة {-1,0,1} الثلاثية. كل خلية عصبية متصاعدة هي عديمة الحالة وتشبه نموذج مكولوتش وبيتس الأصلي، لأنها تطلق ارتفاعًا واحدًا على الأكثر وتحتاج إلى إعادة ضبط في كل خطوة زمنية. علاوة على ذلك، نقوم بتطوير إطار رسم خرائط تدريجي لإظهار عمليات نشر فعالة للشبكة على شريحة عصبية قابلة لإعادة التشكيل. تظهر النتائج التجريبية أن شبكة LeNet الخاصة بنا على MNIST و VGG - Net على بيانات CIFAR -10 تحصل على دقة تصنيف 99.37 ٪ و 91.91 ٪ على التوالي. إلى جانب ذلك، تدير خوارزمية رسم الخرائط المعروضة نشر الشبكة على رقاقتنا العصبية بأقصى قدر من الكفاءة في استخدام الموارد ومرونة ممتازة. يمكن لشبكة LeNet و VGG - Net رباعية الارتفاعات على الشريحة تحقيق سرعة استدلال في الوقت الفعلي تبلغ 0.38 مللي ثانية/صورة، و 3.24 مللي ثانية/صورة، ومتوسط استهلاك للطاقة يبلغ 0.28 مللي جول/صورة و 2.3 مللي جول/صورة عند 0.9 فولت، 252 ميجا هرتز، وهو ما يقرب من اثنين من أوامر الحجم أكثر كفاءة من وحدات معالجة الرسومات التقليدية.

Translated Description (French)

Les réseaux de neurones artificiels (RNA), comme les réseaux de neurones convolutionnels (RNC), ont obtenu des résultats de pointe pour de nombreuses tâches d'apprentissage automatique. Cependant, l'inférence avec des CNN de haute précision à grande échelle doit entraîner une consommation d'énergie et une occupation de la mémoire substantielles, ce qui entrave sérieusement leur déploiement sur les systèmes mobiles et embarqués. Très inspirés du cerveau biologique, les réseaux de neurones dopants (SNN) apparaissent comme de nouvelles solutions en raison de leur supériorité naturelle dans l'apprentissage cérébral et de leur grande efficacité énergétique avec une communication et un calcul pilotés par les événements. Néanmoins, la formation d'un SNN profond reste un défi majeur et il existe généralement un grand écart de précision entre les ANN et les SNN. Dans cet article, nous introduisons un algorithme de conversion compatible avec le matériel appelé « scatter-and-gather » pour convertir les ANN quantifiés en SNN sans perte, où les neurones sont connectés à des poids synaptiques ternaires {-1,0,1}. Chaque neurone dopant est apatride et ressemble davantage au modèle original de McCulloch et Pitts, car il déclenche au plus un pic et doit être réinitialisé à chaque pas de temps. En outre, nous développons un cadre de cartographie incrémentielle pour démontrer des déploiements de réseau efficaces sur une puce neuromorphique reconfigurable. Les résultats expérimentaux montrent que notre LeNet de pointe sur MNIST et VGG-Net sur l'ensemble de données CIFAR-10 obtiennent respectivement 99,37 % et 91,91 % de précision de classification. En outre, l'algorithme de mappage présenté gère le déploiement du réseau sur notre puce neuromorphique avec une efficacité maximale des ressources et une excellente flexibilité. Notre LeNet à quatre pics et notre VGG-Net sur puce peuvent atteindre une vitesse d'inférence en temps réel respective de 0,38 ms/image, 3,24 ms/image et une consommation d'énergie moyenne de 0,28 mJ/image et 2,3 mJ/image à 0,9 V, 252 MHz, ce qui est près de deux ordres de grandeur plus efficace que les GPU traditionnels.

Translated Description (Spanish)

Las redes neuronales artificiales (RNA), como las redes neuronales convolucionales (CNN), han logrado los resultados más avanzados para muchas tareas de aprendizaje automático. Sin embargo, la inferencia con CNN de precisión completa a gran escala debe causar un consumo de energía y una ocupación de memoria sustanciales, lo que dificulta seriamente su despliegue en sistemas móviles e integrados. Altamente inspiradas en el cerebro biológico, las redes neuronales de punta (SNN) están surgiendo como nuevas soluciones debido a la superioridad natural en el aprendizaje similar al cerebro y la gran eficiencia energética con la comunicación y la computación basadas en eventos. Sin embargo, la capacitación de una SNN profunda sigue siendo un desafío principal y, por lo general, existe una gran brecha de precisión entre las Ann y las SNN. En este artículo, presentamos un algoritmo de conversión compatible con el hardware llamado "scatter-and-gather" para convertir las RNA cuantificadas en SNN sin pérdida, donde las neuronas están conectadas con pesos sinápticos {-1,0,1} ternarios. Cada neurona de punta es apátrida y se parece más al modelo original de McCulloch y Pitts, porque dispara como máximo una punta y debe restablecerse en cada paso de tiempo. Además, desarrollamos un marco de mapeo incremental para demostrar implementaciones de red eficientes en un chip neuromórfico reconfigurable. Los resultados experimentales muestran que nuestro pico de LeNet en MNIST y VGG-Net en la base de datos CIFAR-10 obtiene una precisión de clasificación del 99.37% y 91.91%, respectivamente. Además, el algoritmo de mapeo presentado gestiona el despliegue de la red en nuestro chip neuromórfico con la máxima eficiencia de recursos y una excelente flexibilidad. Nuestra LeNet de cuatro puntas y VGG-Net en chip pueden lograr una velocidad de inferencia en tiempo real respectiva de 0.38 ms/imagen, 3.24 ms/imagen y un consumo de energía promedio de 0.28 mJ/imagen y 2.3 mJ/imagen a 0.9 V, 252 MHz, que es casi dos órdenes de magnitud más eficiente que las GPU tradicionales.

Files

pdf.pdf

Files (2.4 MB)

Please wait a few minutes before your translated files are ready Note: Some files might be protected thus translations might not work.

Name	Size	Download all
pdf.pdf md5:9dc13447404c924f2c03efa2e896e7fa	2.4 MB	Preview Download

Additional details

Translated title (Arabic): شبكة عصبية التفافية متناثرة على جهاز عصبي قابل لإعادة التشكيل
Translated title (French): Un réseau neuronal convolutif à enrichissement dispersé sur un matériel neuromorphe reconfigurable
Translated title (Spanish): Una red neuronal convolucional puntiaguda de dispersión y reunión en un hardware neuromórfico reconfigurable

Other: https://openalex.org/W3211535359
DOI: 10.3389/fnins.2021.694170

Is Global South Knowledge: Yes
Country: China

https://openalex.org/W1498436455
https://openalex.org/W1604973310
https://openalex.org/W1625518883
https://openalex.org/W1861492603
https://openalex.org/W2060956313
https://openalex.org/W2076964542
https://openalex.org/W2112796928
https://openalex.org/W2115831804
https://openalex.org/W2117539524
https://openalex.org/W2163288878
https://openalex.org/W2194775991
https://openalex.org/W2300242332
https://openalex.org/W2314470091
https://openalex.org/W2513853720
https://openalex.org/W2569813014
https://openalex.org/W2621826044
https://openalex.org/W2758121337
https://openalex.org/W2775079417
https://openalex.org/W2783525259
https://openalex.org/W2798878556
https://openalex.org/W2800613970
https://openalex.org/W2909235141
https://openalex.org/W2922220370
https://openalex.org/W2966552556
https://openalex.org/W3004127905
https://openalex.org/W3006426821
https://openalex.org/W3007283957
https://openalex.org/W3007981140
https://openalex.org/W3012561096
https://openalex.org/W3118707936
https://openalex.org/W3124237980
https://openalex.org/W3131852626
https://openalex.org/W4299967799

	All versions	This version
Views	2	2
Downloads	2	2
Data volume	4.7 MB	4.7 MB

A Scatter-and-Gather Spiking Convolutional Neural Network on a Reconfigurable Neuromorphic Hardware

Translated Descriptions

Translated Description (Arabic)

Translated Description (French)

Translated Description (Spanish)

Files

pdf.pdf

Files (2.4 MB)

Additional details

Additional titles

Identifiers

Related works

GreSIS Basics Section

References

A Scatter-and-Gather Spiking Convolutional Neural Network on a Reconfigurable Neuromorphic Hardware

Creators

Description

Translated Descriptions

Translated Description (Arabic)

Translated Description (French)

Translated Description (Spanish)

Files

pdf.pdf

Files (2.4 MB)

Additional details

Additional titles

Identifiers

Related works

GreSIS Basics Section

References