An energy-efficient analog chip for AI inference | IBM Research Blog

research.ibm.com

megaman1970@lemmy.world to Singularity@lemmy.world · 3 years ago

IBM Research has been investigating ways to reinvent the way that AI is computed. Analog in-memory computing, or simply analog AI, is a promising approach to address the challenge by borrowing key features of how neural networks run in biological brains. In our brains, and those of many other animals, the strength of synapses (which are the “weights” in this case) determines communication between neurons. For analog AI systems, we store these synaptic weights locally in the conductance values of nanoscale resistive memory devices such as phase-change memory (PCM) and perform multiply-accumulate (MAC) operations, the dominant compute operation in DNNs, by exploiting circuit laws, which mitigates the need to constantly send data between memory and processor.
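To make the "exploiting circuit laws" step concrete, here is a minimal idealized sketch in Python. It is not IBM's implementation. It assumes each weight is stored as a pair of non-negative conductances (a common way to represent signed weights), that inputs are applied as row voltages, and that each column wire sums the device currents. The names `G_MAX`, `encode_weights`, `column_currents`, and `analog_mvm` are hypothetical, and the 25 µS full-scale conductance is an arbitrary illustrative value. Device noise, drift, and converter quantization are ignored.

```python
G_MAX = 25e-6  # assumed full-scale device conductance in siemens (illustrative only)


def encode_weights(weights, g_max=G_MAX):
    """Map signed weights in [-1, 1] onto two non-negative conductance arrays."""
    g_pos = [[max(w, 0.0) * g_max for w in row] for row in weights]
    g_neg = [[max(-w, 0.0) * g_max for w in row] for row in weights]
    return g_pos, g_neg


def column_currents(conductances, voltages):
    """Ohm's law gives each device current (G * V); Kirchhoff's current law
    sums the currents of all devices that share a column wire."""
    n_cols = len(conductances[0])
    return [
        sum(conductances[i][j] * voltages[i] for i in range(len(voltages)))
        for j in range(n_cols)
    ]


def analog_mvm(weights, inputs, g_max=G_MAX):
    """Idealized multiply-accumulate: one output per column of the weight array.

    weights[i][j] connects input i (row) to output j (column).
    """
    g_pos, g_neg = encode_weights(weights, g_max)
    i_pos = column_currents(g_pos, inputs)
    i_neg = column_currents(g_neg, inputs)
    # Subtract the two currents and rescale back to weight units.
    return [(p - n) / g_max for p, n in zip(i_pos, i_neg)]
```

Calling `analog_mvm([[0.5, -1.0], [0.25, 0.5]], [1.0, 2.0])` computes the same two dot products that a digital loop would. In the analog picture, every multiplication and addition happens at once in the array, which is why the weights never have to be moved to a separate processor.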

Paper

A 64-core mixed-signal in-memory compute chip based on phase-change memory for deep neural network inference

Abstract

Analogue in-memory computing (AIMC) with resistive memory devices could reduce the latency and energy consumption of deep neural network inference tasks by directly performing computations within memory. However, to achieve end-to-end improvements in latency and energy consumption, AIMC must be combined with on-chip digital operations and on-chip communication. Here we report a multicore AIMC chip designed and fabricated in 14 nm complementary metal–oxide–semiconductor technology with backend-integrated phase-change memory. The fully integrated chip features 64 AIMC cores interconnected via an on-chip communication network. It also implements the digital activation functions and additional processing involved in individual convolutional layers and long short-term memory units. With this approach, we demonstrate near-software-equivalent inference accuracy with ResNet and long short-term memory networks, while implementing all the computations associated with the weight layers and the activation functions on the chip. For 8-bit input/output matrix–vector multiplications, in the four-phase (high-precision) or one-phase (low-precision) operational read mode, the chip can achieve a maximum throughput of 16.1 or 63.1 tera-operations per second at an energy efficiency of 2.48 or 9.76 tera-operations per second per watt, respectively.
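For a sense of scale, the throughput and efficiency figures at the end of the abstract can be combined with simple arithmetic. The sketch below assumes that each throughput figure and its paired efficiency figure describe the same operating point, which the abstract implies but does not spell out. The function and dictionary names are hypothetical.

```python
# Figures quoted in the abstract for 8-bit input/output matrix-vector multiplications.
MODES = {
    "four-phase (high precision)": {"tops": 16.1, "tops_per_watt": 2.48},
    "one-phase (low precision)": {"tops": 63.1, "tops_per_watt": 9.76},
}


def implied_power_watts(tops, tops_per_watt):
    """Power = throughput / efficiency (the tera-ops units cancel)."""
    return tops / tops_per_watt


def energy_per_op_picojoules(tops_per_watt):
    """1 TOPS/W is 1e12 operations per joule, so one operation costs
    1 / tops_per_watt picojoules."""
    return 1.0 / tops_per_watt


def summarize(modes=MODES):
    """Return implied power (W) and energy per operation (pJ) for each mode."""
    return {
        name: {
            "power_w": implied_power_watts(m["tops"], m["tops_per_watt"]),
            "pj_per_op": energy_per_op_picojoules(m["tops_per_watt"]),
        }
        for name, m in modes.items()
    }
```

Under that assumption, both read modes work out to roughly 6.5 W, so the one-phase mode gets its higher efficiency from doing about four times as many operations for a similar power draw, at about 0.1 pJ per operation versus about 0.4 pJ in the four-phase mode.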

SubArcticTundra@lemmy.ml · 2 upvotes · 3 years ago
Does going analogue remove the limit on the number of operations you can do per second (i.e. the clock rate)?

megaman1970@lemmy.world (OP) · 1 upvote · 3 years ago
Here is an Ars Technica article on the technology.

possibly a cat@lemmy.ml [B] · 1 upvote · edited 2 years ago
deleted by creator

Singularity@lemmy.world

The technological singularity—or simply the singularity—is a hypothetical future point in time at which technological growth becomes uncontrollable and irreversible, resulting in unforeseeable changes to human civilization. According to the most popular version of the singularity hypothesis, I. J. Good’s intelligence explosion model, an upgradable intelligent agent will eventually enter a “runaway reaction” of self-improvement cycles, each new and more intelligent generation appearing more and more rapidly, causing an “explosion” in intelligence and resulting in a powerful superintelligence that qualitatively far surpasses all human intelligence.

— Wikipedia

This is a community for discussing theoretical and practical consequences related to the singularity, or any other innovation in the realm of machine learning capable of potentially disrupting our society.

You can share news, research papers, discussions and opinions. This community is mainly meant for information and discussion, so entertainment (such as memes) should generally be avoided, unless the content is thought-provoking or has some other qualities.

Rules:

Be nice to everyone, even if you disagree.
No spam. No ads.
No NSFW.
Self-promotion is acceptable if not excessive (i.e. no spam).

mods:
Drew Got No Clue@lemmy.world