Neuromorphic Bisable VLSI Synapses with Spike

Neuromorphic Bistable VLSI Synapses with
Spike-Timing-Dependent Plasticity
Giacomo Indiveri
Institute of Neuroinformatics
University/ETH Zurich
CH-8057 Zurich, Switzerland
[email protected]
Abstract
We present analog neuromorphic circuits for implementing bistable synapses with spike-timing-dependent plasticity (STDP) properties. In these
types of synapses, the short-term dynamics of the synaptic efficacies are
governed by the relative timing of the pre- and post-synaptic spikes,
while on long time scales the efficacies tend asymptotically to either a
potentiated state or to a depressed one. We fabricated a prototype VLSI
chip containing a network of integrate and fire neurons interconnected
via bistable STDP synapses. Test results from this chip demonstrate the
synapse’s STDP learning properties, and its long-term bistable characteristics.
1 Introduction
Most artificial neural network algorithms based on Hebbian learning use correlations of
mean rate signals to increase the synaptic efficacies between connected neurons. To prevent uncontrolled growth of synaptic efficacies, these algorithms usually incorporate also
weight normalization constraints, that are often not biophysically realistic. Recently an
alternative class of competitive Hebbian learning algorithms has been proposed based on a
spike-timing-dependent plasticity (STDP) mechanism [1]. It has been argued that the STDP
mechanism can automatically, and in a biologically plausible way, balance the strengths of
synaptic efficacies, thus preserving the benefits of both weight normalization and correlation based learning rules [16]. In STDP the precise timing of spikes generated by the
neurons play an important role. If a pre-synaptic spike arrives at the synaptic terminal before a post-synaptic spike is emitted, within a critical time window, the synaptic efficacy
is increased. Conversely if the post-synaptic spike is emitted soon before the pre-synaptic
one arrives, the synaptic efficacy is decreased.
While mean rate Hebbian learning algorithms are difficult to implement using analog circuits, spike-based learning rules map directly onto VLSI [4, 6, 7]. In this paper we present
compact analog circuits that, combined with neuromorphic integrate and fire (I&F) neurons
and synaptic circuits with realistic dynamics [8, 12, 11] implement STDP learning for short
time scales and asymptotically tend to one of two possible states on long time scales. The
circuits required to implement STDP, are described in Section 2. The circuits that implement bistability are described in Section 3. The network of I&F neurons used to measure
the properties of the bistable STDP synapse is described in Section 4.
Long term storage of synaptic efficacies
The circuits that drive the synaptic efficacy to one of two possible states on long time scales,
were implemented in order to cope with the problem of long term storage of analog values
in CMOS technology. Conventional VLSI capacitors, the devices typically used as memory
elements, are not ideal, in that they slowly loose the charge they are supposed to store, due
to leakage currents. Several solutions have been proposed for long term storage of synaptic
efficacies in analog VLSI neural networks. One of the first suggestions was to use the same
method used for dynamic RAM: to periodically refresh the stored value. This involves
though discretization of the analog value to N discrete levels, a method for comparing the
measured voltage to the N levels, and a clocked circuit to periodically refresh the value
on the capacitor. An alternative solution is to use analog-to-digital (ADC) converters, an
off chip RAM and digital-to-analog converters (DAC), but this approach requires, next
to a discretization of the value to N states, bulky ADC and DAC circuits. A more recent
suggestion is the one of using floating gate devices [5]. These devices can store very precise
analog values for an indefinite amount of time using standard CMOS technology [13], but
for spike-based learning rules they would require a control circuit (and thus large area) per
synapse. To implement dense arrays of neurons with large numbers of dendritic inputs the
synaptic circuits should be as compact as possible.
Bistable synapses
An alternative approach that uses a very small amount of area per synapse is to use bistable
synapses. These types of synapses contain minimum feature-size circuits that locally compare the value of the synaptic efficacy stored on the capacitor with a fixed threshold voltage
and slowly drive that value either toward a high analog voltage or toward a low one, depending on the output of the comparator (see Section 3).
The assumption that on long time scales the synaptic efficacy can only assume two values
is not too severe, for networks of neurons with large numbers of synapses. It has been
argued that also biological synapses can be indeed discrete on long time-scales. These
assumptions are compatible with experimental data [3] and are supported by experimental
evidence [15]. Also from a theoretical perspective it has been shown that the performance
of associative networks is not necessarily degraded if the dynamic range of the synaptic
efficacy is reduced even to the extreme (two stable states), provided that the transitions
between stable states are stochastic [2].
Related work
Bistable VLSI synapses in networks of I&F neurons have already been proposed in [6], but
in those circuits, the synaptic efficacy is always clamped to either a high value or a low one,
also for short-term dynamics, as opposed to our case, in which the synaptic efficacy can
assume any analog value between the two. In [7] the authors propose a spike-based learning circuit, based on a modified version of Riccati’s equation [10], in which the synaptic
efficacy is a continuous analog voltage; but their synapses require many more transistors
than the solution we propose, and do not incorporate long-term bistability. More recently
Bofill and Murray proposed circuits for implementing STDP within a framework of pulsebased neural network circuits [4]. But, next to missing the long-term bistability properties,
their synaptic circuits require digital control signals that cannot be easily generated within
the framework of neuromorphic networks of I&F neurons [8, 12].
Vdd
Vdd
M3
M4
Vtp
M2
Vdd
M10
M5
/post
Vpot
Ipot
Vw0
Vd
M6
M7
Vp
Cw
Idep
Vdep
pre
M11
M8
M1
M12
M9
Vtd
Figure 1: Synaptic efficacy STDP circuit.
2 The STDP circuits
The circuit required to implement STDP in a network of I&F neurons is shown in Fig. 1.
This circuit increases or decreases the analog voltage Vw0 , depending on the relative timing
of the pulses pre and /post. The voltage Vw0 is then used to set the strength of synaptic
circuits with realistic dynamics, of the type described in [11]. The pre- and post-synaptic
pulses pre and /post are generated by compact, low power I&F neurons, of the type described in [9].
The circuit of Fig. 1 is fully symmetric: upon the arrival of a pre-synaptic pulse pre a
waveform Vpot (t) (for potentiating Vw0 ) is generated. Similarly, upon the arrival of a
post-synaptic pulse /post, a complementary waveform Vdep (t) (for depotentiating Vw0 )
is generated. Both waveforms have a sharp onset and decay linearly with time, at a rate set
respectively by Vtp and Vtd . The pre- and post-synaptic pulses are also used to switch on
two gates (M 8 and M 5), that allow the currents Idep and Ipot to flow, as long as the pulses
are high, either increasing or decreasing the weight. The bias voltages V p on transistor M 6
and Vd on M 7 set an upper bound for the maximum amount of current that can be injected
into or removed from the capacitor Cw . If transistors M 4−M 9 operate in the subthreshold
regime [13], we can compute the analytical expression of Ipot (t) and Idep (t):
Ipot (t) =
Idep (t) =
e
I0
− Uκ Vpot (t−tpre )
T
+e
− Uκ Vp
(1)
T
I0
− Uκ Vdep (t−tpost )
(2)
− κ V
e T
+ e UT d
where tpre and tpost are the times at which the pre-synaptic and post-synaptic spikes are
emitted, UT is the thermal voltage, and κ is the subthreshold slope factor [13]. The change
in synaptic efficacy is then:
(
I
(t
)
∆Vw0 = potCppost ∆tspk
if tpre < tpost
(3)
Idep (tpre )
∆Vw0 = − Cd ∆tspk if tpost < tpre
where ∆tspk is the pre- and post-synaptic spike width, Cp is the parasitic capacitance of
node Vpot and Cd the one of node Vdep (not shown in Fig. 1).
In Fig. 2(a) we plot experimental data showing how ∆Vw0 changes as a function of ∆t =
tpre − tpost for different values of Vtd and Vtp . Similarly, in Fig. 2(b) we show plots
w0
0
−0.5
∆V
∆V
w0
(V)
0.5
(V)
0.5
−10
−5
0
∆ t (ms)
5
0
−0.5
10
−10
−5
(a)
0
∆ t (ms)
5
10
(b)
Figure 2: Changes in synaptic efficacy, as a function of the difference between pre- and
post-synaptic spike emission times ∆t = tpre −tpost . (a) Curves obtained for four different
values of Vpot (in the left quadrant) and four different values of Vdep (in the right quadrant).
(b) Typical STDP plot, obtained by setting Vp to 4.0V and Vd to 0.6V.
Vw0 (V)
1.5
0
0
5
2
3
4
5
0
0
5
1
2
3
4
5
0
0
1
2
3
Time (ms)
4
5
pre (V)
V
dep
(V)
1
Figure 3: Changes in Vw0 , in response to a sequence of pre-synaptic spikes (top trace). The
middle trace shows how the signal Vdep , triggered by the post-synaptic neuron, decreases
linearly with time. The bottom trace shows the series of digital pulses pre, generated with
every pre-synaptic spike.
of ∆Vw0 versus ∆t for three different values of Vp and three different values of Vd . As
there are four independent control biases, it is possible to set the maximum amplitude and
temporal window of influence independently for positive and negative changes in V w0 .
The data of Fig. 2 was obtained using a paired-pulse protocol similar to the one used in
physiological experiments [14]: one single pair of pre- and post-synaptic spikes was used
to measure each ∆Vw0 data point, by systematically changing the delay tpre − tpost and
by separating each stimulation session by a few hundreds of milliseconds (to allow the
signals to return to their resting steady-state). Unlike the biological experiments, in our
VLSI setup it is possible to evaluate the effect of multiple pulses on the synaptic efficacy,
for very long successive stimulation sessions, monitoring all the internal state variables
and signals involved in the process. In Fig. 3 we show the effect of multiple pre-synaptic
spikes, succeeding a post-synaptic one, plotting a trace of the voltage V w0 , together with the
Vhigh
M3
Vw0
−
Vthr
+
M4
M5
Vw0
M6
Vleak
M1
M2
Vlow
Figure 4: Bistability circuit. Depending on Vw0 − Vthr , the comparator drives Vw0 to either
Vhigh or Vlow . The rate at which the circuit drives Vw0 toward the asymptote is controlled
by Vleak and imposed by transistors M 2 and M 4.
“internal” signal Vdep , generated by the post-synaptic spike, and the pulses pre, generated
by the per-synaptic neuron. Note how the change in Vw0 is a positive one, when the postsynaptic spike follows a pre-synaptic one, at t = 0.5ms, and is negative when a series
of pre-synaptic spikes follows the post-synaptic one. The effect of subsequent pre pulses
following the first post-/pre-synaptic pair is additive, and decreases with time as in Fig. 2.
As expected, the anti-causal relationship between pre- and post-synaptic neurons has the
net effect of decreasing the synaptic efficacy.
3 The bistability circuit
The bistability circuit, shown in Fig. 4, drives the voltage Vw0 toward one of two possible
states: Vhigh (if Vw0 > Vthr ), or Vlow (if Vw0 < Vthr ). The signal Vthr is a threshold
voltage that can be set externally. The circuit comprises a comparator, and a mixed-mode
analog-digital leakage circuit. The comparator is a five transistor transconductance amplifier [13] that can be designed using minimum feature-size transistors. The leakage circuit
contains two gates that act as digital switches (M 5, M 6) and four transistors that set the
two stable state asymptotes Vhigh and Vlow and that, together with the bias voltage Vleak ,
determine the rate at which Vw0 approaches the asymptotes. The bistability circuit drives
Vw0 in two different ways, depending on how large is the distance between the value of V w0
itself and the asymptote. If |Vw0 −Vas | > 4UT the bistability circuit drives Vw0 toward Vas
linearly, where Vas represents either Vlow or Vhigh , depending on the sign of (Vw0 − Vthr ):
(
leak
t if Vw0 > Vthr
Vw0 (t) = Vw0 (0) + IC
w
(4)
Ileak
Vw0 (t) = Vw0 (0) − Cw t if Vw0 < Vthr
where Cw is the capacitor of Fig. 1 and
Ileak = I0 e
κVleak −Vlow
UT
As Vw0 gets close to the asymptote and |Vw0 −Vas | < 4UT , transistors M 2 or M 4 of Fig. 4
go out of saturation and Vw0 begins to approach the asymptote exponentially:
(
I
− leak t
Vw0 (t) = Vhigh − Vw0 (0)e Cw UT
if Vw0 > Vthr
(5)
Ileak
− Cw
t
UT
Vw0 (t) = Vlow + Vw0 (0)e
if Vw0 < Vthr
On long time scales the dynamics of Vw0 are governed by the bistability circuit, while on
short time-scales they are governed by the STDP circuits and the precise timing of pre- and
3
2
V
w0
(V)
2.5
1.5
1
0
2
4
6
Time (ms)
8
10
Figure 5: Synaptic efficacy bistability. Transition of Vw0 from below threshold to above
threshold (Vthr = 1.52V ), with leakage rate set by Vleak = 0.25V and pre- and postsynaptic neurons stimulated in a way to increase Vw0 .
I1
I2
M1
O1
M2
O2
Figure 6: Network of leaky I&F neurons with bistable STDP excitatory synapses and inhibitory synapses. The large circles symbolize I&F neurons, the small empty ones bistable
STDP excitatory synapses, and the small bars non-plastic inhibitory synapses. The arrows
in the circles indicate the possibility to inject current from an external source, to stimulate
the neurons.
post-synaptic spikes. If the STDP short-term dynamics drive Vw0 above threshold we say
that long-term potentiation (LTP) had been induced. And if the short-term dynamics drive
Vw0 below threshold, we say that long-term depression (LTD) has been induced.
In Fig. 5 we show how the synaptic efficacy Vw0 changes upon induction of LTP, while
stimulating the pre- and post-synaptic neurons with uniformly distributed spike trains. The
asymptote Vlow was set to zero, and Vhigh to 2.75V. The pre- and post-synaptic neurons
were injected with constant DC currents in a way to increase Vw0 , on average. As shown,
the two asymptotes Vlow and Vhigh act as two attractors, or stable equilibrium points,
whereas the threshold voltage Vthr acts as an unstable equilibrium point. If the synaptic efficacy is below threshold the short-term dynamics have to fight against the long-term
bistability effect, to increase Vw0 . But as soon as Vw0 crosses the threshold, the bistability
circuit switches, the effects of the short-term dynamics are reinforced by the asymptotic
drive, and Vw0 is quickly driven toward Vhigh .
4 A network of integrate and fire neurons
The prototype chip that we used to test the bistable STDP circuits presented in this paper,
contains a symmetric network of leaky I&F neurons [9] (see Fig. 6). The experimental data
w0
V
V
w0
(V)
4
(V)
4
4
6
8
10
0
0
2
2
4
6
8
10
0
0
2
4
6
Time (ms)
8
10
0
0
2
2
4
6
8
10
0
0
2
2
4
6
8
10
0
0
2
4
6
Time (ms)
8
10
pre (V)
pre (V)
post (V)
2
post (V)
0
0
2
(a)
(b)
Figure 7: Membrane potentials of pre- and post-synaptic neurons (bottom and middle traces
respectively) and synaptic efficacy values (top traces). (a) Changes in V w0 for low synaptic efficacy values (Vhigh = 2.1V) and no bistability leakage currents (Vleak = 0). (b)
Changes in Vw0 for high synaptic efficacy values (Vwh = 3.6V ) and with bistability asymptotic drive (Vleak = 0.25V).
of Figs. 2, 3, and 5 was obtained by injecting currents in the neurons labeled I1 and O1
and by measuring the signals from the excitatory synapse on O1. In Fig. 7 we show the
membrane potential of I1, O1, and the synaptic efficacy Vw0 of the corresponding synapse,
in two different conditions. Figure 7(a) shows the changes in Vw0 when both neurons are
stimulated but no asymptotic drive is used. As shown Vw0 strongly depends on the spike
patterns of the pre- and post-synaptic neurons. Figure 7(b) shows a scenario in which
only neuron I1 is stimulated, but in which the weight Vw0 is close to its high asymptote
(Vhigh = 3.6V) and in which there is a long-term asymptotic drive (Vleak = 0.25). Even
though the synaptic weight stays always in its potentiated state, the firing rate of O1 is not
as regular as the one of its efferent neuron. This is mainly due to the small variations of
Vw0 induced by the STDP circuit.
5 Discussion and future work
The STDP circuits presented here introduce a source of variability in the spike timing of the
I&F neurons that could be exploited for creating VLSI networks of neurons with stochastic
dynamics and for implementing spike-based stochastic learning mechanisms [2]. These
mechanisms rely on the variability of the input signals (e.g. of Poisson distributed spike
trains) and on their precise spike-timing in order to induce LTP or LTD only to a small
specific sub-set of the synapses stimulated. In future experiments we will characterize the
properties of the bistable STDP synapse in response to Poisson distributed spike trains, and
measure transition probabilities as functions of input statistics and circuit parameters.
We presented compact neuromorphic circuits for implementing bistable STDP synapses in
VLSI networks of I&F neurons, and showed data from a prototype chip. We demonstrated
how these types of synapses can either store their LTP or LTD state for long-term, or switch
state depending on the precise timing of the pre- and post-synaptic spikes. In the near
future, we plan to use the simple network of I&F neurons of Fig. 6, present on the prototype
chip, to analyze the effect of bistable STDP plasticity at a network level. On the long term,
we plan to design a larger chip with these circuits to implement a re-configurable network
of I&F neurons of O(100) neurons and O(1000) synapses, and use it as a real-time tool for
investigating the computational properties of competitive networks and selective attention
models.
Acknowledgments
I am grateful to Rodney Douglas and Kevan Martin for their support, and to Shih-Chii Liu
and Stefano Fusi for constructive comments on the manuscript. Some of the ideas that led
to the design and implementation of the circuits presented were inspired by the Telluride
Workshop on Neuromorphic Engineering (http://www.ini.unizh.ch/telluride).
References
[1] L. F. Abbott and S. Song. Asymmetric hebbian learning, spike liming and neural response
variability. In Advances in Neural Information Processing Systems, volume 11, pages 69–75,
1998.
[2] D. J. Amit and S. Fusi. Dynamic learning in neural networks with material synapses. Neural
Computation, 6:957, 1994.
[3] T. V. P. Bliss and G. L. Collingridge. A synaptic model of memory: Long term potentiation in
the hippocampus. Nature, 31:361, 1993.
[4] A. Bofill and A.F. Murray. Circuits for VLSI implementation of temporally asymmetric Hebbian learning. In T. G. Dietterich, S. Becker, and Z. Ghahramani, editors, Advances in Neural
Information processing systems, volume 14. MIT Press, Cambridge, MA, 2001.
[5] C. Diorio, P. Hasler, B.A. Minch, and C. Mead. A single-transistor silicon synapse. IEEE Trans.
Electron Devices, 43(11):1972–1980, 1996.
[6] S. Fusi, M. Annunziato, D. Badoni, A. Salamon, and D.J. Amit. Spike-driven synaptic plasticity: theory, simulation, VLSI implementation. Neural Computation, 12:2227–2258, 2000.
[7] P. Häfliger, M. Mahowald, and L. Watts. A spike based learning neuron in analog VLSI. In
M. C. Mozer, M. I. Jordan, and T. Petsche, editors, Advances in neuralinformation processing
systems, volume 9, pages 692–698. MIT Press, 1997.
[8] G. Indiveri. Modeling selective attention using a neuromorphic analog VLSI device. Neural
Computation, 12(12):2857–2880, December 2000.
[9] G. Indiveri. A low-power adaptive integrate-and-fire neuron circuit. In ISCAS 2003. The 2003
IEEE International Symposium on Circuits and Systems, 2003. IEEE, 2003.
[10] T. Kohonen. Self-Organization and Associative Memory. Springer Series in Information Sciences. Springer Verlag, 2nd edition, 1988.
[11] S.-C. Liu, M. Boegerhausen, and S. Pascal. Circuit model of short-term synaptic dynamics. In
Advances in Neural Information Processing Systems, volume 15, Cambridge, MA, December
2002. MIT Press.
[12] S.-C. Liu, J. Kramer, G. Indiveri, T. Delbruck, T. Burg, and R. Douglas. Orientation-selective
aVLSI spiking neurons. Neural Networks, 14(6/7):629–643, 2001. Special Issue on Spiking
Neurons in Neuroscience and Technology.
[13] S.-C. Liu, J. Kramer, G. Indiveri, T. Delbruck, and R. Douglas. Analog VLSI:Circuits and
Principles. MIT Press, 2002.
[14] H. Markram, J. Lübke, M. Frotscher, and B. Sakmann. Regulation of synaptic efficacy by
coincidence of postsynaptic APs and EPSPs. Science, 275:213–215, 1997.
[15] C. C. H. Petersen, R. C. Malenka, R. A. Nicoll, and J. J. Hopfield. All-ornone potentiation at
CA3-CA1 synapses. Proc. Natl. Acad. Sci., 95:4732, 1998.
[16] S. Song, K. D. Miller, and L. F. Abbot. Competitive Hebbian learning through spike-timingdependent plasticity. Nature Neuroscience, 3(9):919–926, 2000.