15 Pages
English

Programmable Active Memories

Gain access to the library to view online
Learn more

Description

Niveau: Supérieur, Doctorat, Bac+8
Programmable Active Memories: a Performance Assessment Patrice Bertin, Didier Roncin and Jean Vuillemin Digital Equipment Corporation Paris Research Laboratory 85, avenue Victor-Hugo 92563 Rueil Malmaison Cedex, France Abstract We present some quantitative performance measurements for the comput- ing power of Programmable Active Memories (PAM), as introduced by [2]. Based on Field Programmable Gate Array (FPGA) technology, the PAM is a universal hardware co-processor closely coupled to a standard host com- puter. The PAM can speed up many critical software applications running on the host, by executing part of the computations through a specic hard- ware design. The performance measurements presented are based on two PAM architectures and ten specic applications, drawn from arithmetics, algebra, geometry, physics, biology, audio and video. Each of these PAM designs proves as fast as any reported hardware or super-computer for the corresponding application. In cases where we could bring some genuine al- gorithmic innovation into the design process, the PAM has proved an order of magnitude faster than any previously existing system (see [19] and [18]). 1 PAM concept Like any RAM memory module, a PAM is attached to the system bus of a host computer. The processor can write into, and read from the PAM.

  • design into

  • ram

  • hardware design

  • hardware

  • clock speed

  • software

  • up many critical


Subjects

Informations

Published by
Reads 16
Language English

Programmable
Activ
e
an
an
is
Memories:
a
a
AM.
P
to
erformance
e
Assessmen
y
t
of
P
pro
atrice
host
Bertin,
are
Didier
of
Roncin
e
and
signican
Jean
[18
V
AM
uillemin
write
Digital
cessor
Equipmen
and
t
of
Corp
b
oration
sp
P
exe-
aris
in
Researc
ted
h
er
Lab
k
oratory
sp
85,
soft
a
(see
v
AM
en
mo
ue
to
Victor-Hugo
The
92563
read
Rueil
e
Malmaison
er,
Cedex,
et
F
sp
rance
the
Abstract
ation
W
the
e
a
presen
few
t
sp
some
the
quan
al
titativ
design
e
AM.
p
and
erformance
In
measuremen
attempted
ts
application
for
h,
the
frequen
comput-
sp
ing
design
p
of
o
hardw
w
existing
er
]
of
1
Programm
Lik
able
RAM
Activ
a
e
attac
Memories
system
(P
host
AM),
cessor
as
to,
in
the
tro
an
duced
are
b
w
y
P
[2
data
].
een
Based
instructions.
on
pro
Field
b
Programm
ten
able
c
Gate
.
Arra
c
y
AM
(FPGA)
downlo
tec
design
hnology
within
,
W
the
up
P
soft
AM
running
is
b
a
its
univ
op
ersal
hardw
hardw
wn-
are
the
co-pro
en
cessor
h
closely
are
coupled
elo
to
them,
a
v
standard
co
host
man
com-
as
puter.
In
The
e
P
basic
AM
problems,
can
inner-lo
sp
through
eed
P
up
in
man
sp
y
application
critical
are
soft
com
w
previously
are
system
applications
[19
running
and
on
]).
the
P
host,
concept
b
e
y
y
executing
memory
part
dule,
of
P
the
is
computations
hed
through
the
a
bus
sp
a
ecic
computer.
hard-
pro
w
can
are
in
design.
and
The
from
p
P
erformance
Being
measuremen
activ
ts
hardw
presen
co-pro
ted
ho
are
ev
based
the
on
AM
t
cesses
w
b
o
w
P
write
AM
read
arc
The
hitectures
ecic
and
cessing
ten
determined
sp
y
ecic
con
applications,
t
dra
its
wn
ongur
from
memory
arithmetics,
The
algebra,
can
geometry
hange
,
P
ph
conguration
ysics,
y
biology
ading
,
new
audio
in
and
it,
video.
a
Eac
milliseconds.
h
e
of
eed
these
a
P
ecic
AM
w
designs
application
pro
on
v
host,
es
y
as
cuting
fast
critic
as
inner-lo
an
through
y
appropriate
rep
are
orted
do
hardw
loaded
are
to
or
P
sup
T
er-computer
examples
for
suc
the
designs
corresp
applications
onding
presen
application.
b
In
w.
cases
selecting
where
w
w
ha
e
e
could
to
bring
v
some
as
gen
y
uine
areas
al-
w
gorithmic
could.
inno
eac
v
w
ation
pic
in
ed
to
and
the
t
design
where
pro
large
cess,
op
the
eedup
P
a
AM
ecic
has
AM
pro
results
v
a
ed
t
an
eedup
order
the
of
system,
magnitude
w
faster
and
than
are
an
bined.F
or
the
o
external
er
sak
h
e
useful
of
n
debugging,
h
the
hardw
functionalit
er
y
k
of
oth
eac
osed
h
is
hardw
resp
are
(hereafter
design
banks
is
the
matc
op
hed,
to
bit-wise
F
at
is
the
bits
I/O
computer
lev
e
el,
o
b
arc
y
DECP
a
and
soft
arra
w
P
are
y
implemen
fast
tat
generator,
ion.
wnload
Through
ops
successiv
second)
e
op
re