non-wafer-scale sieving hardware for the nfs: another attempt … · 2007-06-12 · 23.5.2007...

Post on 27-Feb-2020

3 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

Non-Wafer-Scale SievingHardware for the NFS:

Another Attempt to Copewith 1024-bit

Willi GeiselmannRainer Steinwandt

23.5.2007 Sieving with Small Chips (Geiselmann, Steinwandt) 2

The Number Field Sieve

Precomputation Relation collection Linear Algebra (Matrix step) Postprocessing

23.5.2007 Sieving with Small Chips (Geiselmann, Steinwandt) 3

Relation Collection

Given F1(x,y), F2(x,y) ∈ Z[x,y] ,homogeneous polynomials,e.g. of degree 5 and 1

Find (a,b) ∈ Z x N with F1(a,b) andF2(a,b) smooth, gcd(a,b) = 1

23.5.2007 Sieving with Small Chips (Geiselmann, Steinwandt) 4

Parameters for 1024 Bit(identical with TWIRL, 2003)

Smoothness bounds:B1 = 2.6 • 1010 (algebraic),B2 = 3.5 • 109 (rational).

Sieving region:A = 5.5 • 1014, -A < a < A;B = 2.7 • 108, 0 < b < B.

23.5.2007 Sieving with Small Chips (Geiselmann, Steinwandt) 5

Previous work

TWINKLE [Shamir 1999; Shamir, Lenstra 2000]not designed for 1024 bit numbers

TWIRL [Shamir, Tromer 2003]full wafer design

Mesh-based sieving [G., St. 2003, 2004]not feasible for 1024 bit numbers

SHARK [Franke et al. 2005]elaborated butterfly transport system

23.5.2007 Sieving with Small Chips (Geiselmann, Steinwandt) 6

0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 2223571113171923293137

Sieving (Eratosthenes)

Memory

Ti

me

Σ Σ Σ Σ Σ Σ Σ Σ Σ Σ Σ ΣΣ Σ Σ Σ Σ Σ Σ Σ Σ Σ Σ

23.5.2007 Sieving with Small Chips (Geiselmann, Steinwandt) 7

0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 2223571113171923293137

Sieving (TWINKLE/TWIRL)

Time

Σ Σ Σ Σ Σ Σ Σ Σ Σ Σ Σ ΣΣ Σ Σ Σ Σ Σ Σ Σ Σ Σ Σ

CounterCounterCounterCounterCounterCounterCounterCounterCounterCounterCounterCounter

23.5.2007 Sieving with Small Chips (Geiselmann, Steinwandt) 8

0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 2223571113171923293137

Sieving (mesh / here)

Memory

Σ Σ Σ Σ Σ Σ Σ ΣΣ Σ Σ Σ Σ Σ Σ Σ

?

??????

CounterCounterCounter

?

?

(S = 226)

23.5.2007 Sieving with Small Chips (Geiselmann, Steinwandt) 9

Different Types of Primes

Largish primes I: 227.2 < p < B1 < 235

...Type II/III: 1.5 • 107 < p < 227.2

Medium primes: 213 < p < 1.5 • 107

Smallish primes: p < 213

23.5.2007 Sieving with Small Chips (Geiselmann, Steinwandt) 10

Largish Stations

. . .

DRAM for (p, r)-pairs

−A ≤ r < −A + S

control logic & adder

DRAM for (p, r)-pairs

−A + S ≤ r < −A+ 2S

control logic & adder

DRAM for (p, r)-pairs

control logic & adder

DRAM for (p, r)-pairs

control logic & adder

- - -

- - -

23.5.2007 Sieving with Small Chips (Geiselmann, Steinwandt) 11

DRAM holds < 100,000 (p,r)-pairs 3 < #DRAMs < 389 (pmax / S) 256 stations for p > 1.5 • 107 ≈ 227.2

Distributed on 32 chips:size: 472 mm2 (0.13 µm process)output: 448 bit per clock cyclememory: 99%, logic: 1%

Largish Stations (Type I)

23.5.2007 Sieving with Small Chips (Geiselmann, Steinwandt) 12

Collection Unit Distributed on 4

chips, each holding 4 arrays of 32 x 32

counting units. Each unit is in charge

of 212 sieve locations, and adding up the

log(p) values.

countingunit

input part

countingunit

countingunit

countingunit

input part

countingunit

countingunit

countingunit

input part

countingunit

countingunit

23.5.2007 Sieving with Small Chips (Geiselmann, Steinwandt) 13

Medium and Smallish Primes

Medium Primes: Calculated close to the Counting Units

Smallish Primes: calculated in the Counting Parts stored (added) in separate DRAM

23.5.2007 Sieving with Small Chips (Geiselmann, Steinwandt) 14

Collection Unit (area estimates)

2,2

cm 22 L staType 290 mm2

16 L staType 346 mm2

16 L staType 3 46 mm2

20 M sta

20 mm

2

2 M sta 2 m

m2

2 M sta 2 m

m2

2 M sta 2 m

m2

2 M sta 2 m

m2

array 50 mm2

array 50 mm2

array 50 mm2

array 50 mm2

22 L sta Type 290 mm2

Distributed on4 chips:

size: 493 mm2

(0.13 µm process)input: 3584 bit / ccmemory: 94%logic: 6%

23.5.2007 Sieving with Small Chips (Geiselmann, Steinwandt) 15

Performance Total silicon area 172 cm2

One subinterval (S=226) in 53,000 cc One sieve line in 25 min (600 MHz) Sieving of a 1024 bit number with

8300 devices in one year 3.5 x more silicon area than TWIRL or 2.0 x more after modification

top related