Index of /WallStreet/6351/txt

      Name                    Last modified       Size  Description

[DIR] Parent Directory 20-Mar-2010 15:21 - [TXT] 0001.html 27-Apr-2009 09:28 20k [TXT] 0002.html 27-Apr-2009 09:28 9k [TXT] 0003.html 27-Apr-2009 09:28 9k [TXT] 0004.html 27-Apr-2009 09:28 9k [TXT] 0005.html 27-Apr-2009 09:28 10k [TXT] 0006.html 27-Apr-2009 09:28 13k [TXT] 0007.html 27-Apr-2009 09:28 14k [TXT] 0008.html 27-Apr-2009 09:28 20k [TXT] 0009.html 27-Apr-2009 09:28 11k [TXT] 0010.html 27-Apr-2009 09:28 3k [TXT] 0011.html 27-Apr-2009 09:28 21k [TXT] 0012.html 27-Apr-2009 09:28 15k [TXT] 0013.html 27-Apr-2009 09:28 33k [TXT] 0014.html 27-Apr-2009 09:28 19k [TXT] 0015.html 27-Apr-2009 09:29 32k [TXT] 0016.html 27-Apr-2009 09:29 15k [TXT] 0017.html 27-Apr-2009 09:29 63k [TXT] 0018.html 27-Apr-2009 09:28 59k [TXT] 0019.html 27-Apr-2009 09:29 41k [TXT] 0020.html 27-Apr-2009 09:28 5k [TXT] 0021.html 27-Apr-2009 09:28 49k [TXT] 0022.html 27-Apr-2009 09:28 36k [TXT] 0023.html 27-Apr-2009 09:29 30k [TXT] 0024.html 27-Apr-2009 09:28 82k [TXT] 0025.html 27-Apr-2009 09:28 75k [TXT] 0026.html 27-Apr-2009 09:29 33k [TXT] 0027.html 27-Apr-2009 09:29 55k [TXT] 0028.html 27-Apr-2009 09:29 31k [TXT] 0029.html 27-Apr-2009 09:29 3k [TXT] 0030.html 28-May-2009 19:54 15k [TXT] 0031.html 28-May-2009 19:54 197k [TXT] 0032.html 28-May-2009 19:54 39k [TXT] 0033.html 28-May-2009 19:54 27k [TXT] 0034.html 27-May-2009 22:54 64k [TXT] 0035.html 21-Jul-2009 19:54 83k [TXT] 0036.html 20-Dec-2009 20:54 100k [TXT] 56_game_hitting_stre..> 27-Apr-2009 09:29 5k [TXT] bit.html 08-Jun-2009 18:37 3k [TXT] bnews.html 27-Apr-2009 09:29 20k [TXT] busy_beaver.html 27-Apr-2009 09:29 1k [TXT] distributed_computin..> 27-Apr-2009 09:29 3k [TXT] fermat.html 11-Jan-2010 16:54 6k [TXT] livestation.html 27-Apr-2009 09:29 2k [TXT] mile_per_gallon.html 27-Apr-2009 09:29 1k [TXT] okinawa2004.html 27-Apr-2009 09:29 1k [TXT] okinawa2008.html 27-Apr-2009 09:29 1k [TXT] scifi.html 29-Jun-2009 08:54 25k [TXT] tc-16as.html 08-Jun-2009 18:37 6k [TXT] uucp.html 27-Apr-2009 09:29 20k [TXT] video.html 27-Apr-2009 09:29 33k [TXT] www.americafree.tv.html 27-Apr-2009 09:29 1k

Post-it [Post-it]

---20100316/memo

http://www.perfsci.com/free-software.asp#giantint

		k = 19;
		while ((unsigned int)k<B)
		{
			if (isprime(k))
			{
				ell_mul(xr, zr, k, An, Ad, N);
				if (k<500)            小さい素数は2回含む
					ell_mul(xr, zr, k, An, Ad, N);
				if (cnt++ %1000==0)
					dot();
			}
			k += 2;
		}

第一ステージ

---20100316/memo

http://homepage2.nifty.com/m_kamada/math/ecm_ja.htm

未知の素因数の桁数とB1の関係(READMEより)
知の素因数の桁数	B1	繰り返す回数と発見できる確率
				63%	86%	95%

20			11000	74	148	222
25			50000	214	428	642
30			250000	430	860	1290
35			1000000	904	1808	2712
40			3000000	2350	4700	7050
45			11000000	4480	8960	13440
50			43000000	7553	15106	22659
55			110000000	17769	35538	53307
60			260000000	42017	84034	126051
65			850000000	69408	138816	208224

http://mersenneforum.org/showthread.php?t=13146

 73 digit ECM factor
we (Joppe Bos, Thorsten Kleinjung, Arjen Lenstra, Peter Montgomery)
found the following 73-digit prime factor
1808422353177349564546512035512530001279481259854248860454348989451026887
of 2^1181-1 by ECM, completing the factorisation of this number.
Stage 1: we implemented arithmetic functions for Playstation3s for
Mersenne numbers. Stage 1 for 24 curves in parallel and for B1=3*10^9
took less than 23 hours on one PS3, i.e., less than one hour per curve
per PS3.
Stage 2: we parallelised some functions, this stage with the default value
of B2 of about 10^14 took about 15 minutes on 4 cores (per curve).
We ran more than 30000 stage 1 and 8800 stage 2 computations.
Using B1=3000000000-3000000000, B2=103971375307818, polynomial x^1, sigma=4000027779
Step 1 took 0ms

http://mersenneforum.org/showthread.php?t=13051
GIMPS' second Fermat factor!
F14 has a factor
116928085873074369829035993834596371340386703423373313
I found this after approximately 750 curves ran on F14 with B1=110M.

http://mersenneforum.org/showthread.php?t=12168
 GIMPS' first Fermat factor!
User Buckle turned in this gem recently!
F19 has a factor: 37590055514133754286524446080499713
ECM found a factor in curve #3, stage #2
Sigma=7121198363696307, B1=3000000, B2=300000000.
UID: Buckle/G810, F19 has a factor: 37590055514133754286524446080499713

ECM on Fermat numbers

Digits in factor	25	30	35	40	45	50	55	60
Bound #1	50,000	250,000	1,000,000	3,000,000	11,000,000	44,000,000	110,000,000	260,000,000
Curves to test	280	640	1,580	4,700	9,700	17,100	46,500	112,000
	4096	Done	Done	Done	Done	Done	Done	Done	23,152


  To attack F_n = 2^(2^n) + 1 with recommended internal paremeters:

    % fermat n
 
    To test performance/integrity on your machine:

    % fermat n -t

    (which runs known elliptic curves, with certain expected results).
    To force a stage limit of B = 1000000, say:

    % fermat n -b 1000000 


---20100313/memo

http://www.perfsci.com/free-software.asp#giantint

giantint/numberth$ cc -O factor.c giants.c -lm -o factorfactor.c: In function ‘main’:
factor.c:688: warning: format ‘%d’ expects type ‘int’, but argument 4 has type ‘long int’

giantint/numberth$ time ./factor -1 137
Sieving...

Commencing Pollard rho...
...
Commencing second stage, curve 15...
..............................................................................................................................................................................................................................................................................................................................................
32032215596496435569
* 5439042183600204290159

real    0m20.900s
user    0m20.845s
sys     0m0.028s

giantint/numberth$ time ./factor +1 137
...
Commencing second stage, curve 18...
..............................................................................................................................................................................................................................................................................................................................................
32127963626435681
* 105498212027592977

real    0m27.445s
user    0m27.394s
sys     0m0.040s

giantint/numberth$ time ./fermat 7
Attacking 2^128 + 1
Selecting elliptic curve 1 seed s = 2006972216; B = 10000000, C = 500000000:
.............................................................................................................


---20100313/memo

http://www.rkmath.rikkyo.ac.jp/~kida/kima.htm

木田祐司先生作成のUBASICで書かれた楕円曲線法のプログラム GMP化が目標

 10 ’ecm2
 20 input "n=";N
 30 gosub *Ecm2
 40 print N;"=";GD;"*";N\GD
 50 end
 60 ’
 70 *Ecm2
 80 dim F(48)
 90 P=isqrt(N)
100 L1=min(50000,int(1400*exp((log(P)-31.0)*0.14))+100)
110 L2=40*L1
120 LG=log(L1):LS=isqrt(L1)
130 print "L1,L2=";L1;L2
140 randomize
150 ’
160 *MainLoop
170 EC%=irnd:U=EC%+1

UBASIC では % の付いた変数は −32767 から 32767 までの整数

180 A0=2*U*modinv(3*U^2-1,N)@N

modinv ある数を法とする逆数計算

190 if A0*(A0^2-1)*(9*A0^2-1)@N=0 then 170
200 A=(-3*A0^4-6*A0^2+1)*modinv(4*A0^3,N)@N
210 AA=(A+2)*modinv(4,N)@N
220 X=(3*A0^2+1)@N:Z=4*A0@N
230 print "A=";A;" X,Z=";X;Z
240 ’* 1st step
250 ’ * for power of 2
260 for I%=1 to int(LG/log(2))
270   W1=(X+Z)^2@N:W2=(X-Z)^2@N
280   X=W1*W2@N:Z=(W1-W2)*(W2+AA*(W1-W2)@N)@N
290 next
300 GD=gcd(Z,N):if GD>1 then *Atta
310 ’ * for powers of odd primes
320 IP%=2
330 repeat
340   P=prm(IP%):IP%+=1
350   for I%=1 to int(LG/log(P))
360     gosub *ECmulti(&X,&Z,P)
370   next
380   GD=gcd(Z,N):if GD>1 then *Atta
390 until prm(IP%)>L1
400 ’* 2nd STEP
410 ’* make polynomial
420 UX=X:UZ=Z:’                                =R
430 Deg%=1:F(1)=1:F(0)=(-X*modinv(Z,N))@N
440 W1=(X+Z)^2@N:W2=(X-Z)^2@N
450 TX=W1*W2@N:TZ=(W1-W2)*(W2+AA*(W1-W2)@N)@N:’=2R
460 W1=(X-Z)*(TX+TZ)@N:W2=(X+Z)*(TX-TZ)@N
470 X=(W1+W2)^2@N*UZ@N:Z=(W1-W2)^2@N*UX@N:’    =3R
480 for I%=5 to 210 step 2
490   WX=X:WZ=Z
500   W1=(X-Z)*(TX+TZ)@N:W2=(X+Z)*(TX-TZ)@N
510   X=(W1+W2)^2@N*UZ@N:Z=(W1-W2)^2@N*UX@N:’ =7R,9R,...
520   if I%=105 then DX=X:DZ=Z:’              =105R
530   if gcd(I%,210)=1 then
540     :Deg%+=1:Xw=(-X*modinv(Z,N))@N
550     :F(Deg%)=1
560     :for J%=Deg%-1 to 1 step -1
570       :F(J%)=(F(J%)*Xw+F(J%-1))@N
580     :next
590     :F(0)=F(0)*Xw@N
600   UX=WX:UZ=WZ
610 next
620 ’
630 W1=(DX+DZ)^2@N:W2=(DX-DZ)^2@N
640 X=W1*W2@N:Z=(W1-W2)*(W2+AA*(W1-W2)@N)@N:’ =210R
650 UX=X:UZ=Z
660 W1=(X+Z)^2@N:W2=(X-Z)^2@N
670 TX=W1*W2@N:TZ=(W1-W2)*(W2+AA*(W1-W2)@N)@N:’=420R
680 W1=(X-Z)*(TX+TZ)@N:W2=(X+Z)*(TX-TZ)@N
690 X=(W1+W2)^2@N*UZ@N:Z=(W1-W2)^2@N*UX@N:’    =630R
700 ’
710 for Q%=2 to L2\420
720   X1=X*modinv(Z,N)@N:X2=X1^2@N:X3=X1*X2@N:X4=X2^2@N
730   W=1
740   for J%=47 to 0 step -4
750     W=(W*X4+F(J%)*X3+F(J%-1)*X2+F(J%-2)*X1+F(J%-3))@N
760   next
770   GD=gcd(W,N):if GD>1 then cancel for:goto *Atta
780   WX=X:WZ=Z
790   W1=(X-Z)*(TX+TZ)@N:W2=(X+Z)*(TX-TZ)@N
800   X=(W1+W2)^2@N*UZ@N:Z=(W1-W2)^2@N*UX@N
810   UX=WX:UZ=WZ
820 next:Q%=0
830 goto *MainLoop:’try next curve
840 ’
850 *Atta
860 print:return:’GD is a divisor
870 ’
880 ’* main subroutine
890 ’    M must be odd
900 ’    N,AA : global
910 *ECmulti(&X,&Z,M)
920   local I%,TX,TZ,UX,UZ,W1,W2,W3,W4
930   TX=X:TZ=Z:UX=X:UZ=Z
940   W3=TX+TZ:W4=TX-TZ
950   for I%=1 to len(M)-1
960     W1=W3^2@N:W2=W4^2@N
970     TX=W1*W2@N:TZ=(W1-W2)*(W2+AA*(W1-W2)@N)@N
980     W3=TX+TZ:W4=TX-TZ
990     if bit(I%,M)
1000        :then W1=W4*(X+Z)@N:W2=W3*(X-Z)@N
1010          :X=(W1+W2)^2@N*UZ@N:Z=(W1-W2)^2@N*UX@N
1020        :else W1=W4*(UX+UZ)@N:W2=W3*(UX-UZ)@N
1030          :UX=(W1+W2)^2@N*Z@N:UZ=(W1-W2)^2@N*X@N
1040   next
1050 return

run
n=? 18^16+1
L1,L2= 564 22560
A= 90032034639039456184
  X,Z= 65814232592604926128  45923549317295498171
A= 77216027138829881607
  X,Z= 94698138176341449409  52282093912201079366
A= 10384364739415668074
  X,Z= 27901642140712720029  35238368532640931443
 121439531096594251777 = 30894471809 * 3930785153

http://www.perfsci.com/free-software.asp#giantint

には、CソースのECM法

---20100201/CUDA

http://www.nvidia.com/object/cuda_develop.html

"CUDA 2.2 Quick Start Guide" is Install Guide.

---20100131/memo

$ time ./a.out f24.param |more
GeneFer 1.3 (standard)  Copyright (C) 2001-2002, Yves GALLOT
A program for finding large probable generalized Fermat primes.

Usage: GeneFer -b            run bench
       GeneFer -t            run test
       GeneFer <filename>    test <filename>
       GeneFer               use interactive mode

Start test of file 'f24.param'.
 3.96e+00 536^1058076+1...
 0.00e+00 

$ time ./a.out f26.param |more
GeneFer 1.3 (standard)  Copyright (C) 2001-2002, Yves GALLOT
A program for finding large probable generalized Fermat primes.

Usage: GeneFer -b            run bench
       GeneFer -t            run test
       GeneFer <filename>    test <filename>
       GeneFer               use interactive mode

Start test of file 'f26.param'.
 4.00e+00 536^4194304+1...
 0.00e+00 

---20100123/memo

http://www.mersenne.org/report_exponent/?exp_lo=36042257&exp_hi=&B1=Get+status
http://www.mersenne.org/report_exponent/?exp_lo=22018729&exp_hi=&B1=Get+status
http://www.mersenne.org/report_exponent/?exp_lo=22020091&exp_hi=&B1=Get+status
http://www.mersenne.org/report_exponent/?exp_lo=22021367&exp_hi=&B1=Get+status
http://www.mersenne.org/report_exponent/?exp_lo=22041823&exp_hi=&B1=Get+status
http://www.mersenne.org/report_exponent/?exp_lo=22052431&exp_hi=&B1=Get+status
http://www.mersenne.org/report_exponent/?exp_lo=22676837&exp_hi=&B1=Get+status
http://www.mersenne.org/report_exponent/?exp_lo=22676839&exp_hi=&B1=Get+status
http://www.mersenne.org/report_exponent/?exp_lo=22679021&exp_hi=&B1=Get+status
http://www.mersenne.org/report_exponent/?exp_lo=22679203&exp_hi=&B1=Get+status
http://www.mersenne.org/report_exponent/?exp_lo=22681523&exp_hi=&B1=Get+status
http://www.mersenne.org/report_exponent/?exp_lo=22860017&exp_hi=&B1=Get+status
http://www.mersenne.org/report_exponent/?exp_lo=22860527&exp_hi=&B1=Get+status

http://www.mersenne.org/report_exponent/?exp_lo=22873453&exp_hi=&B1=Get+status
http://www.mersenne.org/report_exponent/?exp_lo=22882129&exp_hi=&B1=Get+status
http://www.mersenne.org/report_exponent/?exp_lo=22893397&exp_hi=&B1=Get+status
http://www.mersenne.org/report_exponent/?exp_lo=23277439&exp_hi=&B1=Get+status
http://www.mersenne.org/report_exponent/?exp_lo=23277557&exp_hi=&B1=Get+status
http://www.mersenne.org/report_exponent/?exp_lo=23704859&exp_hi=&B1=Get+status
http://www.mersenne.org/report_exponent/?exp_lo=23704969&exp_hi=&B1=Get+status

http://www.mersenne.org/report_exponent/?exp_lo=22688521&exp_hi=&B1=Get+status
http://www.mersenne.org/report_exponent/?exp_lo=22833439&exp_hi=&B1=Get+status
http://www.mersenne.org/report_exponent/?exp_lo=22842103&exp_hi=&B1=Get+status
http://www.mersenne.org/report_exponent/?exp_lo=22849927&exp_hi=&B1=Get+status
http://www.mersenne.org/report_exponent/?exp_lo=22856573&exp_hi=&B1=Get+status
http://www.mersenne.org/report_exponent/?exp_lo=22876219&exp_hi=&B1=Get+status
http://www.mersenne.org/report_exponent/?exp_lo=22889017&exp_hi=&B1=Get+status
http://www.mersenne.org/report_exponent/?exp_lo=22892447&exp_hi=&B1=Get+status

---20100111/CUDA

On ubuntu9.04/32bit/GTX260

$ time ./mfaktc.exe 66362159 64 65
mfaktc v0.01  C...
...
no factor for M66362159 from 2^64 to 2^65 bits
tf(): total time spent: 273133msec

real    4m33.207s
user    4m30.925s
sys     0m2.288s

Compiletime Options
  THREADS_PER_GRID    1048576
  THREADS_PER_BLOCK   256
  SIEVE_SIZE_LIMIT    32kiB
  SIEVE_SIZE          230945bits
  SIEVE_PRIMES        50000
  USE_PINNED_MEMORY   enabled
  USE_ASYNC_COPY      enabled
  VERBOSE_TIMING      disabled
  SELFTEST            disabled
  MORE_CLASSES        disabled


---20100109/SIGPS

69991758 2048 12289
I think it cannot be really the last candidate until 70M

$tail 60000000 
69991628 2048 12289
69991640 2048 12289
69991650 2048 61441
69991676 2048 12289
69991692 2048 12289
69991694 2048 12289
69991696 2048 12289
69991730 2048 12289
69991758 2048 12289

project3
12289,40961,61441,65537,86017
f0:2000000-230000000の割算の結果
s0:候補
f1:候補の篩いの結果
s1:残ったもの

2336,414724444161
    ,123456789012

> 69991832 2048 12289
> 69991924 2048 40961
> 69991944 2048 12289
> 69991990 2048 12289
> 69992004 2048 61441
> 69992028 2048 12289
> 69992032 2048 86017
> 69992046 2048 12289
> 69992078 2048 12289
> 69992090 2048 12289
> 69992134 2048 40961
> 69992148 2048 12289
> 69992158 2048 12289
> 69992220 2048 40961
> 69992226 2048 40961
> 69992248 2048 61441
> 69992256 2048 12289
> 69992274 2048 86017
> 69992288 2048 61441
> 69992298 2048 12289
> 69992310 2048 12289
> 69992322 2048 12289
> 69992342 2048 40961
> 69992362 2048 40961
> 69992416 2048 40961
> 69992424 2048 12289
> 69992458 2048 12289
> 69992462 2048 40961
> 69992554 2048 12289
> 69992644 2048 12289
> 69992652 2048 86017
> 69992672 2048 12289
> 69992674 2048 40961
> 69992724 2048 12289
> 69992824 2048 40961
> 69992830 2048 12289
> 69992884 2048 12289
> 69992930 2048 12289
> 69992952 2048 12289
> 69993062 2048 86017
> 69993068 2048 61441
> 69993076 2048 12289
> 69993082 2048 12289
> 69993114 2048 12289
> 69993198 2048 12289
> 69993246 2048 12289
> 69993320 2048 12289
> 69993336 2048 12289
> 69993354 2048 40961
> 69993356 2048 12289
> 69993366 2048 12289
> 69993376 2048 12289
> 69993382 2048 40961
> 69993434 2048 12289
> 69993438 2048 12289
> 69993498 2048 12289
> 69993500 2048 40961
> 69993502 2048 12289
> 69993534 2048 86017
> 69993588 2048 12289
> 69993634 2048 40961
> 69993648 2048 12289
> 69993760 2048 12289
> 69993782 2048 61441
> 69993786 2048 61441
> 69993788 2048 86017
> 69993846 2048 61441
> 69993850 2048 86017
> 69993888 2048 12289
> 69993900 2048 12289
> 69993904 2048 61441
> 69993950 2048 61441
> 69993956 2048 12289
> 69994070 2048 12289
> 69994152 2048 12289
> 69994180 2048 12289
> 69994228 2048 40961
> 69994234 2048 61441
> 69994238 2048 86017
> 69994258 2048 12289
> 69994264 2048 12289
> 69994270 2048 12289
> 69994274 2048 61441
> 69994278 2048 12289
> 69994292 2048 40961
> 69994310 2048 86017
> 69994312 2048 12289
> 69994328 2048 86017
> 69994338 2048 61441
> 69994344 2048 12289
> 69994378 2048 86017
> 69994392 2048 12289
> 69994420 2048 12289
> 69994422 2048 40961
> 69994510 2048 12289
> 69994522 2048 40961
> 69994546 2048 12289
> 69994554 2048 86017
> 69994580 2048 12289
> 69994600 2048 12289
> 69994664 2048 86017
> 69994682 2048 12289
> 69994738 2048 12289
> 69994744 2048 61441
> 69994748 2048 12289
> 69995010 2048 12289
> 69995172 2048 12289
> 69995174 2048 12289
> 69995178 2048 86017
> 69995192 2048 12289
> 69995234 2048 40961
> 69995240 2048 40961
> 69995244 2048 40961
> 69995290 2048 12289
> 69995314 2048 12289
> 69995320 2048 40961
> 69995324 2048 12289
> 69995372 2048 12289
> 69995474 2048 12289
> 69995476 2048 61441
> 69995496 2048 12289
> 69995504 2048 12289
> 69995520 2048 12289
> 69995604 2048 12289
> 69995660 2048 12289
> 69995664 2048 12289
> 69995676 2048 61441
> 69995680 2048 12289
> 69995690 2048 12289
> 69995700 2048 12289
> 69995738 2048 12289
> 69995776 2048 61441
> 69995834 2048 12289
> 69995846 2048 12289
> 69995856 2048 12289
> 69995878 2048 12289
> 69995906 2048 12289
> 69995908 2048 12289
> 69995922 2048 12289
> 69995932 2048 61441
> 69995934 2048 40961
> 69995944 2048 12289
> 69995964 2048 40961
> 69995972 2048 12289
> 69995994 2048 12289
> 69996026 2048 40961
> 69996040 2048 12289
> 69996058 2048 86017
> 69996070 2048 61441
> 69996082 2048 12289
> 69996094 2048 12289
> 69996126 2048 12289
> 69996142 2048 12289
> 69996144 2048 61441
> 69996174 2048 61441
> 69996190 2048 86017
> 69996206 2048 12289
> 69996222 2048 40961
> 69996458 2048 12289
> 69996510 2048 86017
> 69996574 2048 12289
> 69996602 2048 12289
> 69996624 2048 12289
> 69996642 2048 40961
> 69996732 2048 12289
> 69996742 2048 12289
> 69996816 2048 40961
> 69996958 2048 12289
> 69996970 2048 12289
> 69996990 2048 12289
> 69996998 2048 86017
> 69997002 2048 12289
> 69997030 2048 12289
> 69997032 2048 61441
> 69997050 2048 40961
> 69997076 2048 12289
> 69997090 2048 12289
> 69997100 2048 40961
> 69997102 2048 61441
> 69997112 2048 12289
> 69997118 2048 61441
> 69997148 2048 12289
> 69997152 2048 12289
> 69997154 2048 12289
> 69997160 2048 12289
> 69997254 2048 12289
> 69997266 2048 40961
> 69997276 2048 12289
> 69997308 2048 40961
> 69997320 2048 40961
> 69997324 2048 12289
> 69997348 2048 12289
> 69997386 2048 12289
> 69997408 2048 12289
> 69997424 2048 40961
> 69997454 2048 12289
> 69997456 2048 40961
> 69997470 2048 12289
> 69997502 2048 12289
> 69997548 2048 12289
> 69997580 2048 40961
> 69997610 2048 61441
> 69997630 2048 12289
> 69997652 2048 61441
> 69997674 2048 61441
> 69997678 2048 61441
> 69997728 2048 86017
> 69997758 2048 40961
> 69997770 2048 12289
> 69997772 2048 12289
> 69997774 2048 12289
> 69997792 2048 12289
> 69997800 2048 12289
> 69997914 2048 12289
> 69997950 2048 12289
> 69998020 2048 12289
> 69998024 2048 40961
> 69998034 2048 12289
> 69998044 2048 61441
> 69998068 2048 12289
> 69998098 2048 86017
> 69998122 2048 61441
> 69998146 2048 40961
> 69998158 2048 86017
> 69998194 2048 40961
> 69998300 2048 61441
> 69998326 2048 40961
> 69998338 2048 12289
> 69998358 2048 12289
> 69998372 2048 12289
> 69998374 2048 12289
> 69998410 2048 40961
> 69998428 2048 61441
> 69998452 2048 12289
> 69998476 2048 12289
> 69998500 2048 12289
> 69998516 2048 12289
> 69998522 2048 40961
> 69998588 2048 12289
> 69998638 2048 12289
> 69998674 2048 61441
> 69998694 2048 40961
> 69998704 2048 40961
> 69998726 2048 12289
> 69998742 2048 12289
> 69998868 2048 12289
> 69998914 2048 12289
> 69998962 2048 12289
> 69998964 2048 12289
> 69998972 2048 12289
> 69998974 2048 12289
> 69999002 2048 12289
> 69999068 2048 12289
> 69999106 2048 12289
> 69999128 2048 12289
> 69999134 2048 12289
> 69999140 2048 12289
> 69999176 2048 12289
> 69999180 2048 12289
> 69999198 2048 12289
> 69999200 2048 12289
> 69999220 2048 61441
> 69999242 2048 12289
> 69999266 2048 12289
> 69999272 2048 12289
> 69999328 2048 12289
> 69999330 2048 12289
> 69999342 2048 40961
> 69999358 2048 61441
> 69999372 2048 86017
> 69999396 2048 61441
> 69999414 2048 61441
> 69999416 2048 86017
> 69999434 2048 40961
> 69999474 2048 12289
> 69999476 2048 12289
> 69999480 2048 40961
> 69999514 2048 12289
> 69999530 2048 40961
> 69999570 2048 61441
> 69999574 2048 40961
> 69999664 2048 12289
> 69999714 2048 12289
> 69999720 2048 12289
> 69999740 2048 12289
> 69999746 2048 40961
> 69999788 2048 12289
> 69999798 2048 40961
> 69999822 2048 40961
> 69999856 2048 12289
> 69999882 2048 12289
> 69999936 2048 61441
> 69999938 2048 12289
> 69999940 2048 40961
> 69999976 2048 12289
> 69999980 2048 86017

296行

.input 50549184:229999997 2048 2
60000000 357171:69991758 2048 12289
input  50191717:229999997 2048 2

project3/20100110 開始時点のセーブ
project3/20100110.0/input 新 input

2140003^2048+1)/2
2235100^2048+1)/12289
2140398^2048+1)/40961
2235064^2048+1)/61441
65535 は全ては含まれていない。

/project3/20100110.0/s3 setver* から作った。/2 以外。

---20100109/ubuntu 9.04/CUDA 2.3/CUFFT

http://www.mersenne.org/report_exponent/?exp_lo=36042257&exp_hi=&B1=Get+status
http://www.mersenne.org/report_exponent/?exp_lo=22018729&exp_hi=&B1=Get+status
http://www.mersenne.org/report_exponent/?exp_lo=22020091&exp_hi=&B1=Get+status
http://www.mersenne.org/report_exponent/?exp_lo=22021367&exp_hi=&B1=Get+status
http://www.mersenne.org/report_exponent/?exp_lo=22041823&exp_hi=&B1=Get+status
http://www.mersenne.org/report_exponent/?exp_lo=22052431&exp_hi=&B1=Get+status

http://www.mersenne.org/report_exponent/?exp_lo=22676837&exp_hi=&B1=Get+status
http://www.mersenne.org/report_exponent/?exp_lo=22676839&exp_hi=&B1=Get+status
http://www.mersenne.org/report_exponent/?exp_lo=22679021&exp_hi=&B1=Get+status
http://www.mersenne.org/report_exponent/?exp_lo=22679203&exp_hi=&B1=Get+status
http://www.mersenne.org/report_exponent/?exp_lo=22681523&exp_hi=&B1=Get+status
http://www.mersenne.org/report_exponent/?exp_lo=22860017&exp_hi=&B1=Get+status
http://www.mersenne.org/report_exponent/?exp_lo=22860527&exp_hi=&B1=Get+status
http://www.mersenne.org/report_exponent/?exp_lo=22873453&exp_hi=&B1=Get+status
http://www.mersenne.org/report_exponent/?exp_lo=22882129&exp_hi=&B1=Get+status
http://www.mersenne.org/report_exponent/?exp_lo=22893397&exp_hi=&B1=Get+status

---20100104/ubuntu 9.04/CUDA 2.3/CUFFT

http://www.mersenne.org/report_exponent/?exp_lo=36042257&exp_hi=&B1=Get+status
http://www.mersenne.org/report_exponent/?exp_lo=22018729&exp_hi=&B1=Get+status
http://www.mersenne.org/report_exponent/?exp_lo=22020091&exp_hi=&B1=Get+status
http://www.mersenne.org/report_exponent/?exp_lo=22021367&exp_hi=&B1=Get+status
http://www.mersenne.org/report_exponent/?exp_lo=22041823&exp_hi=&B1=Get+status
http://www.mersenne.org/report_exponent/?exp_lo=22052431&exp_hi=&B1=Get+status

---20100103/ubuntu 9.04/CUDA 2.3/CUFFT

 MacLucasFFTW.cuda.J.tar.gz

$  time  ./MacLucasFFTW 33333333
Iteration 10000 M( 33333333 )C, 0xd717246f501c7d94, n = 2097152, MacLucasFFTW v8.1  Ballester
M( 33333333 )C, 0xd717246f501c7d94, n = 2097152, MacLucasFFTW v8.1  Ballester

real    1m46.313s
user    0m1.216s
sys     0m0.348s

2048k fft sec/iter = 0.0107

Hi,
Version "J" at .0107 sec/iter for the 2048K
(This version support only 2048k) on GTX260.

$  time  ./MacLucasFFTW 216091
Iteration 10000 M( 216091 )C, 0x30247786758b8792, n = 2097152, MacLucasFFTW v8.1  Ballester
...
Iteration 210000 M( 216091 )C, 0xcfe091c8f59f8a7b, n = 2097152, MacLucasFFTW v8.1  Ballester
M( 216091 )P, n = 2097152, MacLucasFFTW v8.1  Ballester

real    41m36.711s
user    0m21.581s
sys     0m4.932s

$ time ./MacLucasFFTW 33333333
Iteration 10000 M( 33333333 )C, 0xd717246f501c7d94, n = 8388608, MacLucasFFTW v8.1  Ballester
M( 33333333 )C, 0xd717246f501c7d94, n = 8388608, MacLucasFFTW v8.1  Ballester

real    9m4.028s
user    0m2.080s
sys     0m0.600s

---20100103/ubuntu 9.04/CUDA 2.3/CUFFT

$  time  ./MacLucasFFTW 33333333
Iteration 10000 M( 33333333 )C, 0xd717246f501c7d94, n = 2097152, MacLucasFFTW v8.1  Ballester
M( 33333333 )C, 0xd717246f501c7d94, n = 2097152, MacLucasFFTW v8.1  Ballester

real    1m46.683s
user    0m1.252s
sys     0m0.396s

---20100102/ubuntu 9.04/CUDA 2.3/CUFFT

$  ../../bin/linux/release/transposeNew
[TransposeNew]
> Device 0: "GeForce GTX 260"
> SM Capability 1.3 detected:
> CUDA device has 27 Multi-Processors
> SM performance scaling factor = 1.00

Matrix size: 2048x2048 (64x64 tiles), tile size: 32x32, block size: 32x8

Kernel                  Loop over kernel        Loop within kernel
------                  ----------------        ------------------
simple copy             91.25 GB/s              91.48 GB/s
shared memory copy      88.07 GB/s              90.39 GB/s
naive transpose          4.56 GB/s               4.54 GB/s
coalesced transpose     63.89 GB/s              70.29 GB/s
no bank conflict trans  71.45 GB/s              70.85 GB/s
coarse-grained          71.42 GB/s              70.80 GB/s
fine-grained            87.92 GB/s              90.78 GB/s
diagonal transpose      69.93 GB/s              73.15 GB/s

Test PASSED

Press ENTER to exit...

float -> double

$  ../../bin/linux/release/transposeNew
[TransposeNew]
> Device 0: "GeForce GTX 260"
> SM Capability 1.3 detected:
> CUDA device has 27 Multi-Processors
> SM performance scaling factor = 1.00

Matrix size: 2048x2048 (64x64 tiles), tile size: 32x32, block size: 32x8

Kernel                  Loop over kernel        Loop within kernel
------                  ----------------        ------------------
simple copy             92.16 GB/s              83.95 GB/s
shared memory copy      88.38 GB/s              92.47 GB/s
naive transpose          5.27 GB/s               5.36 GB/s
coalesced transpose     73.44 GB/s              72.82 GB/s
no bank conflict trans  74.19 GB/s              72.69 GB/s
coarse-grained          74.18 GB/s              72.69 GB/s
fine-grained            88.35 GB/s              92.45 GB/s
diagonal transpose      83.29 GB/s              90.03 GB/s

Test PASSED

Press ENTER to exit...

AA=0
$  time  ./MacLucasFFTW 33333333
Iteration 10000 M( 33333333 )C, 0xd717246f501c7d94, n = 2097152, MacLucasFFTW v8.1  Ballester
M( 33333333 )C, 0xd717246f501c7d94, n = 2097152, MacLucasFFTW v8.1  Ballester

real    1m56.451s
user    0m1.148s
sys     0m0.280s

AA=8
$   time  ./MacLucasFFTW 33333333
Iteration 10000 M( 33333333 )C, 0xd717246f501c7d94, n = 2097152, MacLucasFFTW v8.1  Ballester
M( 33333333 )C, 0xd717246f501c7d94, n = 2097152, MacLucasFFTW v8.1  Ballester

real    1m56.262s
user    0m1.156s
sys     0m0.312s

AA=16
$  time  ./MacLucasFFTW 33333333
Iteration 10000 M( 33333333 )C, 0xd717246f501c7d94, n = 2097152, MacLucasFFTW v8.1  Ballester
M( 33333333 )C, 0xd717246f501c7d94, n = 2097152, MacLucasFFTW v8.1  Ballester

real    1m56.293s
user    0m1.148s
sys     0m0.316s
$  time  ./MacLucasFFTW 63333333
Iteration 10000 M( 63333333 )C, 0xa5d7a917d728239a, n = 4194304, MacLucasFFTW v8.1  Ballester
M( 63333333 )C, 0xa5d7a917d728239a, n = 4194304, MacLucasFFTW v8.1  Ballester

real    3m40.366s
user    0m1.576s
sys     0m0.332s

#define NCPU  2048
#define IDX(i) ( ( (i) >> 11 ) +  (((i) & 2047) << 11)  )
        UL j;
        const int numThreads = blockDim.x * gridDim.x;
        const int threadID = blockIdx.x * blockDim.x + threadIdx.x;
        g_N = 1024 * 2048
        const int stride = g_N/NCPU; 1024
        const int js= stride * threadID;
        const int je= js + stride;
        BIG_DOUBLE temp0,tempErr;
        BIG_DOUBLE maxerr=0.0,err=0.0;
        BIG_DOUBLE carry;

 0-1023,1024-2047

---20091231/ps3

 7612 ?        RNl  130441:55 ./MacLucasFFTW.k.6 -c1000000 c43112609
26732 ?        RN   61589:58 ./dg2-1.043.ps3.out input

Hi,

My PS3 vrified M43112609.

$ ./MacLucasFFTW.k.6 -c1000000 43112609
...
 43100001 4194304
 43110001 4194304
M( 43112609 )P, n = 4194304, MacLucasFFTW v8.1  Ballester

ELAPS TIME 59 days

Thank you,

---20091227/ubuntu 9.04/CUDA 2.3/CUFFT

MacLucasFFTW.cuda.H.tar.gz

$ time  ./MacLucasFFTW 33333333
 time  ./MacLucasFFTW 63333333
Iteration 10000 M( 33333333 )C, 0xd717246f501c7d94, n = 2097152, MacLucasFFTW v8.1  Ballester
M( 33333333 )C, 0xd717246f501c7d94, n = 2097152, MacLucasFFTW v8.1  Ballester

real    1m56.446s
user    0m1.296s
sys     0m0.320s
$ time  ./MacLucasFFTW 63333333
Iteration 10000 M( 63333333 )C, 0xa5d7a917d728239a, n = 4194304, MacLucasFFTW v8.1  Ballester
M( 63333333 )C, 0xa5d7a917d728239a, n = 4194304, MacLucasFFTW v8.1  Ballester

real    3m40.547s
user    0m1.520s
sys     0m0.388s

$ time ./MacLucasFFTW 216091
Iteration 10000 M( 216091 )C, 0x30247786758b8792, n = 2097152, MacLucasFFTW v8.1  Ballester
...
Iteration 210000 M( 216091 )C, 0xcfe091c8f59f8a7b, n = 2097152, MacLucasFFTW v8.1  Ballester
M( 216091 )P, n = 2097152, MacLucasFFTW v8.1  Ballester

real    43m39.499s
user    0m19.289s
sys     0m4.268s

Hi,
Version "H" at .0117 sec/iter for the 2048K FFT and .0221 sec/iter for the 4096K FFT on GTX260.
and GTX260 result.

http://www.mersenne.org/report_exponent/?exp_lo=36000127&exp_hi=&B1=Get+status
http://www.mersenne.org/report_exponent/?exp_lo=36000521&exp_hi=&B1=Get+status
http://www.mersenne.org/report_exponent/?exp_lo=36010921&exp_hi=&B1=Get+status
http://www.mersenne.org/report_exponent/?exp_lo=36007753&exp_hi=&B1=Get+status

---20091227/memo

ps4

 7612 ?        RNl  130441:55 ./MacLucasFFTW.k.6 -c1000000 c43112609
26732 ?        RN   61589:58 ./dg2-1.043.ps3.out input

ELAPS TIME 59 days

ps3

 1879 ?        S      0:00 time ./f26.ps3.1.out f26.param
 1880 ?        Rl   182157:55 ./f26.ps3.1.out f26.param
32654 ?        RN   146379:00 ./dg2-1.043.ps3.out input



http://www.mersenne.org/report_exponent/?exp_lo=36000127&exp_hi=&B1=Get+status
http://www.mersenne.org/report_exponent/?exp_lo=36000521&exp_hi=&B1=Get+status
http://www.mersenne.org/report_exponent/?exp_lo=36010921&exp_hi=&B1=Get+status
http://www.mersenne.org/report_exponent/?exp_lo=36007753&exp_hi=&B1=Get+status

---20091222/ubuntu 9.04/CUDA 2.3/CUFFT

4096*4096*8=128Mbyte

---20091221/ubuntu 9.04/CUDA 2.3/CUFFT

$ time ./MacLucasFFTW 11111111
Iteration 1000 M( 11111111 )C, 0x563e0cb0741ec03d, n = 2097152, MacLucasFFTW v8.1  Ballester
M( 11111111 )C, 0x563e0cb0741ec03d, n = 2097152, MacLucasFFTW v8.1  Ballester

real    0m13.486s
user    0m0.440s
sys     0m0.216s

$  time ./MacLucasFFTW 11111111
Iteration 1000 M( 11111111 )C, 0x563e0cb0741ec03d, n = 1048576, MacLucasFFTW v8.1  Ballester
M( 11111111 )C, 0x563e0cb0741ec03d, n = 1048576, MacLucasFFTW v8.1  Ballester

real    0m7.449s
user    0m0.368s
sys     0m0.108s

$ time ./MacLucasFFTW 1111111
Iteration 1000 M( 1111111 )C, 0x59722dd388ae5361, n = 2097152, MacLucasFFTW v8.1  Ballester
M( 1111111 )C, 0x59722dd388ae5361, n = 2097152, MacLucasFFTW v8.1  Ballester

real    0m12.770s
user    0m0.468s
sys     0m0.156s

$ time ./MacLucasFFTW 1111111
Iteration 1000 M( 1111111 )C, 0x59722dd388ae5361, n = 65536, MacLucasFFTW v8.1  Ballester
M( 1111111 )C, 0x59722dd388ae5361, n = 65536, MacLucasFFTW v8.1  Ballester

real    0m3.107s
user    0m0.208s
sys     0m0.092s

$ time ./MacLucasFFTW 2222222
Iteration 1000 M( 2222222 )C, 0xbddaec90077bf804, n = 2097152, MacLucasFFTW v8.1  Ballester
M( 2222222 )C, 0xbddaec90077bf804, n = 2097152, MacLucasFFTW v8.1  Ballester

real    0m12.705s
user    0m0.464s
sys     0m0.160s

$ time ./MacLucasFFTW 2222222
Iteration 1000 M( 2222222 )C, 0xbddaec90077bf804, n = 131072, MacLucasFFTW v8.1  Ballester
M( 2222222 )C, 0xbddaec90077bf804, n = 131072, MacLucasFFTW v8.1  Ballester

real    0m4.029s
user    0m0.280s
sys     0m0.080s

$ time ./MacLucasFFTW 4444444
Iteration 1000 M( 4444444 )C, 0x2843bac364b588ae, n = 2097152, MacLucasFFTW v8.1  Ballester
M( 4444444 )C, 0x2843bac364b588ae, n = 2097152, MacLucasFFTW v8.1  Ballester

real    0m12.689s
user    0m0.460s
sys     0m0.188s

$ time ./MacLucasFFTW 4444444
Iteration 1000 M( 4444444 )C, 0x2843bac364b588ae, n = 262144, MacLucasFFTW v8.1  Ballester
M( 4444444 )C, 0x2843bac364b588ae, n = 262144, MacLucasFFTW v8.1  Ballester

real    0m4.528s
user    0m0.256s
sys     0m0.112s

---20091220/ubuntu 9.04/CUDA 2.3/CUFFT

2048*2048=2^22

---20091219/ubuntu 9.04/CUDA 2.3/CUFFT

http://forums.nvidia.com/index.php?showtopic=69801&hl=CUFFT

CUDA_INSTALL_PATH ?= /usr/local/cuda
INCLUDES  := -I. -I$(CUDA_INSTALL_PATH)/include

float -> double

$ ./FFT 

Device: GeForce GTX 260, 1404 MHz clock, 895 MB memory.
             --------CUFFT-------  ---This prototype---
   N   Batch Gflop/s  GB/s  error  Gflop/s  GB/s  error
   8  524288    8.7    9.3   1.7     83.1   88.7   1.6
  64   65536   55.0   29.3   2.4    168.5   89.9   2.5
 512    8192  199.0   70.7   2.1    205.1   72.9   2.5

Errors are supposed to be of order of 1 (ULPs).

Device: GeForce GTX 260, 1404 MHz clock, 895 MB memory.
Compiled with CUDA 2030.
             --------CUFFT-------  ---This prototype---  ---two way---
   N   Batch Gflop/s  GB/s  error  Gflop/s  GB/s  error  Gflop/s error
   8 1048576    8.6    9.1   1.7     83.4   89.0   1.6     83.5   2.1
  16  524288   19.3   15.4   2.1    108.0   86.4   1.3    108.0   1.9
  64  131072   55.4   29.6   2.5    170.0   90.7   2.5    168.0   2.9
 256   32768  167.3   66.9   1.7    166.0   66.4   2.0    165.9   2.9
 512   16384  199.8   71.0   2.1    206.2   73.3   2.5    206.5   3.7
1024    8192  188.1   60.2   2.3    177.1   56.7   2.5    177.3   3.9
2048    4096  171.0   49.7   2.6    140.8   41.0   3.0    140.6   4.5
4096    2048  162.4   43.3   2.3    151.6   40.4   3.3    151.4   4.9
8192    1024  166.8   41.1   2.3    164.3   40.4   3.4    163.9   5.2

Errors are supposed to be of order of 1 (ULPs).