Page 17 of 26

Re: Ramz Tours & TESTS

Posted: Thu Mar 28, 2024 1:14 am
by OrgZ
Sedat Canbaz wrote:
Wed Mar 27, 2024 8:45 pm
Sedat Canbaz wrote:
Wed Mar 27, 2024 8:43 pm
Dear Sedat great idea!
My cutechess settings
Depth: 13 plies
8 concurrent games


UHO_2022_8mvs_+170_+179.pgn
https://pixeldrain.com/u/Bx4KaYtK
Dear Tanick,

Thanks again....

Yes.. the idea is not so so bad... )

But at least I can say,
The planning new test is never seen before for sure )
But you know, my tours, testings do not looks to others..

I wonder really...and in my opinion is lottery to see, to determine
the real Elo difference.. if two Top engines and close in strength...
Sure I mean if running via UHO and just 150 games (per player)

But anyhow, after all,
Time will tell about who will win each time.. right ? )

Btw, after quick checking...(your previous test)
I noticed very bad games..just one example,
Which is ended in 12 moves....and the other game
Also ended in 12 moves.. I mean same opening as twice..

Code: Select all

[Event "My Tournament"]
[Site "?"]
[Date "2024.03.24"]
[Round "66"]
[White "Stockfish 240322_avx2"]
[Black "Marauders 3.0_avx2"]
[Result "1-0"]
[ECO "A10"]
[PlyCount "24"]
[EventDate "2024.??.??"]
[TimeControl "60+1"]

1. c4 {book} d6 {book} 2. d4 {book} f5 {book} 3. Nc3 {book} Nf6 {book} 4. Bg5 {
book} g6 {book} 5. Bxf6 {book} exf6 {book} 6. e3 {book} Bg7 {book} 7. Bd3 {book
} f4 {-1.62/19 4.2s} 8. exf4 {+1.89/19 3.7s} f5 {-1.83/20 4.3s} 9. Nf3 {
+1.91/19 2.3s} O-O {-1.90/19 4.7s} 10. h4 {+2.05/17 1.4s} c5 {-1.82/19 7.0s}
11. d5 {+1.77/20 5.3s} Re8+ {-1.82/19 5.1s} 12. Kf1 {+1.82/19 2.1s} h5 {
-1.82/19 3.0s, White wins by adjudication} 1-0
What I can say more,
FUN is important....but at least the openings should
A little bit more serious... otherwise...what a pity that,
I can not say such as FUN.. just waste of time...

And let's hope to see, to appear less similar games ))
But via UHO openings... no any guarantee..right ?? ))

Greetings
That's my adjudication settings, where both engines agree on a draw, loss or victory ;)
you can see the evaluation is over +1.80, if still not so clear look at other adjudications you will see. ;)

Re: Ramz Tours & TESTS

Posted: Thu Mar 28, 2024 1:29 am
by Sedat Canbaz
OrgZ wrote:
Thu Mar 28, 2024 1:14 am
Sedat Canbaz wrote:
Wed Mar 27, 2024 8:45 pm
Sedat Canbaz wrote:
Wed Mar 27, 2024 8:43 pm
Dear Sedat great idea!
My cutechess settings
Depth: 13 plies
8 concurrent games


UHO_2022_8mvs_+170_+179.pgn
https://pixeldrain.com/u/Bx4KaYtK
Dear Tanick,

Thanks again....

Yes.. the idea is not so so bad... )

But at least I can say,
The planning new test is never seen before for sure )
But you know, my tours, testings do not looks to others..

I wonder really...and in my opinion is lottery to see, to determine
the real Elo difference.. if two Top engines and close in strength...
Sure I mean if running via UHO and just 150 games (per player)

But anyhow, after all,
Time will tell about who will win each time.. right ? )

Btw, after quick checking...(your previous test)
I noticed very bad games..just one example,
Which is ended in 12 moves....and the other game
Also ended in 12 moves.. I mean same opening as twice..

Code: Select all

[Event "My Tournament"]
[Site "?"]
[Date "2024.03.24"]
[Round "66"]
[White "Stockfish 240322_avx2"]
[Black "Marauders 3.0_avx2"]
[Result "1-0"]
[ECO "A10"]
[PlyCount "24"]
[EventDate "2024.??.??"]
[TimeControl "60+1"]

1. c4 {book} d6 {book} 2. d4 {book} f5 {book} 3. Nc3 {book} Nf6 {book} 4. Bg5 {
book} g6 {book} 5. Bxf6 {book} exf6 {book} 6. e3 {book} Bg7 {book} 7. Bd3 {book
} f4 {-1.62/19 4.2s} 8. exf4 {+1.89/19 3.7s} f5 {-1.83/20 4.3s} 9. Nf3 {
+1.91/19 2.3s} O-O {-1.90/19 4.7s} 10. h4 {+2.05/17 1.4s} c5 {-1.82/19 7.0s}
11. d5 {+1.77/20 5.3s} Re8+ {-1.82/19 5.1s} 12. Kf1 {+1.82/19 2.1s} h5 {
-1.82/19 3.0s, White wins by adjudication} 1-0
What I can say more,
FUN is important....but at least the openings should
A little bit more serious... otherwise...what a pity that,
I can not say such as FUN.. just waste of time...

And let's hope to see, to appear less similar games ))
But via UHO openings... no any guarantee..right ?? ))

Greetings
That's my adjudication settings, where both engines agree on a draw, loss or victory ;)
you can see the evaluation is over +1.80, if still not so clear look at other adjudications you will see. ;)
Thanks ..now is more clear.. :)

But if still not so clear..I suggest to increase your GUI's adjudication settings ;)

Btw,
You may know or not...normally, I am accustomed well with UHO openings
E.g a lot of UHO openings games are ended in loss/win in less than 35 moves
And if I am not wrong...my used adjudication settings were above than + 5

For more info:
https://sites.google.com/site/computers ... testings-1

Greetings

Re: Ramz Tours & TESTS

Posted: Thu Mar 28, 2024 1:31 am
by Sedat Canbaz
OrgZ wrote:
Thu Mar 28, 2024 1:05 am
Thanks again for the wonderful tests Sedat! :D

and it is important to note that Hash size doesn't always favour both engines. some perform better in low hash and others in larger ones. and this is a bit difficult to do bcz then you would have to run a series of tests to try to find each engines best hash size.... :?
Not at all...dear Tanick :difus_19

Greetings

Re: Ramz Tours & TESTS

Posted: Thu Mar 28, 2024 5:44 pm
by OrgZ
Interesting by Clover :?

Code: Select all

TC: 1m+1s
suite: Balsa_270423.pg

Score of Alexandria-6.1.0-avx2 vs viridithas-12.0.0-x86_64-avx2: 4 - 1 - 20 [0.560]
...      Alexandria-6.1.0-avx2 playing White: 3 - 1 - 9  [0.577] 13
...      Alexandria-6.1.0-avx2 playing Black: 1 - 0 - 11  [0.542] 12
...      White vs Black: 3 - 2 - 20  [0.520] 25
Elo difference: 41.9 +/- 60.1, LOS: 91.0 %, DrawRatio: 80.0 %
25 of 25 games finished.

Score of Alexandria-6.1.0-avx2 vs clover_6119_64_ja: 1 - 6 - 18 [0.400]
...      Alexandria-6.1.0-avx2 playing White: 1 - 2 - 10  [0.462] 13
...      Alexandria-6.1.0-avx2 playing Black: 0 - 4 - 8  [0.333] 12
...      White vs Black: 5 - 2 - 18  [0.560] 25
Elo difference: -70.4 +/- 70.5, LOS: 2.9 %, DrawRatio: 72.0 %
25 of 25 games finished.

Score of velvet-v7.1.0-x86_64-avx2 vs clover_6119_64_ja: 0 - 10 - 15 [0.300]
...      velvet-v7.1.0-x86_64-avx2 playing White: 0 - 4 - 9  [0.346] 13
...      velvet-v7.1.0-x86_64-avx2 playing Black: 0 - 6 - 6  [0.250] 12
...      White vs Black: 6 - 4 - 15  [0.540] 25
Elo difference: -147.2 +/- 81.5, LOS: 0.1 %, DrawRatio: 60.0 %
25 of 25 games finished.
games
https://pixeldrain.com/u/GQESK3EW

Re: Ramz Tours & TESTS

Posted: Thu Mar 28, 2024 6:10 pm
by OrgZ
Mr Bob v1.3.0 vs Clover 6.1.1.9:?

Code: Select all

TC:1m+1s
Suite: Balsa
Score of bob_avx2 vs clover_6119_64_ja: 0 - 22 - 3 [0.060]
...      bob_avx2 playing White: 0 - 11 - 2  [0.077] 13
...      bob_avx2 playing Black: 0 - 11 - 1  [0.042] 12
...      White vs Black: 11 - 11 - 3  [0.500] 25
Elo difference: -478.0 +/- nan, LOS: 0.0 %, DrawRatio: 12.0 %
25 of 25 games finished.
games:
https://pixeldrain.com/u/BT4ysmFs

Re: Ramz Tours & TESTS

Posted: Thu Mar 28, 2024 7:29 pm
by Homayoun
Thanks for the test. Alexandria and clover can be good opponents for each other. Mr. Bob is very weak for clover.
Best regards

Re: Ramz Tours & TESTS

Posted: Fri Mar 29, 2024 5:15 am
by OrgZ
Homayoun wrote:
Thu Mar 28, 2024 7:29 pm
Thanks for the test. Alexandria and clover can be good opponents for each other. Mr. Bob is very weak for clover.
Best regards
:difus_19
i was expecting a little more from Mr bob. but i guess clover became stronger :?

Re: Ramz Tours & TESTS

Posted: Fri Mar 29, 2024 8:29 am
by Homayoun
OrgZ wrote:
Fri Mar 29, 2024 5:15 am
Homayoun wrote:
Thu Mar 28, 2024 7:29 pm
Thanks for the test. Alexandria and clover can be good opponents for each other. Mr. Bob is very weak for clover.
Best regards
:difus_19
i was expecting a little more from Mr bob. but i guess clover became stronger :?
Yes, Clover 6.1.19 is completely different engine from Clover 6.1. Playing style, evaluation, totally every parameter has improved. Much more reliable engine now.
Best regards

Re: Ramz Tours & TESTS

Posted: Sat Mar 30, 2024 4:35 pm
by OrgZ
Greetings All!

This is my way of testing books in a fun way. This way i get to see real talent and not just E4,d4 lines from tournaments.
I use openings that are not long, but very short lines. I will utilize vast engines in various tests that i will do.
Tc: 30s + 0.6s....Hash 64, i may increase the hash and threads....Running 14th Gen Core i7-14700K Desktop Processor having a total of 20 cores, divided into 8 performance cores (P) and 12 efficiency cores (E). ;)

Engine: Raid v3.5
TC: 30m+0.6s
Engine 1 Engine 2 in that order
Pheonix vs Olisfish270324

Code: Select all

Score of Engine 1 vs Engine 2: 14 - 18 - 88 [0.483]
...      Engine 1 playing White: 12 - 3 - 45  [0.575] 60
...      Engine 1 playing Black: 2 - 15 - 43  [0.392] 60
...      White vs Black: 27 - 5 - 88  [0.592] 120
Elo difference: -11.6 +/- 32.1, LOS: 24.0 %, DrawRatio: 73.3 %
120 of 120 games finished.
congrats Skynet!, though i would say, buddy were are lucky, Hurnavich was leading :lol:

Engine 1 Engine 2 in that order
Optimus2403 vs Olisfish270324

Code: Select all

Score of Engine 1 vs Engine 2: 18 - 8 - 94 [0.542]
...      Engine 1 playing White: 16 - 2 - 42  [0.617] 60
...      Engine 1 playing Black: 2 - 6 - 52  [0.467] 60
...      White vs Black: 22 - 4 - 94  [0.575] 120
Elo difference: 29.0 +/- 28.7, LOS: 97.5 %, DrawRatio: 78.3 %
120 of 120 games finished.
congrats Conrog!


games:
https://pixeldrain.com/l/vC7FJ5Mw

Re: Ramz Tours & TESTS

Posted: Sat Mar 30, 2024 8:35 pm
by Homayoun
:difus_19