So my first try at a custom strap didn’t work (GPU crashed almost immediately when mining ETH). custom 1900: 1500RAS, 1625CAS, MISC2, & ARB 777000000000000022CC1C00AD515A3ED0570F15B98CA50A004AE7001C0714207A8900A0030000001B11353F922A3217
A straight copy of the 1625 strap to 2000 works fine, while the 1500 strap gave errors even at 1900. I tried taking the 1900 strap, RAS from the 1500, and CAS, MISC2 & ARB2 from the 1625 strap and using it for the 2000 strap.
My friend, you have a lot to learn… I was like you a few weeks ago. Then I read all the documentation regarding GDDR5 and, with a little help (well… not so little), I managed to understand what those timings actually do. Keep up the good work, by the way! EDIT: I am also very keen to understand HBM/2 timings. If anyone has some knowledge of those (I already know the mode registers), any help via PM is highly appreciated!
I know how to make an optimized strap, just like I know how to re-shingle a shed. But tying a tarp over the roof is a lot easier…
Now that the strap tools are out, let’s talk about how to optimize the timings. I want to start with ETH since it is the simplest (and coincidentally the most profitable ATM).
Ethash is many 128-byte random DAG reads, 8KB of them per hash, so 20MH/s requires 160GB/s of random read bandwidth. For AMD cards 128 bytes is 2 cache lines of 64 bytes each, and each cache line fill reads 32 bytes from 2 GDDR5 memory chips. Each 32-byte GDDR5 read burst takes 2 clocks, so when the RAM is clocked at 2GHz, the data will be transferred in 1ns (each bit takes just 125ps!).
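The arithmetic above can be checked with a short sketch. The constant names are made up for illustration, and the burst length of 8 bits per pin is an assumption based on the 125ps-per-bit and 1ns figures quoted in the post.

```python
# Back-of-the-envelope Ethash bandwidth, using the figures from the post above.
READS_PER_HASH = 64       # random DAG reads per hash
BYTES_PER_READ = 128      # size of one DAG entry in bytes
BITS_PER_BURST = 8        # bits per pin in one read burst (assumed BL8)
BIT_TIME_PS = 125         # per-bit time quoted for a 2GHz memory clock


def required_bandwidth_gb_s(hashrate_mh_s: float) -> float:
    """Random-read bandwidth in GB/s (1 GB = 1e9 bytes) for a given MH/s."""
    return hashrate_mh_s * 1e6 * READS_PER_HASH * BYTES_PER_READ / 1e9


bytes_per_hash = READS_PER_HASH * BYTES_PER_READ
burst_time_ns = BITS_PER_BURST * BIT_TIME_PS / 1000

print(bytes_per_hash)               # 8192 bytes, i.e. 8KB per hash
print(required_bandwidth_gb_s(20))  # about 164, rounded to 160GB/s in the post
print(burst_time_ns)                # 1.0 ns per 32-byte burst
```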
Here’s a couple references to help the noobs get started:
I’m not going to do one long post, so as to make this more readable. For the more experienced folks, here’s a taste of ideas to come: set tFAW and t32AW to 0. Even Hynix’s old H5GQ1H24AFR has FAW (23ns) =~ 4 * RRD (5.5ns), so virtually all modern GDDR5 should work fine without FAW and 32AW limits. I get 27.0Mh with sgminer on my Rx470/K4G4 clocked at 2Ghz, tRRD=5, tFAW=0. Zeroing t32AW gives a bump to 27.35Mh.
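To see how little the FAW limit adds on that Hynix part, here is a quick check using only the two datasheet figures quoted above (the variable names are illustrative):

```python
# Compare the quoted tFAW with four back-to-back tRRD intervals.
T_FAW_NS = 23.0   # four-activate window for H5GQ1H24AFR, as quoted above
T_RRD_NS = 5.5    # activate-to-activate delay (different banks), as quoted above

four_rrd_ns = 4 * T_RRD_NS
extra_ns = T_FAW_NS - four_rrd_ns   # extra wait tFAW imposes beyond 4 * tRRD

print(four_rrd_ns)  # 22.0
print(extra_ns)     # 1.0
```

Since tRRD alone already spaces four activates 22ns apart, the FAW window only costs about 1ns more on this chip.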
Since everyone is sharing now, I suppose I’ll put what I’ve come up with out here. Running -125mv 470 Nitro Sapphire 8GB with Samsung memory, with ETH hitting between 28.5MH/s and 29.2MH/s @1140 core and @2100 mem, pulling around 920 watts at the wall with 6 GPUs per rig. On XMR hitting 785h/s to 795h/s @1170 core and @2100 mem, pulling around 660 watts at the wall with 6 GPUs per rig. Also running ethOS 1.2.0. I’m here to learn more about the mistakes I made on the mod and see what others in the community have come up with.
Here is the strap I’ve put together: 777000000000000022CC1C00106A5B47C0570E16B08C05090068C70014051420FA8900A003000000190D2F399D2D2E17
The timings from wolf and ohgodagirls vbios decode tools release:
TRCDW = 16
TRCDWA = 16
TRCDR = 26
TRCDRA = 22
TRRD = 5
TRC = 71
Pad0 = 0
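For anyone curious how those numbers come out of the strap, here is a minimal sketch that reproduces them. The byte offset (12) and the bit-field widths are assumptions chosen so that the result matches the values posted above; this is not taken from the decode tool’s source, and `decode_ras` is a made-up helper name.

```python
import struct

STRAP = (
    "777000000000000022CC1C00106A5B47C0570E16B08C05090068C700"
    "14051420FA8900A003000000190D2F399D2D2E17"
)

# ASSUMPTION: the RAS timing word is the little-endian 32-bit word at byte 12.
RAS_OFFSET = 12

# ASSUMPTION: field widths in bits, listed from the least significant bit up.
RAS_FIELDS = [
    ("TRCDW", 5), ("TRCDWA", 5), ("TRCDR", 5), ("TRCDRA", 5),
    ("TRRD", 4), ("TRC", 7), ("Pad0", 1),
]


def decode_ras(strap_hex: str) -> dict:
    """Split the assumed RAS timing word of a strap into named bit fields."""
    raw = bytes.fromhex(strap_hex)
    (word,) = struct.unpack_from("<I", raw, RAS_OFFSET)
    fields = {}
    shift = 0
    for name, width in RAS_FIELDS:
        fields[name] = (word >> shift) & ((1 << width) - 1)
        shift += width
    return fields


timings = decode_ras(STRAP)
for name, value in timings.items():
    print(f"{name} = {value}")
```

Going the other way (editing a field and re-packing the word) is the same shifts and masks in reverse.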
Thanks will give it a shot when I get back on a computer.
You should update to the version with my changes that show CAS timing. I see you’re using CL=22. With Samsung CL=21 I was getting errors at 2100 (OK at 2000). I’ll give 22 a try. Here’s what I was using @2000: 555000000000000022CC1C00CE595B3ED0570F1531CB2409004007000B0314207A8900A003000000170F2E36922A3217
I am seeing HW errors with the current mod I’m running, but not so many that they affect the hash rate reported by the pool. I will be able to test more once I get back to my computer.
It was a clean ROM, with no modding other than the SEQ_RAS params; there is a deliberate error for you to figure out. Hint: MC_SEQ_MISC_TIMING EDIT: OhGodAGirl format:
The registers in RX and pre-RX cards are obviously at different offsets, but additionally there is no way to decode MISC with the same decoder and get reasonably similar values for the same memory type on RX and R9 cards.
EDIT: For those who are wondering why TRP_WRA=006 for Elpida in R9, my theory is that it is a bug in the BIOS (6 bits were designated for the field) and the 64 from 70 (= 64 + 6) was cut off.
I’ve started doing the detailed analysis on memory timing for Eth mining.
With tRRD=6, tRC=62, tCL=21 and 2000 mem clock, I can get almost 27Mh/s mining eth. Each hash takes 64 random DAG reads of 128 bytes each, and since they are random, each read should be from a different page. As well, the L2 cache hit rate should be near 0, so each DAG access requires a read from GDDR (2×32-byte reads from 2 GDDR chips).
Before reading, a page (row) has to be activated (opened), so 27Mh * 64 activates = 1728M activates per second. The Rx470/480 has 4 independent cache controllers, so a single GDDR5 chip will open 432M pages per second. With a 2Ghz mem clock, that’s about 5 (4.63) clocks per activate. The closer that gets to 4, the better. Lower than 4 is not possible with Eth mining, since it takes 4 clocks to transfer 64 bytes (half a DAG entry). Note that if tRRD=6 means 6 clocks, then some other timing factor must be allowing the RAM to sustain <5 clocks per activate.
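Here is the same activate-rate calculation as a small sketch, so the numbers can be re-run for other hash rates or clocks. The names are illustrative and the inputs are the ones given in this post.

```python
# Activate rate implied by the hash rate, using the figures from this post.
HASHRATE_HZ = 27e6       # 27Mh/s
READS_PER_HASH = 64      # random DAG reads, each opening a new page
CONTROLLERS = 4          # independent controllers on the Rx470/480, per the post
MEM_CLOCK_HZ = 2e9       # 2Ghz memory clock

activates_per_s = HASHRATE_HZ * READS_PER_HASH
activates_per_chip = activates_per_s / CONTROLLERS
clocks_per_activate = MEM_CLOCK_HZ / activates_per_chip

print(activates_per_s / 1e6)          # 1728.0 (M activates per second)
print(activates_per_chip / 1e6)       # 432.0 (M pages per second per chip)
print(round(clocks_per_activate, 2))  # 4.63
```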
I tried tRRD=5, and it only makes a small (~1%) improvement. That makes sense, since RRD is the delay between 2 activate commands when they are going to different banks. With only 16 banks, the memory controller has lots of opportunity to batch activate commands together in the same bank. However tRC is defined as, “The minimum time interval between two successive ACTIVE commands on the same bank”. With tRC=62, the fastest access pattern would be to spread the accesses across different banks rather than batching them in the same bank.
So it seems I’m missing something about how the RAM timing works. I know there are multiple clocks for GDDR5, and some run at double data rate (i.e. WCK). If tRRD=6 means six DDR address clocks, that would be 3 SDR command clocks (2Ghz is the command clock rate).
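The puzzle can be put into numbers with a deliberately simplified model. It assumes the only limits are tRRD, tRC spread evenly over 16 banks, and the 4-clock data transfer; real controllers have more constraints, so treat this as a rough sketch with made-up names rather than a timing simulator.

```python
# Simplified lower bound on clocks per activate (ignores all other timings).
BANKS = 16        # banks per chip, as mentioned above
T_RC = 62         # clocks between activates to the same bank
DATA_CLOCKS = 4   # clocks to transfer 64 bytes (half a DAG entry)


def floor_clocks_per_activate(t_rrd_clocks: float) -> float:
    """Best case: the slowest of the tRRD, per-bank tRC and data-transfer limits."""
    return max(t_rrd_clocks, T_RC / BANKS, DATA_CLOCKS)


print(T_RC / BANKS)                      # 3.875, tRC limit with all banks in use
print(floor_clocks_per_activate(6))      # 6, if tRRD=6 counts command clocks
print(floor_clocks_per_activate(6 / 2))  # 4, if tRRD=6 counts DDR clocks
```

Under this model, fewer than 5 clocks per activate is only reachable if tRRD=6 is counted in the faster clock, which is in line with the guess above.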