cross-posted from: https://lemmy.sdf.org/post/43150819

I have a Lenovo ThinkCentre M900 Tiny (10FL) which has been working fine. Recently, I had an HDD failure, and replaced it with an SSD. This time, I decided to go with ZFS (single drive, kinda pointless, but I get scrubbing). Every once in a while, I’m getting errors while scrubbing. It’s always 1-2 read or write errors. And it never reappears if I clear the error and run another scrub.

The data isn’t important, and it’s backed up, so I’m not too worried about it. But, the symptoms make me think that it’s an issue of the SATA port inside? Is it possible to replace the SATA port inside this device. I wasn’t able to find anything like part number etc. online and it looks like I need to replace the whole board. Any help is appreciated.

  • B0rax@feddit.org
    link
    fedilink
    arrow-up
    3
    ·
    8 days ago

    Isn’t the data port plugged in via a flex cable? Maybe check the connections first, unplug and replug. Make sure they are clean.

  • CameronDev@programming.dev
    link
    fedilink
    arrow-up
    2
    ·
    8 days ago

    Its possible usually, but soldering/desoldering on motherboards can be quite tricky, they often have substantial grounds that wick away heat.

    Why do you think its the physical port?

    • soldering/desoldering on motherboards can be quite tricky

      The I guess it’s not gonna work for me. I’m not much of a hardware guy, and have only (badly) soldered a few times.

      It’s a new SSD, no SMART failures, and it’s always read/write errors. I’m not an expert, but I think if the SSD was faulty, I’d get some checksum errors. I don’t think any other parts like RAM and CPU are involved since there’s another 4-drive DAS attached to the machine over USB, and it’s been running ZFS with RAIDZ2 without any issues for more than a year. So that just leaves the port.

      Also, the scrub timings seem to be pretty bad. When I bought the SSD, a scrub would take around 15 minutes. But now it’s taking around 3 hours. The performance is the same in normal use.

      • CameronDev@programming.dev
        link
        fedilink
        arrow-up
        2
        ·
        8 days ago

        Could be a few other things. Power issues, motherboard chipset, SATA cable. I’d try replacing the sata cable first, but I dont think I’d get too fixated on the port itself.

        If its your first time, dont do it. You’ll kill the port with too much heat, or lift a pad. Not worth the risk.

  • kalleboo@lemmy.world
    link
    fedilink
    arrow-up
    2
    ·
    8 days ago

    Have you checked the SMART value 199/0xC7 “UltraDMA CRC Error Count”? This should tell you how many checksum errors happened on the SATA interface between the CPU and the drive. If this is higher than zero your hypothesis is correct and there’s something bad with the connection, if it’s zero then the problem is more likely to be elsewhere.