unofficial mirror of bug-guix@gnu.org 
 help / color / mirror / code / Atom feed
* bug#43132: [GUIX SYSTEM]: Malfunction
@ 2020-08-31  9:48 Raghav Gururajan
  2020-08-31  9:53 ` Efraim Flashner
                   ` (2 more replies)
  0 siblings, 3 replies; 18+ messages in thread
From: Raghav Gururajan @ 2020-08-31  9:48 UTC (permalink / raw)
  To: 43132


[-- Attachment #1.1: Type: text/plain, Size: 2394 bytes --]

Hello Guix!

[1] Out of no where, when I did `guix environment foo`, I got:

\note: build failure may have been caused by lack of free disk space
builder for `/gnu/store/2ajnpcblwpgzjdhx3050qapy3li31pr5-profile.drv'
failed with exit code 1

[2] When I redid the command 2nd time, I got:

error (ignored): cannot unlink `/tmp/guix-build-profile.drv-0':
Read-only file system
error (ignored): cannot unlink
`/gnu/store/2ajnpcblwpgzjdhx3050qapy3li31pr5-profile.drv.chroot/tmp/guix-build-profile.drv-0':
Read-only file system
guix environment: error: cannot link
`/gnu/store/.links/1jd7y4xvj853m4aygnyixci5h2y7a1py6iavp9kwzvcinyniqwbd' to
`/gnu/store/3klrs2bkcmypwnmx61q24rc7csgk19f8-profile/share/icons/Adwaita/64x64/emotes/face-smile-big
symbolic.symbolic.png': Read-only file system

[3] When I redid the command 3rd time, I got:

guix environment: error: fport_read: Connection reset by peer

[4] When I redid the command 4th time, I got:

guix environment: error: failed to connect to
`/var/guix/daemon-socket/socket': Connection refused

[5] So I tried to restart guix-daemon and got a weird output:

sudo: unable to open /var/run/sudo/ts/rg: Read-only file system
Password:
Service guix-daemon is not running.
Service guix-daemon is currently disabled.

[6] Then I tried to enable the daemon:

sudo: unable to open /var/run/sudo/ts/rg: Read-only file system
Password:
Enabled service guix-daemon.

[7] Then I tried to start the daemon:

sudo: unable to open /var/run/sudo/ts/rg: Read-only file system
Password:
Service guix-daemon has been started.

[8] Now, I retried the `guix environment foo` and got same error as in 4.

[9] At this point, all the other running applications started to throw
errors regarding read-only file-system. I could not even save the above
errors in a text editor. Glad that I had the IceCat running and I was
able to email it to myself. IceCat wasn't affected, as I think the
web-process was containerized. Everything was back to normal after restart.

[10] I am experiencing this situation for the 3rd time this month. It
never happened before this month.

INFO:

`guix describe`

  guix dad963a
    repository URL: https://git.savannah.gnu.org/git/guix.git
    commit: dad963a4393ea51409baa63817b26b449ed58338

Both my user profile and root profile are on the same commit.

Regards,
RG.


[-- Attachment #2: OpenPGP digital signature --]
[-- Type: application/pgp-signature, Size: 488 bytes --]

^ permalink raw reply	[flat|nested] 18+ messages in thread

* bug#43132: [GUIX SYSTEM]: Malfunction
  2020-08-31  9:48 bug#43132: [GUIX SYSTEM]: Malfunction Raghav Gururajan
@ 2020-08-31  9:53 ` Efraim Flashner
  2020-08-31 10:01   ` Raghav Gururajan
  2020-08-31 10:39 ` Giovanni Biscuolo
  2020-08-31 21:17 ` Danny Milosavljevic
  2 siblings, 1 reply; 18+ messages in thread
From: Efraim Flashner @ 2020-08-31  9:53 UTC (permalink / raw)
  To: Raghav Gururajan; +Cc: 43132

[-- Attachment #1: Type: text/plain, Size: 2955 bytes --]

On Mon, Aug 31, 2020 at 05:48:30AM -0400, Raghav Gururajan wrote:
> Hello Guix!
> 
> [1] Out of no where, when I did `guix environment foo`, I got:
> 
> \note: build failure may have been caused by lack of free disk space
> builder for `/gnu/store/2ajnpcblwpgzjdhx3050qapy3li31pr5-profile.drv'
> failed with exit code 1
> 
> [2] When I redid the command 2nd time, I got:
> 
> error (ignored): cannot unlink `/tmp/guix-build-profile.drv-0':
> Read-only file system
> error (ignored): cannot unlink
> `/gnu/store/2ajnpcblwpgzjdhx3050qapy3li31pr5-profile.drv.chroot/tmp/guix-build-profile.drv-0':
> Read-only file system
> guix environment: error: cannot link
> `/gnu/store/.links/1jd7y4xvj853m4aygnyixci5h2y7a1py6iavp9kwzvcinyniqwbd' to
> `/gnu/store/3klrs2bkcmypwnmx61q24rc7csgk19f8-profile/share/icons/Adwaita/64x64/emotes/face-smile-big
> symbolic.symbolic.png': Read-only file system
> 
> [3] When I redid the command 3rd time, I got:
> 
> guix environment: error: fport_read: Connection reset by peer
> 
> [4] When I redid the command 4th time, I got:
> 
> guix environment: error: failed to connect to
> `/var/guix/daemon-socket/socket': Connection refused
> 
> [5] So I tried to restart guix-daemon and got a weird output:
> 
> sudo: unable to open /var/run/sudo/ts/rg: Read-only file system
> Password:
> Service guix-daemon is not running.
> Service guix-daemon is currently disabled.
> 
> [6] Then I tried to enable the daemon:
> 
> sudo: unable to open /var/run/sudo/ts/rg: Read-only file system
> Password:
> Enabled service guix-daemon.
> 
> [7] Then I tried to start the daemon:
> 
> sudo: unable to open /var/run/sudo/ts/rg: Read-only file system
> Password:
> Service guix-daemon has been started.
> 
> [8] Now, I retried the `guix environment foo` and got same error as in 4.
> 
> [9] At this point, all the other running applications started to throw
> errors regarding read-only file-system. I could not even save the above
> errors in a text editor. Glad that I had the IceCat running and I was
> able to email it to myself. IceCat wasn't affected, as I think the
> web-process was containerized. Everything was back to normal after restart.
> 
> [10] I am experiencing this situation for the 3rd time this month. It
> never happened before this month.
> 
> INFO:
> 
> `guix describe`
> 
>   guix dad963a
>     repository URL: https://git.savannah.gnu.org/git/guix.git
>     commit: dad963a4393ea51409baa63817b26b449ed58338
> 
> Both my user profile and root profile are on the same commit.
> 
> Regards,
> RG.
> 


What's the output of 'df -h' and 'df -i'? There's not much change in
error message if you're out of space or just out of inodes.


-- 
Efraim Flashner   <efraim@flashner.co.il>   אפרים פלשנר
GPG key = A28B F40C 3E55 1372 662D  14F7 41AA E7DC CA3D 8351
Confidentiality cannot be guaranteed on emails sent or received unencrypted

[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 833 bytes --]

^ permalink raw reply	[flat|nested] 18+ messages in thread

* bug#43132: [GUIX SYSTEM]: Malfunction
  2020-08-31  9:53 ` Efraim Flashner
@ 2020-08-31 10:01   ` Raghav Gururajan
  0 siblings, 0 replies; 18+ messages in thread
From: Raghav Gururajan @ 2020-08-31 10:01 UTC (permalink / raw)
  To: Efraim Flashner; +Cc: 43132


[-- Attachment #1.1: Type: text/plain, Size: 1135 bytes --]

Hi Efraim!

> What's the output of 'df -h' and 'df -i'? There's not much change in
> error message if you're out of space or just out of inodes.

rg@secondary ~$ df -h
Filesystem      Size  Used Avail Use% Mounted on
none            3.8G     0  3.8G   0% /dev
/dev/dm-0       120G   95G   24G  81% /
tmpfs           3.8G  8.6M  3.8G   1% /dev/shm
none            3.8G   20K  3.8G   1% /run/systemd
none            3.8G     0  3.8G   0% /run/user
cgroup          3.8G     0  3.8G   0% /sys/fs/cgroup
tmpfs           771M  4.0K  771M   1% /run/user/1000
/dev/sdb1        60G   59G  638M  99% /media/rg/CARD
rg@secondary ~$ df -i
Filesystem     Inodes IUsed  IFree IUse% Mounted on
none           983678   592 983086    1% /dev
/dev/dm-0           0     0      0     - /
tmpfs          986453    71 986382    1% /dev/shm
none           986453    21 986432    1% /run/systemd
none           986453     2 986451    1% /run/user
cgroup         986453    12 986441    1% /sys/fs/cgroup
tmpfs          986453    14 986439    1% /run/user/1000
/dev/sdb1           0     0      0     - /media/rg/CARD

Regards,
RG.


[-- Attachment #2: OpenPGP digital signature --]
[-- Type: application/pgp-signature, Size: 488 bytes --]

^ permalink raw reply	[flat|nested] 18+ messages in thread

* bug#43132: [GUIX SYSTEM]: Malfunction
  2020-08-31  9:48 bug#43132: [GUIX SYSTEM]: Malfunction Raghav Gururajan
  2020-08-31  9:53 ` Efraim Flashner
@ 2020-08-31 10:39 ` Giovanni Biscuolo
  2020-08-31 10:48   ` Raghav Gururajan
  2020-08-31 21:17 ` Danny Milosavljevic
  2 siblings, 1 reply; 18+ messages in thread
From: Giovanni Biscuolo @ 2020-08-31 10:39 UTC (permalink / raw)
  To: Raghav Gururajan, 43132

[-- Attachment #1: Type: text/plain, Size: 401 bytes --]

Hi Raghav

Raghav Gururajan <raghavgururajan@disroot.org> writes:

[...]

> [2] When I redid the command 2nd time, I got:
>
> error (ignored): cannot unlink `/tmp/guix-build-profile.drv-0':
> Read-only file system

It seems connected to a filesystem issue: can you also tell us what's
the output of "mount"?

[...]

Thanks, Gio'

-- 
Giovanni Biscuolo

Xelera IT Infrastructures

[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 832 bytes --]

^ permalink raw reply	[flat|nested] 18+ messages in thread

* bug#43132: [GUIX SYSTEM]: Malfunction
  2020-08-31 10:39 ` Giovanni Biscuolo
@ 2020-08-31 10:48   ` Raghav Gururajan
  2020-08-31 11:11     ` Giovanni Biscuolo
  0 siblings, 1 reply; 18+ messages in thread
From: Raghav Gururajan @ 2020-08-31 10:48 UTC (permalink / raw)
  To: Giovanni Biscuolo, 43132


[-- Attachment #1.1: Type: text/plain, Size: 2015 bytes --]

Hi Gio!


> It seems connected to a filesystem issue: can you also tell us what's
> the output of "mount"?

none on /proc type proc (rw,relatime)
none on /dev type devtmpfs
(rw,relatime,size=3934712k,nr_inodes=983678,mode=755)
none on /sys type sysfs (rw,relatime)
/dev/mapper/secondary on / type btrfs
(rw,relatime,ssd,space_cache,subvolid=5,subvol=/)
none on /dev/pts type devpts (rw,relatime,gid=996,mode=620,ptmxmode=000)
none on /sys/kernel/debug type debugfs (rw,relatime)
tmpfs on /dev/shm type tmpfs (rw,nosuid,nodev,relatime)
/dev/mapper/secondary on /gnu/store type btrfs
(ro,relatime,ssd,space_cache,subvolid=5,subvol=/)
binfmt_misc on /proc/sys/fs/binfmt_misc type binfmt_misc (rw,relatime)
none on /run/systemd type tmpfs (rw,nosuid,nodev,noexec,relatime,mode=755)
none on /run/user type tmpfs (rw,nosuid,nodev,noexec,relatime,mode=755)
cgroup on /sys/fs/cgroup type tmpfs (rw,relatime)
cgroup on /sys/fs/cgroup/elogind type cgroup (rw,relatime,name=elogind)
cgroup on /sys/fs/cgroup/cpuset type cgroup (rw,relatime,cpuset)
cgroup on /sys/fs/cgroup/cpu type cgroup (rw,relatime,cpu)
cgroup on /sys/fs/cgroup/cpuacct type cgroup (rw,relatime,cpuacct)
cgroup on /sys/fs/cgroup/memory type cgroup (rw,relatime,memory)
cgroup on /sys/fs/cgroup/devices type cgroup (rw,relatime,devices)
cgroup on /sys/fs/cgroup/freezer type cgroup (rw,relatime,freezer)
cgroup on /sys/fs/cgroup/blkio type cgroup (rw,relatime,blkio)
cgroup on /sys/fs/cgroup/perf_event type cgroup (rw,relatime,perf_event)
cgroup on /sys/fs/cgroup/pids type cgroup (rw,relatime,pids)
cgroup2 on /sys/fs/cgroup/unified type cgroup2
(rw,nosuid,nodev,noexec,relatime,nsdelegate)
tmpfs on /run/user/1000 type tmpfs
(rw,nosuid,nodev,relatime,size=789160k,mode=700,uid=1000,gid=998)
/dev/sdb1 on /media/rg/CARD type vfat
(rw,nosuid,nodev,relatime,uid=1000,gid=998,fmask=0022,dmask=0022,codepage=437,iocharset=iso8859-1,shortname=mixed,showexec,utf8,flush,errors=remount-ro,uhelper=udisks2)

Regards,
RG.


[-- Attachment #2: OpenPGP digital signature --]
[-- Type: application/pgp-signature, Size: 488 bytes --]

^ permalink raw reply	[flat|nested] 18+ messages in thread

* bug#43132: [GUIX SYSTEM]: Malfunction
  2020-08-31 10:48   ` Raghav Gururajan
@ 2020-08-31 11:11     ` Giovanni Biscuolo
  2020-08-31 12:07       ` Julien Lepiller
  0 siblings, 1 reply; 18+ messages in thread
From: Giovanni Biscuolo @ 2020-08-31 11:11 UTC (permalink / raw)
  To: Raghav Gururajan, 43132

[-- Attachment #1: Type: text/plain, Size: 1069 bytes --]

Hello Raghav

when forwarding the output of commands next time, plz beware your MUS
does not reformat the relevant :-)

This seems as a system issue on your side, not a Guix bug

Raghav Gururajan <raghavgururajan@disroot.org> writes:

>> It seems connected to a filesystem issue: can you also tell us what's
>> the output of "mount"?

[...]

> w on / type btrfs
> (rw,relatime,ssd,space_cache,subvolid=5,subvol=/)

[...]

> /dev/mapper/secondary on /gnu/store type btrfs
> (ro,relatime,ssd,space_cache,subvolid=5,subvol=/)

I see two problems here:

1. the btrfs volume /dev/mapper/secondary seems mounted twice, and with
the same subvolume; I never tryed to mount the same btrfs volume on two
different mountpoints: is this the reason your /gnu/store is read-only?

2. /gnu/store is mounted read-only, that's why you get the errors

Please can you try removing the mounting of /gnu/store from your
filesystem configuration (or fstab if on a foreign distro)? 

[...]

HTH! Gio'

-- 
Giovanni Biscuolo

Xelera IT Infrastructures

[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 832 bytes --]

^ permalink raw reply	[flat|nested] 18+ messages in thread

* bug#43132: [GUIX SYSTEM]: Malfunction
  2020-08-31 11:11     ` Giovanni Biscuolo
@ 2020-08-31 12:07       ` Julien Lepiller
  2020-08-31 12:40         ` Giovanni Biscuolo
  0 siblings, 1 reply; 18+ messages in thread
From: Julien Lepiller @ 2020-08-31 12:07 UTC (permalink / raw)
  To: Giovanni Biscuolo, Raghav Gururajan, 43132

[-- Attachment #1: Type: text/plain, Size: 1418 bytes --]

No, it's supposed to be like that. /gnu/store is mounted read-only (on the guix system) to prevent you from writing to it. The guix daemon has write access to the store when it wants to add a new item, or garbage collect.

Le 31 août 2020 07:11:13 GMT-04:00, Giovanni Biscuolo <g@xelera.eu> a écrit :
>Hello Raghav
>
>when forwarding the output of commands next time, plz beware your MUS
>does not reformat the relevant :-)
>
>This seems as a system issue on your side, not a Guix bug
>
>Raghav Gururajan <raghavgururajan@disroot.org> writes:
>
>>> It seems connected to a filesystem issue: can you also tell us
>what's
>>> the output of "mount"?
>
>[...]
>
>> w on / type btrfs
>> (rw,relatime,ssd,space_cache,subvolid=5,subvol=/)
>
>[...]
>
>> /dev/mapper/secondary on /gnu/store type btrfs
>> (ro,relatime,ssd,space_cache,subvolid=5,subvol=/)
>
>I see two problems here:
>
>1. the btrfs volume /dev/mapper/secondary seems mounted twice, and with
>the same subvolume; I never tryed to mount the same btrfs volume on two
>different mountpoints: is this the reason your /gnu/store is read-only?
>
>2. /gnu/store is mounted read-only, that's why you get the errors
>
>Please can you try removing the mounting of /gnu/store from your
>filesystem configuration (or fstab if on a foreign distro)? 
>
>[...]
>
>HTH! Gio'
>
>-- 
>Giovanni Biscuolo
>
>Xelera IT Infrastructures

[-- Attachment #2: Type: text/html, Size: 2156 bytes --]

^ permalink raw reply	[flat|nested] 18+ messages in thread

* bug#43132: [GUIX SYSTEM]: Malfunction
  2020-08-31 12:07       ` Julien Lepiller
@ 2020-08-31 12:40         ` Giovanni Biscuolo
  0 siblings, 0 replies; 18+ messages in thread
From: Giovanni Biscuolo @ 2020-08-31 12:40 UTC (permalink / raw)
  To: Julien Lepiller, Raghav Gururajan, 43132

[-- Attachment #1: Type: text/plain, Size: 1009 bytes --]

Julien Lepiller <julien@lepiller.eu> writes:

> No, it's supposed to be like that. /gnu/store is mounted read-only (on
> the guix system) to prevent you from writing to it.

Very sorry for the confusion, I forgot that! (and I did not check before) :-(

>>> /dev/mapper/secondary on / type btrfs
>>> (rw,relatime,ssd,space_cache,subvolid=5,subvol=/)
>>
>>[...]
>>
>>> /dev/mapper/secondary on /gnu/store type btrfs
>>> (ro,relatime,ssd,space_cache,subvolid=5,subvol=/)
                                                   ^ Same subvolume as /

This is the output from a running Guix system of mine:

--8<---------------cut here---------------start------------->8---

/dev/sda5 on / type btrfs (rw,relatime,space_cache,subvolid=5,subvol=/)

[...]

/dev/sda5 on /gnu/store type btrfs (ro,relatime,space_cache,subvolid=5,subvol=/gnu/store)

--8<---------------cut here---------------end--------------->8---

Thanks! Gio'

[...]

-- 
Giovanni Biscuolo

Xelera IT Infrastructures

[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 832 bytes --]

^ permalink raw reply	[flat|nested] 18+ messages in thread

* bug#43132: [GUIX SYSTEM]: Malfunction
  2020-08-31  9:48 bug#43132: [GUIX SYSTEM]: Malfunction Raghav Gururajan
  2020-08-31  9:53 ` Efraim Flashner
  2020-08-31 10:39 ` Giovanni Biscuolo
@ 2020-08-31 21:17 ` Danny Milosavljevic
  2020-09-01  3:04   ` Raghav Gururajan
  2 siblings, 1 reply; 18+ messages in thread
From: Danny Milosavljevic @ 2020-08-31 21:17 UTC (permalink / raw)
  To: Raghav Gururajan; +Cc: 43132

[-- Attachment #1: Type: text/plain, Size: 1175 bytes --]

Hi Raghav,

On Mon, 31 Aug 2020 05:48:30 -0400
Raghav Gururajan <raghavgururajan@disroot.org> wrote:

> Hello Guix!
> 
> [1] Out of no where, when I did `guix environment foo`, I got:
> 
> \note: build failure may have been caused by lack of free disk space
> builder for `/gnu/store/2ajnpcblwpgzjdhx3050qapy3li31pr5-profile.drv'
> failed with exit code 1
> 
> [2] When I redid the command 2nd time, I got:
> 
> error (ignored): cannot unlink `/tmp/guix-build-profile.drv-0':
> Read-only file system
> error (ignored): cannot unlink
> `/gnu/store/2ajnpcblwpgzjdhx3050qapy3li31pr5-profile.drv.chroot/tmp/guix-build-profile.drv-0':
> Read-only file system
> guix environment: error: cannot link
> `/gnu/store/.links/1jd7y4xvj853m4aygnyixci5h2y7a1py6iavp9kwzvcinyniqwbd' to
> `/gnu/store/3klrs2bkcmypwnmx61q24rc7csgk19f8-profile/share/icons/Adwaita/64x64/emotes/face-smile-big
> symbolic.symbolic.png': Read-only file system

Usually that means file-system corruption, which very likely was caused by a
hardware (disk) problem.  I've had it before, and shortly after the disk died.

What does "sudo dmesg" show around the time it made it read-only?

[-- Attachment #2: OpenPGP digital signature --]
[-- Type: application/pgp-signature, Size: 488 bytes --]

^ permalink raw reply	[flat|nested] 18+ messages in thread

* bug#43132: [GUIX SYSTEM]: Malfunction
  2020-08-31 21:17 ` Danny Milosavljevic
@ 2020-09-01  3:04   ` Raghav Gururajan
  2020-09-01  7:58     ` Danny Milosavljevic
  2020-09-10  7:56     ` Ludovic Courtès
  0 siblings, 2 replies; 18+ messages in thread
From: Raghav Gururajan @ 2020-09-01  3:04 UTC (permalink / raw)
  To: Danny Milosavljevic; +Cc: 43132


[-- Attachment #1.1: Type: text/plain, Size: 456 bytes --]

Hi Danny!

> Usually that means file-system corruption, which very likely was caused by a
> hardware (disk) problem.  I've had it before, and shortly after the disk died.

Oh no! My disk is a SSD, which is only about 2 years old. Isn't that too
soon?

Btw, is there a tool to check the health of the disk?

> What does "sudo dmesg" show around the time it made it read-only?

Ah, I will have to wait until it happens again.

Regards,
RG.


[-- Attachment #2: OpenPGP digital signature --]
[-- Type: application/pgp-signature, Size: 488 bytes --]

^ permalink raw reply	[flat|nested] 18+ messages in thread

* bug#43132: [GUIX SYSTEM]: Malfunction
  2020-09-01  3:04   ` Raghav Gururajan
@ 2020-09-01  7:58     ` Danny Milosavljevic
  2020-09-10  7:56     ` Ludovic Courtès
  1 sibling, 0 replies; 18+ messages in thread
From: Danny Milosavljevic @ 2020-09-01  7:58 UTC (permalink / raw)
  To: Raghav Gururajan; +Cc: 43132

[-- Attachment #1: Type: text/plain, Size: 846 bytes --]

Hi,

On Mon, 31 Aug 2020 23:04:25 -0400
Raghav Gururajan <raghavgururajan@disroot.org> wrote:

> Hi Danny!
> 
> > Usually that means file-system corruption, which very likely was caused by a
> > hardware (disk) problem.  I've had it before, and shortly after the disk died.  
> 
> Oh no! My disk is a SSD, which is only about 2 years old. Isn't that too
> soon?
> 
> Btw, is there a tool to check the health of the disk?

Yes--usually it's a program in the disk firmware.

You can steer it and look at what it did using smartctl (in package
smartmontools).

But I'd advise to check dmesg because it could also be a RAM problem, or a
number of other things.

(UNIX also has fsck to check the filesystem, but it already automatically
does that on reboot when problems arised.  So little need to manually
fiddle with that)

[-- Attachment #2: OpenPGP digital signature --]
[-- Type: application/pgp-signature, Size: 488 bytes --]

^ permalink raw reply	[flat|nested] 18+ messages in thread

* bug#43132: [GUIX SYSTEM]: Malfunction
  2020-09-01  3:04   ` Raghav Gururajan
  2020-09-01  7:58     ` Danny Milosavljevic
@ 2020-09-10  7:56     ` Ludovic Courtès
  2020-09-14 15:14       ` Maxim Cournoyer
  1 sibling, 1 reply; 18+ messages in thread
From: Ludovic Courtès @ 2020-09-10  7:56 UTC (permalink / raw)
  To: Raghav Gururajan; +Cc: 43132

Hey Raghav,

Did you eventually find what went wrong?  Should we close this bug or at
least retitle it?

Thanks,
Ludo’.




^ permalink raw reply	[flat|nested] 18+ messages in thread

* bug#43132: [GUIX SYSTEM]: Malfunction
  2020-09-10  7:56     ` Ludovic Courtès
@ 2020-09-14 15:14       ` Maxim Cournoyer
  2020-09-14 19:34         ` Ludovic Courtès
  0 siblings, 1 reply; 18+ messages in thread
From: Maxim Cournoyer @ 2020-09-14 15:14 UTC (permalink / raw)
  To: Ludovic Courtès; +Cc: 43132-done, Raghav Gururajan

Hello,

Ludovic Courtès <ludo@gnu.org> writes:

> Hey Raghav,
>
> Did you eventually find what went wrong?  Should we close this bug or at
> least retitle it?
>
> Thanks,
> Ludo’.

I took Raghav to #btrfs last week, where with the help of gentle folks a
failing drive was established as the most likely culprit.

In other words, Btrfs checksuming capabilities helped quickly
discovering a hardware problem which might otherwise have silently
caused non-recoverable damage to Raghav's data.

I'm closing this bug now.

Thanks!

Maxim




^ permalink raw reply	[flat|nested] 18+ messages in thread

* bug#43132: [GUIX SYSTEM]: Malfunction
  2020-09-14 15:14       ` Maxim Cournoyer
@ 2020-09-14 19:34         ` Ludovic Courtès
  2020-09-15 11:51           ` Raghav Gururajan
  0 siblings, 1 reply; 18+ messages in thread
From: Ludovic Courtès @ 2020-09-14 19:34 UTC (permalink / raw)
  To: Maxim Cournoyer; +Cc: 43132-done, Raghav Gururajan

Hi,

Maxim Cournoyer <maxim.cournoyer@gmail.com> skribis:

> I took Raghav to #btrfs last week, where with the help of gentle folks a
> failing drive was established as the most likely culprit.
>
> In other words, Btrfs checksuming capabilities helped quickly
> discovering a hardware problem which might otherwise have silently
> caused non-recoverable damage to Raghav's data.

Good, thanks for following up!

Ludo’.




^ permalink raw reply	[flat|nested] 18+ messages in thread

* bug#43132: [GUIX SYSTEM]: Malfunction
  2020-09-14 19:34         ` Ludovic Courtès
@ 2020-09-15 11:51           ` Raghav Gururajan
  2020-09-15 13:31             ` Maxim Cournoyer
  0 siblings, 1 reply; 18+ messages in thread
From: Raghav Gururajan @ 2020-09-15 11:51 UTC (permalink / raw)
  To: Ludovic Courtès, Maxim Cournoyer; +Cc: 43132-done


[-- Attachment #1.1: Type: text/plain, Size: 726 bytes --]

Hi!

> Maxim Cournoyer <maxim.cournoyer@gmail.com> skribis:
> 
>> I took Raghav to #btrfs last week, where with the help of gentle folks a
>> failing drive was established as the most likely culprit.
>>
>> In other words, Btrfs checksuming capabilities helped quickly
>> discovering a hardware problem which might otherwise have silently
>> caused non-recoverable damage to Raghav's data.
> 
> Good, thanks for following up!
> 
> Ludo’.

Thank you!

Yeah, seems like my disk is shot, but I am not sure. I have reinstalled
guix with ext4, instead of btrfs, as these issues started to arise after
migration to btrfs from ext4. So far, my system is doing well. Lets see
how it goes. :-)

Regards,
RG.


[-- Attachment #2: OpenPGP digital signature --]
[-- Type: application/pgp-signature, Size: 488 bytes --]

^ permalink raw reply	[flat|nested] 18+ messages in thread

* bug#43132: [GUIX SYSTEM]: Malfunction
  2020-09-15 11:51           ` Raghav Gururajan
@ 2020-09-15 13:31             ` Maxim Cournoyer
  2020-09-15 22:13               ` Mark H Weaver
  0 siblings, 1 reply; 18+ messages in thread
From: Maxim Cournoyer @ 2020-09-15 13:31 UTC (permalink / raw)
  To: Raghav Gururajan; +Cc: 43132-done

Hello Raghav,

Raghav Gururajan <raghavgururajan@disroot.org> writes:

> Hi!
>
>> Maxim Cournoyer <maxim.cournoyer@gmail.com> skribis:
>> 
>>> I took Raghav to #btrfs last week, where with the help of gentle folks a
>>> failing drive was established as the most likely culprit.
>>>
>>> In other words, Btrfs checksuming capabilities helped quickly
>>> discovering a hardware problem which might otherwise have silently
>>> caused non-recoverable damage to Raghav's data.
>> 
>> Good, thanks for following up!
>> 
>> Ludo’.
>
> Thank you!
>
> Yeah, seems like my disk is shot, but I am not sure. I have reinstalled
> guix with ext4, instead of btrfs, as these issues started to arise after
> migration to btrfs from ext4. So far, my system is doing well. Lets see
> how it goes. :-)

Sounds like playing with fire to me :-).

Ext4 won't detect bitrot (silent corruption of your drive's data).
You'll probably wake one day with a fsck that won't be able to recover
some files, or worst, a completely dead drive.

Your backups would also contain corrupted data (garbage in, garbage
out!).

Maxim




^ permalink raw reply	[flat|nested] 18+ messages in thread

* bug#43132: [GUIX SYSTEM]: Malfunction
  2020-09-15 13:31             ` Maxim Cournoyer
@ 2020-09-15 22:13               ` Mark H Weaver
  2020-09-15 22:38                 ` Raghav Gururajan
  0 siblings, 1 reply; 18+ messages in thread
From: Mark H Weaver @ 2020-09-15 22:13 UTC (permalink / raw)
  To: Maxim Cournoyer, Raghav Gururajan; +Cc: 43132-done

Maxim Cournoyer <maxim.cournoyer@gmail.com> writes:

> Raghav Gururajan <raghavgururajan@disroot.org> writes:
>
>>> Maxim Cournoyer <maxim.cournoyer@gmail.com> skribis:
>>> 
>>>> I took Raghav to #btrfs last week, where with the help of gentle folks a
>>>> failing drive was established as the most likely culprit.
>>>>
>>>> In other words, Btrfs checksuming capabilities helped quickly
>>>> discovering a hardware problem which might otherwise have silently
>>>> caused non-recoverable damage to Raghav's data.
>>> 
>>> Good, thanks for following up!
>>> 
>>> Ludo’.
>>
>> Thank you!
>>
>> Yeah, seems like my disk is shot, but I am not sure. I have reinstalled
>> guix with ext4, instead of btrfs, as these issues started to arise after
>> migration to btrfs from ext4. So far, my system is doing well. Lets see
>> how it goes. :-)
>
> Sounds like playing with fire to me :-).
>
> Ext4 won't detect bitrot (silent corruption of your drive's data).
> You'll probably wake one day with a fsck that won't be able to recover
> some files, or worst, a completely dead drive.
>
> Your backups would also contain corrupted data (garbage in, garbage
> out!).

For what it's worth, I wholeheartedly agree with Maxim.  Btrfs did you a
great service by calling attention to this problem with your drive, and
it would be a shame to ignore it and switch back to ext4 where your data
may instead be silently corrupted.

I've been using btrfs for several years now on my x86_64 Guix system,
and it has served me well.  Previously, I used ext4, which would
silently leave some of my files empty after crashes.  I've never seen
that happen with btrfs.

       Mark




^ permalink raw reply	[flat|nested] 18+ messages in thread

* bug#43132: [GUIX SYSTEM]: Malfunction
  2020-09-15 22:13               ` Mark H Weaver
@ 2020-09-15 22:38                 ` Raghav Gururajan
  0 siblings, 0 replies; 18+ messages in thread
From: Raghav Gururajan @ 2020-09-15 22:38 UTC (permalink / raw)
  To: Mark H Weaver, Maxim Cournoyer; +Cc: 43132-done


[-- Attachment #1.1: Type: text/plain, Size: 918 bytes --]

Hi Mark and Maxim!

>> Ext4 won't detect bitrot (silent corruption of your drive's data).
>> You'll probably wake one day with a fsck that won't be able to recover
>> some files, or worst, a completely dead drive.
>>
>> Your backups would also contain corrupted data (garbage in, garbage
>> out!).
> 
> For what it's worth, I wholeheartedly agree with Maxim.  Btrfs did you a
> great service by calling attention to this problem with your drive, and
> it would be a shame to ignore it and switch back to ext4 where your data
> may instead be silently corrupted.
> 
> I've been using btrfs for several years now on my x86_64 Guix system,
> and it has served me well.  Previously, I used ext4, which would
> silently leave some of my files empty after crashes.  I've never seen
> that happen with btrfs.

Yeah, makes sense. I have placed an order for WDS100T2B0A.

Thanks folks!

Regards,
RG.


[-- Attachment #2: OpenPGP digital signature --]
[-- Type: application/pgp-signature, Size: 488 bytes --]

^ permalink raw reply	[flat|nested] 18+ messages in thread

end of thread, other threads:[~2020-09-15 23:00 UTC | newest]

Thread overview: 18+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2020-08-31  9:48 bug#43132: [GUIX SYSTEM]: Malfunction Raghav Gururajan
2020-08-31  9:53 ` Efraim Flashner
2020-08-31 10:01   ` Raghav Gururajan
2020-08-31 10:39 ` Giovanni Biscuolo
2020-08-31 10:48   ` Raghav Gururajan
2020-08-31 11:11     ` Giovanni Biscuolo
2020-08-31 12:07       ` Julien Lepiller
2020-08-31 12:40         ` Giovanni Biscuolo
2020-08-31 21:17 ` Danny Milosavljevic
2020-09-01  3:04   ` Raghav Gururajan
2020-09-01  7:58     ` Danny Milosavljevic
2020-09-10  7:56     ` Ludovic Courtès
2020-09-14 15:14       ` Maxim Cournoyer
2020-09-14 19:34         ` Ludovic Courtès
2020-09-15 11:51           ` Raghav Gururajan
2020-09-15 13:31             ` Maxim Cournoyer
2020-09-15 22:13               ` Mark H Weaver
2020-09-15 22:38                 ` Raghav Gururajan

Code repositories for project(s) associated with this public inbox

	https://git.savannah.gnu.org/cgit/guix.git

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).