unofficial mirror of bug-guix@gnu.org 
 help / color / mirror / code / Atom feed
* bug#60803: Cuirass stopped processing jobs for aarch64-linux and x86_64-linux
@ 2023-01-14  5:19 Marius Bakke
  2023-01-15  4:26 ` Marius Bakke
  0 siblings, 1 reply; 2+ messages in thread
From: Marius Bakke @ 2023-01-14  5:19 UTC (permalink / raw)
  To: 60803; +Cc: othacehe

Hello Guix,

Cuirass has stopped processing (old) jobs for aarch64 and x86_64.  After
digging through the database it's because (db-get-pending-build ...)
returns a build that is missing from the Jobs table:

  WITH pending_dependencies AS
  (SELECT Builds.id, count(dep.id) as deps FROM Builds
  LEFT JOIN BuildDependencies as bd ON bd.source = Builds.id
  LEFT JOIN Builds AS dep ON bd.target = dep.id AND dep.status != 0
  WHERE Builds.status = -2 AND Builds.system = 'x86_64-linux'
  GROUP BY builds.id
  ORDER BY Builds.priority ASC, Builds.timestamp DESC)
  SELECT id FROM pending_dependencies where deps = 0 limit 1;

     id
  --------
   335212

However:

  select * from jobs  where  build = 335212;
   name | evaluation | build | status | system
  ------+------------+-------+--------+--------
  (0 rows)

For clarity:

  select id,derivation,evaluation,job_name,nix_name,status from builds where id = 335212;
     id   |                            derivation                             | evaluation |       job_name        |     nix_name      | status
  --------+-------------------------------------------------------------------+------------+-----------------------+-------------------+--------
   335212 | /gnu/store/yzgcza0nijnp79mzz878q9a61p6jykgh-perftest-4.5-0.20.drv |     103435 | perftest.x86_64-linux | perftest-4.5-0.20 |     -2

The derivation is also missing from the Outputs table, which causes the
monster query in (db-get-builds ...), which is what workers call to
fetch the next job, to return nothing.

335212 belongs to evaluation 103435 according to the above query, but
does not show up here:

  https://ci.guix.gnu.org/eval/103435?all=&paginate=0

The build id sequence appears to belong to this evaluation:

  https://ci.guix.gnu.org/eval/103436?all=&paginate=0

(notice how it has 335211 and 335213).

I'm not sure how to recover from this.  Either manually create the
entries in Jobs and Outputs, or delete the offending Builds entry?

The 335212 build is for x86_64-linux, we have the same problem with
335087 (also perftest) on aarch64.  i686-linux and powerpc64le-linux is
fine.

Ideas?




^ permalink raw reply	[flat|nested] 2+ messages in thread

* bug#60803: Cuirass stopped processing jobs for aarch64-linux and x86_64-linux
  2023-01-14  5:19 bug#60803: Cuirass stopped processing jobs for aarch64-linux and x86_64-linux Marius Bakke
@ 2023-01-15  4:26 ` Marius Bakke
  0 siblings, 0 replies; 2+ messages in thread
From: Marius Bakke @ 2023-01-15  4:26 UTC (permalink / raw)
  To: 60803; +Cc: othacehe

[-- Attachment #1: Type: text/plain, Size: 719 bytes --]

Marius Bakke <marius@gnu.org> skriver:

> The 335212 build is for x86_64-linux, we have the same problem with
> 335087 (also perftest) on aarch64.  i686-linux and powerpc64le-linux is
> fine.

I deleted these two from the Builds and BuildDependencies tables which
allowed Cuirass to move forward (or backwards, really, as it was
processing new jobs just fine).

Not sure how to mitigate the problem (race when two evaluations create
different derivations with identical outputs at the same time?), but at
least we know how to deal with it.

Speaking of builds, I started debugging #60016 and accidentally deleted
build 175246!  Enough late night debugging for me...  I'll set up my own
Cuirass to experiment on "soon".

[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 247 bytes --]

^ permalink raw reply	[flat|nested] 2+ messages in thread

end of thread, other threads:[~2023-01-15  4:27 UTC | newest]

Thread overview: 2+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2023-01-14  5:19 bug#60803: Cuirass stopped processing jobs for aarch64-linux and x86_64-linux Marius Bakke
2023-01-15  4:26 ` Marius Bakke

Code repositories for project(s) associated with this public inbox

	https://git.savannah.gnu.org/cgit/guix.git

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).