Wednesday, 2022-05-04

*** tpb <[email protected]> has joined #openrisc00:00
shornezx2c4: interesting SIGILL, the address doesnt seem right 0x300a7ce4 its a user space address00:01
zx2c4thats a crash when starting bash00:01
zx2c4ive seen various things sort of resembling that...00:02
zx2c4what this all makes me think is that sometimes there's a jump to the wrong code offset00:02
zx2c4btw, you wont be able to repro with wireguard-linux repo because or1k.config has PREEMPT_NONE=y in it. you have to remove that00:03
shorneI see, any luck reproducing without the initramfs?00:03
zx2c4innnnteresting. no, when i totally disable initramfs it doesnt repro!00:08
zx2c4btw, on mine, i get00:11
zx2c4[    0.000000] Initial ramdisk not found00:11
zx2c4in the fialing case00:11
zx2c4because im using an in-built initramfs00:11
zx2c4rather than one passed via -initramfs00:11
zx2c4CONFIG_BLK_DEV_INITRD=y00:11
zx2c4CONFIG_INITRAMFS_SOURCE="/home/zx2c4/Projects/wireguard-linux/tools/testing/selftests/wireguard/qemu/build/or1k/init-cpio-spec.txt"00:11
zx2c4CONFIG_INITRAMFS_ROOT_UID=000:11
zx2c4CONFIG_INITRAMFS_ROOT_GID=000:11
zx2c4where that file contains https://xn--4db.cc/8XjbyUbr00:12
shorneyeah, I build with :00:13
shornemake -j12 LC_ALL=en_US.UTF-8 ARCH=openrisc CROSS_COMPILE=or1k-linux-musl- CONFIG_INITRAMFS_SOURCE=/home/shorne/work/openrisc/or1k-utils/initramfs /home/shorne/work/openrisc/or1k-utils/initramfs.devnodes00:13
shorneto try to reproduce a similar CPIO build in image00:13
zx2c4I am able to reproduce even if i reduce that file to a single entry00:14
zx2c4file /init /home/zx2c4/Projects/wireguard-linux/tools/testing/selftests/wireguard/qemu/build/or1k/init 755 0 000:14
zx2c4it reproduced with just that00:14
zx2c4here's the init file https://usercontent.irccloud-cdn.com/file/zeCwEBDE/init00:14
shorneok, let me try that00:14
zx2c4here's the output of my `locale`00:14
zx2c4I'm also able to reproduce with the empty initramfs!00:16
zx2c4CONFIG_BLK_DEV_INITRD=y00:16
zx2c4CONFIG_INITRAMFS_SOURCE=""00:16
zx2c4(the kernel supplies a "default" one in that case)00:16
zx2c4So here's a complete kernel config that exhibits the issue and doesnt have any userland dependencies: https://xn--4db.cc/cCRlQ1AE00:17
zx2c4another interesting quirk:00:18
zx2c4[    3.364000] Segment Routing with IPv600:18
zx2c4[    3.364000] In-situ OAM (IOAM) with IPv600:18
zx2c4[  200.944000] List of all partitions:00:18
zx2c4[  200.944000] No filesystem could mount root, tried: 00:18
zx2c4It jumped to 200 otu of nowhere. I've seen this happen too a few times. The clock goes nuts00:19
shorneoh, that is strange00:19
zx2c4More flawed computation? A different bug? Dunno00:19
zx2c4here's a minimal kernel with no userland that exhibits the bug. `qemu-system-or1k -nodefaults -nographic -cpu or1200 -machine or1k-sim -serial stdio -kernel vmlinux` https://usercontent.irccloud-cdn.com/file/CHD5g4be/vmlinux00:21
shornethat one reproduces on my qemu00:24
shornebut I can't get it with my own build yet00:24
zx2c4use https://xn--4db.cc/cCRlQ1AE with no modifications00:24
shorneTried minimap cpio00:24
shorneWill try no cpio00:24
zx2c4Can you try exactly the contents of https://xn--4db.cc/cCRlQ1AE ?00:24
shornetrying  https://xn--4db.cc/cCRlQ1AE00:25
zx2c4and then maybe that + the musl.cc compiler?00:25
shorneits on musl compiler00:26
shorne make -j12 LC_ALL=en_US.UTF-8 ARCH=openrisc CROSS_COMPILE=or1k-linux-musl-00:26
shornetrying that with your .config00:26
shornemaybe its me passing LC_ALL, or other env vars? SHould be little difference now00:27
zx2c4HOSTCC shouldnt be doing anything interesting00:28
zx2c4I'll try building with `env -i PATH=/usr/bin:/bin make ...`00:29
zx2c4totally empty environment00:30
zx2c4if youre using the same compiler and same everything else, in theory we should be able to compare vmlinux images, right?00:31
shorneI cannot repro with the minimal config00:31
zx2c4i can repro the issue with `env -i PATH=/usr/bin:/bin make ...`00:31
zx2c4oh im setting KBUILD_BUILD_TIMESTAMP to an empty string in my build environment...00:32
zx2c4nope, that's not it. removed that and still reproudces00:33
zx2c4are you compiling my wireguard tree, by the way, on the `stable` branch, or some other tree?00:34
shorneIt would be nice if we could compare images, my kernel version is 00:34
shorne2022-05-01 d53a0fd87c26 Julia Lawall     openrisc: fix typos in comments  (HEAD -> master, shorne/master)00:34
shorne2022-05-01 9f10b44dcefc Jason A. Donenfeld openrisc: define nop command for simulator reboot 00:34
shorne2022-04-24 af2d861d4cd2 Linus Torvalds   Linux 5.18-rc4  (tag: v5.18-rc4)00:34
zx2c4alright same here00:34
shorneI have 2 patches on top of v5.18-rc400:34
zx2c4oh, er00:34
shorneah, maybe thats it?00:35
shornelet me try to get your tree00:35
zx2c4let me check out linus' tree00:35
*** littlebobeep <littlebobeep!~alMalsamo@gateway/tor-sasl/almalsamo> has joined #openrisc00:37
shornezx2c4: should I be using the stable branch?00:37
zx2c4yea00:37
shorneok, building it now00:39
zx2c4im building rc500:39
shorne[    3.320000] WARNING: CPU: 0 PID: 1 at lib/crypto/curve25519.c:19 curve25519_init+0x38/0x5000:40
zx2c4BINGO00:40
zx2c4!!!00:40
zx2c4whewwww00:40
shorneok its happening right away00:40
zx2c4good00:40
zx2c4i can repro with rc500:41
zx2c4so ostensibly this is a regression somewhere between rc4 and rc5?00:41
shorneyeah, it looks like it00:41
shorneneed to bisect00:41
shornebut again, I need to go soon00:42
shornesorry, I am on vacation with my family this week, Japan has a 3 day holiday tue-thu00:42
zx2c4ahhh00:42
shorneoh, wife is getting mad too much computer00:42
zx2c4haha00:42
*** littlebobeep <littlebobeep!~alMalsamo@gateway/tor-sasl/almalsamo> has quit IRC (Ping timeout: 240 seconds)01:33
*** littlebobeep <littlebobeep!~alMalsamo@gateway/tor-sasl/almalsamo> has joined #openrisc01:37
*** littlebobeep <littlebobeep!~alMalsamo@gateway/tor-sasl/almalsamo> has quit IRC (Ping timeout: 240 seconds)02:17
*** littlebobeep <littlebobeep!~alMalsamo@gateway/tor-sasl/almalsamo> has joined #openrisc06:13
*** littlebobeep <littlebobeep!~alMalsamo@gateway/tor-sasl/almalsamo> has quit IRC (Ping timeout: 240 seconds)06:51
*** littlebobeep <littlebobeep!~alMalsamo@gateway/tor-sasl/almalsamo> has joined #openrisc07:28
shornebisected to e5be15767e7e284351853cbaba80cde8620341fb, hex2bin: make the function hex_to_bin constant-time08:05
shorneafter reverting that on your 'stable' branch it doesnt reproduce08:13
zx2c4Whaaaaa.... That cant possibly be it...09:17
zx2c4shorne: funny enough, https://lists.librecores.org/pipermail/openrisc/2022-May/003922.html also "fixes" it... so basically there's some really subtle bug that most things mask11:11
tpbTitle: [PATCH] openrisc: remove bogus nops and shutdowns (at lists.librecores.org)11:11
zx2c4and we're really lucky that we've now unmasked it now11:11
zx2c4figuring out root cause would be a very very good idea before it's papered over by other things11:11
littlebobeepHmm so yall use QEMU?  No FPGAs?12:18
zx2c4littlebobeep: im just a lowly CI admin14:05
zx2c4im sure shorne has some FGPAs running or1k though14:05
zx2c4FPGAs14:05
littlebobeepI see okay14:09
*** littlebobeep <littlebobeep!~alMalsamo@gateway/tor-sasl/almalsamo> has quit IRC (Ping timeout: 240 seconds)14:24
*** littlebobeep <littlebobeep!~alMalsamo@gateway/tor-sasl/almalsamo> has joined #openrisc14:27
*** littlebo1eep <littlebo1eep!~alMalsamo@gateway/tor-sasl/almalsamo> has joined #openrisc14:33
*** littlebobeep <littlebobeep!~alMalsamo@gateway/tor-sasl/almalsamo> has quit IRC (Ping timeout: 240 seconds)14:35
*** littlebo1eep <littlebo1eep!~alMalsamo@gateway/tor-sasl/almalsamo> has quit IRC (Ping timeout: 240 seconds)14:39
*** littlebobeep <littlebobeep!~alMalsamo@gateway/tor-sasl/almalsamo> has joined #openrisc14:45
*** littlebobeep <littlebobeep!~alMalsamo@gateway/tor-sasl/almalsamo> has quit IRC (Ping timeout: 240 seconds)15:16
*** littlebobeep <littlebobeep!~alMalsamo@gateway/tor-sasl/almalsamo> has joined #openrisc15:23
*** littlebobeep <littlebobeep!~alMalsamo@gateway/tor-sasl/almalsamo> has quit IRC (Ping timeout: 240 seconds)15:29
*** littlebobeep <littlebobeep!~alMalsamo@gateway/tor-sasl/almalsamo> has joined #openrisc15:41
*** littlebobeep <littlebobeep!~alMalsamo@gateway/tor-sasl/almalsamo> has quit IRC (Ping timeout: 240 seconds)15:55
*** littlebobeep <littlebobeep!~alMalsamo@gateway/tor-sasl/almalsamo> has joined #openrisc16:01
*** littlebobeep <littlebobeep!~alMalsamo@gateway/tor-sasl/almalsamo> has quit IRC (Ping timeout: 240 seconds)16:42
shorneyeah, I can try on FPRA's19:39
shornebut right now I was to try a few other things19:39
shorneusing my 12.x compiler I cannot trigger the issue20:14
shorneAlso, I cannot reproduce using v5.18-rc520:46
shorneso it seems that particular commit with that particular compiler version 11.2.1 with config20:46
*** littlebobeep <littlebobeep!~alMalsamo@gateway/tor-sasl/almalsamo> has joined #openrisc21:32
shornelittlebobeep: I use FPGA and QEMU22:06
shornebut for this testing we are discussing its QEMU22:07

Generated by irclog2html.py 2.17.2 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!