Saturday, 2022-07-30

*** tpb <[email protected]> has joined #openrisc00:00
shorneyeah, I think your test does way more io00:12
shornezx2c4: so probably it points to something with the irq handling hanging (I fixed some bugs related to this) probably not all00:12
zx2c4when hung, cpu usage of the process was 000:12
zx2c4so probably mutex deadlock thing00:12
shorneI did some kernel patches to try to detect IPI hangs (i.e. time start of ip vs time complete > 10ms), but that didn't reveal anything00:13
shorneso its deadlocking somewhere else00:13
shornemy guess is its still in qemu somewhere00:14
zx2c4yea00:15
shornezx2c4: I see you already got the patch about corrupt stack in there00:19
zx2c4shorne: no real stack corruption. just a "what if" thing linus wanted00:21
shorneyeah, I didn't read the code, just the mail from linus and you.  I thought it was the actual call stack, but this is just an array called stack00:22
shornewell, I guess its on the stack00:22
shorne[  310.456000] reboot: Restarting system00:36
zx2c4shorne: wow it finished for you00:46
zx2c4mine did not00:46
shorne[  318.944000] reboot: Restarting system01:01
zx2c4shorne: i applied your patches on top of qemu master branch01:05
shorneyours is on 5.19-rc8, this is the linux branch I am using right now: https://github.com/stffrdhrn/linux/commits/or1k-wireguard-201:05
zx2c4ah01:05
shorneits just my or1k-5.20-updates + the wireguard selftests01:05
shorneon qemu I am also running on top of qemu master basically: https://github.com/stffrdhrn/qemu/commits/or1k-virt-401:06
shornewhat run times do you typically see for the wireguard tests?01:08
shornelike 100 seconds?01:08
zx2c4it hung before and i ctrl+c'd it after 10 minutes01:08
zx2c4oh, on other platforms?01:08
zx2c4lets see01:08
zx2c4https://www.wireguard.com/build-status/01:08
tpbTitle: Build Status - WireGuard (at www.wireguard.com)01:08
shorneI know host system matters, but just wondering what is the ballpark01:08
zx2c4x86 gives `[  100.659072] reboot: machine restart`01:08
zx2c4(that's not TCG'd)01:08
shorne[  175.065665] reboot: Restarting system <-- arm01:09
shorne[  240.112282] reboot: Restarting system <-- riscv3201:09
zx2c4these are concurrent runs though01:09
zx2c4lemme do one in isolation01:09
shorneright, mine is in isolation01:09
shorneinteresting 01:10
shorne[  459.399223] reboot: Restarting system <--- riscv32 net-next01:10
zx2c4likely due to net.git being pushed at the same time01:11
shorneanyway I guess its just load on the server01:11
zx2c4huh, an or1k run just succeeded for me01:11
zx2c4(that previously failed)01:12
zx2c4so i guess there's some race01:12
shorneyeah01:12
shornedefintely some sort of race01:12
shornenow my run is slow... [  634.480000] wireguard: wg0: Interface created01:12
zx2c4oof01:13
shornesuccessful, but01:13
shorne[  670.180000] reboot: Restarting system01:13
zx2c4i guess it escaped the deadlock somehow, eventually01:13
shorne[+] NS0: ip link add dev wg0 type wireguard01:13
shorne[  114.432000] wireguard: wg0: Interface created01:13
shorne[+] NS0: wg setconf wg0 /dev/fd/6301:13
shorne[  560.792000] wireguard: wg0: Peer 14 created01:13
shorne[+] NS0: wg show wg0 allowed-ips01:13
shorne[+] NS0: ip link del wg001:14
shorne[  634.428000] wireguard: wg0: Peer 14 ((einval)) destroyed01:14
zx2c4yea thats where time gets killed for me too01:14
shorneit usually had the delay around this point01:14
shornesometimes 1 minute01:14
shornesometims 501:14
zx2c4it's when generating a massive string in bash01:14
shornesometimes 10+01:14
zx2c4which involves lots of memory usage and allocations01:14
shorneI remember you mentioned that before, how were you able to observe that is bash allocating a string before?01:15
shorneyou just know thats what its doing there?01:15
zx2c4yea01:15
zx2c4ive starred at this test a lot :P01:15
zx2c4ARCH=arm in isolation: [  125.600957] reboot: Restarting system01:15
shorneright, so I should be able to get around 20001:16
shornemy compiler is not very well optimized compared to what they have invested in the compiler and instruction set in arm :)01:16
zx2c4https://www.irccloud.com/pastebin/xPjmUA25/01:17
tpbTitle: Snippet | IRCCloud (at www.irccloud.com)01:17
zx2c4ARCH=riscv32 in isolation: [  136.358275] reboot: Restarting system01:20
shorneI see, that config it creates is big01:21
shornebut not super huge01:24
shorne~2.4mb01:25
shornen0 wg setconf wg0 <(printf '%s\n' "${config[@]}"01:29
shorneand this first one with 255x255 seems to be the one thats taking the most time01:30

Generated by irclog2html.py 2.17.2 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!