Gentoo Forums
Gentoo Forums
Gentoo Forums
Quick Search: in
[SOLVED] torque : qmgr: cannot connect to server
View unanswered posts
View posts from last 24 hours

 
Reply to topic    Gentoo Forums Forum Index Portage & Programming
View previous topic :: View next topic  
Author Message
dpaddy
Tux's lil' helper
Tux's lil' helper


Joined: 25 Jun 2008
Posts: 117

PostPosted: Mon Apr 24, 2017 8:47 pm    Post subject: [SOLVED] torque : qmgr: cannot connect to server Reply with quote

Torque
Quote:
sys-cluster/torque-4.1.7-r1::gentoo USE="crypt doc server syslog -cpusets -drmaa (-libressl) -munge -nvidia -tk"
builds and installs
Quote:
>>> Installing (1 of 1) sys-cluster/torque-4.1.7-r1::gentoo
* If this is the first time torque has been installed, then you are not
* ready to start the server. Please refer to the documentation located at:
* http://www.clusterresources.com/wiki/doku.php?id=torque:torque_wiki

* For a basic setup, you may use emerge --config torque

* Important 4.0+ updates
* - The on-wire protocol version has been changed.
* Versions of Torque before 4.0.0 are no longer able to communicate.
* - pbs_iff has been replaced by trqauthd, you will now need to add
* trqauthd to your default runlevel.

>>> Recording sys-cluster/torque in "world" favorites file...

* Messages for package sys-cluster/torque-4.1.7-r1:

* If this is the first time torque has been installed, then you are not
* ready to start the server. Please refer to the documentation located at:
* http://www.clusterresources.com/wiki/doku.php?id=torque:torque_wiki
* For a basic setup, you may use emerge --config torque
* Important 4.0+ updates
* - The on-wire protocol version has been changed.
* Versions of Torque before 4.0.0 are no longer able to communicate.
* - pbs_iff has been replaced by trqauthd, you will now need to add
* trqauthd to your default runlevel.
>>> Auto-cleaning packages...

>>> No outdated packages were found on your system.
but I can't configure it.
Code:
# emerge --config torque

Configuring pkg...

 * Configuring Torque ...
 * Using /var/spool/torque as the pbs homedir
 * Using hp24 as the pbs_server
 * Previous Torque configuration detected.  Press Enter to
 * continue or Control-C to abort now

Unable to communicate with hp24(192.168.0.19)
Communication failure.
qmgr: cannot connect to server hp24 (errno=15096) Unable to get connection to socket
 * ERROR: sys-cluster/torque-4.1.7-r1::gentoo failed (config phase):
 *   (no error message)
 *
 * Call stack:
 *     ebuild.sh, line 115:  Called pkg_config
 *   environment, line 2688:  Called die
 * The specific snippet of code:
 *               ${qmgr} "set server operators = root@$(hostname -f)" ${PBS_SERVER_NAME} && ${qmgr} "create queue batch" ${PBS_SERVER_NAME} && ${qmgr} "set queue batch queue_type = Execution" ${PBS_SERVER_NAME} && ${qmgr} "set queue batch started = True" ${PBS_SERVER_NAME} && ${qmgr} "set queue batch enabled = True" ${PBS_SERVER_NAME} && ${qmgr} "set server default_queue = batch" ${PBS_SERVER_NAME} && ${qmgr} "set server resources_default.nodes = 1" ${PBS_SERVER_NAME} && ${qmgr} "set server scheduling = True" ${PBS_SERVER_NAME} || die;
 *
 * If you need support, post the output of `emerge --info '=sys-cluster/torque-4.1.7-r1::gentoo'`,
 * the complete build log and the output of `emerge -pqv '=sys-cluster/torque-4.1.7-r1::gentoo'`.
 * The complete build log is located at '/var/tmp/portage/sys-cluster/torque-4.1.7-r1/temp/build.log'.
 * The ebuild environment file is located at '/var/tmp/portage/sys-cluster/torque-4.1.7-r1/temp/environment'.
 * Working directory: '/var/tmp/portage/sys-cluster/torque-4.1.7-r1/homedir'
 * S: '/var/tmp/portage/sys-cluster/torque-4.1.7-r1/work/torque-4.1.7'


I believe the problem is with the following which I think is part of what the above emerge does
Code:
 # qmgr -c "set server acl_hosts = hp24"
Unable to communicate with hp24(192.168.0.19)
Communication failure.
qmgr: cannot connect to server  (errno=15096) Unable to get connection to socket

Perhaps the issue has to do not so much with torque as with basic communication. The machine I'm executing the above on is
Code:

# uname -a
Linux hp24 4.9.16-gentoo #2 SMP Fri Apr 21 11:41:23 EDT 2017 x86_64 Six-Core AMD Opteron(tm) Processor 2435 AuthenticAMD GNU/Linux
# ifconfig
eth0: flags=4163<UP,BROADCAST,RUNNING,MULTICAST>  mtu 1500
        inet 192.168.0.19  netmask 255.255.255.0  broadcast 192.168.0.255
        inet6 fe80::1aa9:5ff:fe69:d58  prefixlen 64  scopeid 0x20<link>
        ether 18:a9:05:69:0d:58  txqueuelen 1000  (Ethernet)
        RX packets 69571  bytes 41053855 (39.1 MiB)
        RX errors 0  dropped 0  overruns 0  frame 0
        TX packets 66016  bytes 13911013 (13.2 MiB)
        TX errors 0  dropped 0 overruns 0  carrier 0  collisions 0
        device interrupt 21 

lo: flags=73<UP,LOOPBACK,RUNNING>  mtu 65536
        inet 127.0.0.1  netmask 255.0.0.0
        inet6 ::1  prefixlen 128  scopeid 0x10<host>
        loop  txqueuelen 1  (Local Loopback)
        RX packets 200  bytes 10244 (10.0 KiB)
        RX errors 0  dropped 0  overruns 0  frame 0
        TX packets 200  bytes 10244 (10.0 KiB)
        TX errors 0  dropped 0 overruns 0  carrier 0  collisions 0


Before executing any of the above, I turned off the firewall
Code:
 # /etc/init.d/nftables stop
 * Saving nftables state ...
 * Stopping firewall ...                                                          [ ok ]
and started trqauthd
Code:
# /etc/init.d/trqauthd restart
hostname: hp24
pbs_server port is: 15001
trqauthd daemonized - port 15005
Perhaps unrelated (though likely a problem of sorts), I noticed the following in /var/log/messages
Quote:
Apr 24 09:03:08 hp24 PBS_Server[7728]: LOG_ERROR::Permission denied (13) in chk_file_sec, Security violation with "/var/spool/torque/spool/" - /var/spool/torque/spool/ cannot be accessed
Apr 24 09:03:08 hp24 PBS_Server[7728]: LOG_ERROR::PBS_Server, pbsd_init failed

I don't know if it matters, but strace uncovered that /var/spool/torque/torque.cfg does not exist (and if it should, then I have no idea what exactly it should be / contain).

I can't read/grock strace, but its output is
Code:

mprotect(0x7fd48e552000, 2093056, PROT_NONE) = 0execve("/usr/bin/qmgr", ["qmgr", "-c", "set server acl_hosts = hp24"], [/* 27 vars */])\
 = 0
brk(NULL)                               = 0x1919000
mmap(NULL, 4096, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_ANONYMOUS, -1, 0) = 0x7fd49158d0\
00
access("/etc/ld.so.preload", R_OK)      = -1 ENOENT (No such file or directory)
open("/etc/ld.so.cache", O_RDONLY|O_CLOEXEC) = 3
fstat(3, {st_mode=S_IFREG|0644, st_size=34104, ...}) = 0
mmap(NULL, 34104, PROT_READ, MAP_PRIVATE, 3, 0) = 0x7fd491584000
close(3)                                = 0
open("/usr/lib64/libtorque.so.2", O_RDONLY|O_CLOEXEC) = 3
read(3, "\177ELF\2\1\1\0\0\0\0\0\0\0\0\0\3\0>\0\1\0\0\0\300\0\1\0\0\0\0\0"..., 832) = 8\
32
fstat(3, {st_mode=S_IFREG|0755, st_size=334280, ...}) = 0
mmap(NULL, 9293952, PROT_READ|PROT_EXEC, MAP_PRIVATE|MAP_DENYWRITE, 3, 0) = 0x7fd490a8d\
000
mprotect(0x7fd490adc000, 2093056, PROT_NONE) = 0
mmap(0x7fd490cdb000, 16384, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_FIXED|MAP_DENYWRITE, \
3, 0x4e000) = 0x7fd490cdb000
mmap(0x7fd490cdf000, 6860928, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_FIXED|MAP_ANONYMOUS\
, -1, 0) = 0x7fd490cdf000
close(3)                                = 0
open("/lib64/libreadline.so.6", O_RDONLY|O_CLOEXEC) = 3
read(3, "\177ELF\2\1\1\0\0\0\0\0\0\0\0\0\3\0>\0\1\0\0\0\320b\1\0\0\0\0\0"..., 832) = 83\
2
fstat(3, {st_mode=S_IFREG|0555, st_size=296792, ...}) = 0
mmap(NULL, 2398008, PROT_READ|PROT_EXEC, MAP_PRIVATE|MAP_DENYWRITE, 3, 0) = 0x7fd490843\
000
mprotect(0x7fd490884000, 2093056, PROT_NONE) = 0
mmap(0x7fd490a83000, 32768, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_FIXED|MAP_DENYWRITE, \
3, 0x40000) = 0x7fd490a83000
mmap(0x7fd490a8b000, 5944, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_FIXED|MAP_ANONYMOUS, -\
1, 0) = 0x7fd490a8b000
close(3)                                = 0
mmap(NULL, 4096, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_ANONYMOUS, -1, 0) = 0x7fd4915830\
00
open("/lib64/libc.so.6", O_RDONLY|O_CLOEXEC) = 3
read(3, "\177ELF\2\1\1\3\0\0\0\0\0\0\0\0\3\0>\0\1\0\0\0\240\10\2\0\0\0\0\0"..., 832) = \
832
fstat(3, {st_mode=S_IFREG|0755, st_size=1685320, ...}) = 0
mmap(NULL, 3792216, PROT_READ|PROT_EXEC, MAP_PRIVATE|MAP_DENYWRITE, 3, 0) = 0x7fd4904a5\
000
mprotect(0x7fd490639000, 2097152, PROT_NONE) = 0
mmap(0x7fd490839000, 24576, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_FIXED|MAP_DENYWRITE, \
3, 0x194000) = 0x7fd490839000
mmap(0x7fd49083f000, 15704, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_FIXED|MAP_ANONYMOUS, \
-1, 0) = 0x7fd49083f000
close(3)                                = 0
open("/usr/lib64/libxml2.so.2", O_RDONLY|O_CLOEXEC) = 3
read(3, "\177ELF\2\1\1\0\0\0\0\0\0\0\0\0\3\0>\0\1\0\0\0`\340\2\0\0\0\0\0"..., 832) = 83\
2
fstat(3, {st_mode=S_IFREG|0755, st_size=1479440, ...}) = 0
mmap(NULL, 3580056, PROT_READ|PROT_EXEC, MAP_PRIVATE|MAP_DENYWRITE, 3, 0) = 0x7fd49013a\
000
mprotect(0x7fd490299000, 2097152, PROT_NONE) = 0
mmap(0x7fd490499000, 40960, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_FIXED|MAP_DENYWRITE, \
3, 0x15f000) = 0x7fd490499000
mmap(0x7fd4904a3000, 4248, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_FIXED|MAP_ANONYMOUS, -\
1, 0) = 0x7fd4904a3000
close(3)                                = 0
open("/lib64/libz.so.1", O_RDONLY|O_CLOEXEC) = 3
read(3, "\177ELF\2\1\1\0\0\0\0\0\0\0\0\0\3\0>\0\1\0\0\0@%\0\0\0\0\0\0"..., 832) = 832
fstat(3, {st_mode=S_IFREG|0755, st_size=92464, ...}) = 0
mmap(NULL, 2187664, PROT_READ|PROT_EXEC, MAP_PRIVATE|MAP_DENYWRITE, 3, 0) = 0x7fd48ff23\
000
mprotect(0x7fd48ff39000, 2093056, PROT_NONE) = 0
mmap(0x7fd490138000, 8192, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_FIXED|MAP_DENYWRITE, 3\
, 0x15000) = 0x7fd490138000
close(3)                                = 0
open("/lib64/libm.so.6", O_RDONLY|O_CLOEXEC) = 3
read(3, "\177ELF\2\1\1\3\0\0\0\0\0\0\0\0\3\0>\0\1\0\0\0@U\0\0\0\0\0\0"..., 832) = 832
fstat(3, {st_mode=S_IFREG|0755, st_size=1067144, ...}) = 0
mmap(NULL, 4096, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_ANONYMOUS, -1, 0) = 0x7fd4915820\
00
mmap(NULL, 3162400, PROT_READ|PROT_EXEC, MAP_PRIVATE|MAP_DENYWRITE, 3, 0) = 0x7fd48fc1e\
000
mprotect(0x7fd48fd22000, 2093056, PROT_NONE) = 0
mmap(0x7fd48ff21000, 8192, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_FIXED|MAP_DENYWRITE, 3\
, 0x103000) = 0x7fd48ff21000
close(3)                                = 0
open("/lib64/libdl.so.2", O_RDONLY|O_CLOEXEC) = 3
read(3, "\177ELF\2\1\1\0\0\0\0\0\0\0\0\0\3\0>\0\1\0\0\0\200\r\0\0\0\0\0\0"..., 832) = 8\
32
fstat(3, {st_mode=S_IFREG|0755, st_size=14360, ...}) = 0
mmap(NULL, 2109680, PROT_READ|PROT_EXEC, MAP_PRIVATE|MAP_DENYWRITE, 3, 0) = 0x7fd48fa1a\
000
mprotect(0x7fd48fa1c000, 2097152, PROT_NONE) = 0
mmap(0x7fd48fc1c000, 8192, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_FIXED|MAP_DENYWRITE, 3\
, 0x2000) = 0x7fd48fc1c000
close(3)                                = 0
open("/usr/lib64/libcrypto.so.1.0.0", O_RDONLY|O_CLOEXEC) = 3
read(3, "\177ELF\2\1\1\0\0\0\0\0\0\0\0\0\3\0>\0\1\0\0\0\340\312\5\0\0\0\0\0"..., 832) =\
 832
fstat(3, {st_mode=S_IFREG|0555, st_size=1814544, ...}) = 0
mmap(NULL, 3924896, PROT_READ|PROT_EXEC, MAP_PRIVATE|MAP_DENYWRITE, 3, 0) = 0x7fd48f65b\
000
mprotect(0x7fd48f7f1000, 2093056, PROT_NONE) = 0
mmap(0x7fd48f9f0000, 155648, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_FIXED|MAP_DENYWRITE,\
 3, 0x195000) = 0x7fd48f9f0000
mmap(0x7fd48fa16000, 13216, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_FIXED|MAP_ANONYMOUS, \
-1, 0) = 0x7fd48fa16000
close(3)                                = 0
open("/usr/lib64/libssl.so.1.0.0", O_RDONLY|O_CLOEXEC) = 3
read(3, "\177ELF\2\1\1\0\0\0\0\0\0\0\0\0\3\0>\0\1\0\0\0\340\215\1\0\0\0\0\0"..., 832) =\
 832
fstat(3, {st_mode=S_IFREG|0555, st_size=424728, ...}) = 0
mmap(NULL, 4096, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_ANONYMOUS, -1, 0) = 0x7fd4915810\
00
mmap(NULL, 2520304, PROT_READ|PROT_EXEC, MAP_PRIVATE|MAP_DENYWRITE, 3, 0) = 0x7fd48f3f3\
000
mprotect(0x7fd48f452000, 2093056, PROT_NONE) = 0
mmap(0x7fd48f651000, 40960, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_FIXED|MAP_DENYWRITE, \
3, 0x5e000) = 0x7fd48f651000
close(3)                                = 0
open("/lib64/libpthread.so.0", O_RDONLY|O_CLOEXEC) = 3
read(3, "\177ELF\2\1\1\0\0\0\0\0\0\0\0\0\3\0>\0\1\0\0\0\0a\0\0\0\0\0\0"..., 832) = 832
fstat(3, {st_mode=S_IFREG|0755, st_size=131096, ...}) = 0
mmap(NULL, 2208872, PROT_READ|PROT_EXEC, MAP_PRIVATE|MAP_DENYWRITE, 3, 0) = 0x7fd48f1d7\
000
mprotect(0x7fd48f1ee000, 2093056, PROT_NONE) = 0
mmap(0x7fd48f3ed000, 8192, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_FIXED|MAP_DENYWRITE, 3\
, 0x16000) = 0x7fd48f3ed000
mmap(0x7fd48f3ef000, 13416, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_FIXED|MAP_ANONYMOUS, \
-1, 0) = 0x7fd48f3ef000
close(3)                                = 0
open("/lib64/librt.so.1", O_RDONLY|O_CLOEXEC) = 3
read(3, "\177ELF\2\1\1\0\0\0\0\0\0\0\0\0\3\0>\0\1\0\0\0\340 \0\0\0\0\0\0"..., 832) = 83\
2
fstat(3, {st_mode=S_IFREG|0755, st_size=31472, ...}) = 0
mmap(NULL, 2128864, PROT_READ|PROT_EXEC, MAP_PRIVATE|MAP_DENYWRITE, 3, 0) = 0x7fd48efcf\
000
mprotect(0x7fd48efd6000, 2093056, PROT_NONE) = 0
mmap(0x7fd48f1d5000, 8192, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_FIXED|MAP_DENYWRITE, 3\
, 0x6000) = 0x7fd48f1d5000
close(3)                                = 0
open("/lib64/libncurses.so.6", O_RDONLY|O_CLOEXEC) = 3
read(3, "\177ELF\2\1\1\0\0\0\0\0\0\0\0\0\3\0>\0\1\0\0\0\340X\1\0\0\0\0\0"..., 832) = 83\
2
fstat(3, {st_mode=S_IFREG|0755, st_size=386848, ...}) = 0
mmap(NULL, 4096, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_ANONYMOUS, -1, 0) = 0x7fd4915800\
00
mmap(NULL, 2484624, PROT_READ|PROT_EXEC, MAP_PRIVATE|MAP_DENYWRITE, 3, 0) = 0x7fd48ed70\
000
mprotect(0x7fd48edca000, 2093056, PROT_NONE) = 0
mmap(0x7fd48efc9000, 20480, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_FIXED|MAP_DENYWRITE, \
3, 0x59000) = 0x7fd48efc9000
mmap(0x7fd48efce000, 2448, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_FIXED|MAP_ANONYMOUS, -\
1, 0) = 0x7fd48efce000
close(3)                                = 0
mmap(NULL, 4096, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_ANONYMOUS, -1, 0) = 0x7fd49157f0\
00
mmap(NULL, 4096, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_ANONYMOUS, -1, 0) = 0x7fd49157e0\
00
mmap(NULL, 4096, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_ANONYMOUS, -1, 0) = 0x7fd49157d0\
00
mmap(NULL, 4096, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_ANONYMOUS, -1, 0) = 0x7fd49157c0\
00
arch_prctl(ARCH_SET_FS, 0x7fd49157d700) = 0
mprotect(0x7fd490839000, 16384, PROT_READ) = 0
mprotect(0x7fd48fc1c000, 4096, PROT_READ) = 0
mprotect(0x7fd48efc9000, 16384, PROT_READ) = 0
mprotect(0x7fd48f3ed000, 4096, PROT_READ) = 0
mprotect(0x7fd48f1d5000, 4096, PROT_READ) = 0
mprotect(0x7fd48f9f0000, 110592, PROT_READ) = 0
mprotect(0x7fd48f651000, 16384, PROT_READ) = 0
mprotect(0x7fd48ff21000, 4096, PROT_READ) = 0
mprotect(0x7fd490138000, 4096, PROT_READ) = 0
mprotect(0x7fd490499000, 32768, PROT_READ) = 0
mprotect(0x7fd490a83000, 8192, PROT_READ) = 0
mprotect(0x7fd490cdb000, 4096, PROT_READ) = 0
mprotect(0x609000, 4096, PROT_READ)     = 0
mprotect(0x7fd49158e000, 4096, PROT_READ) = 0
munmap(0x7fd491584000, 34104)           = 0
set_tid_address(0x7fd49157d9d0)         = 12224
set_robust_list(0x7fd49157d9e0, 24)     = 0
rt_sigaction(SIGRTMIN, {0x7fd48f1dcb40, [], SA_RESTORER|SA_SIGINFO, 0x7fd48f1e83b0}, NU\
LL, 8) = 0
rt_sigaction(SIGRT_1, {0x7fd48f1dcbe0, [], SA_RESTORER|SA_RESTART|SA_SIGINFO, 0x7fd48f1\
e83b0}, NULL, 8) = 0
rt_sigprocmask(SIG_UNBLOCK, [RTMIN RT_1], NULL, 8) = 0
getrlimit(RLIMIT_STACK, {rlim_cur=8192*1024, rlim_max=RLIM64_INFINITY}) = 0
brk(NULL)                               = 0x1919000
brk(0x193a000)                          = 0x193a000
brk(0x1969000)                          = 0x1969000
open("/var/spool/torque/server_name", O_RDONLY) = 3
fstat(3, {st_mode=S_IFREG|0644, st_size=5, ...}) = 0
read(3, "hp24\n", 4096)                 = 5
read(3, "", 4096)                       = 0
close(3)                                = 0
open("/etc/nsswitch.conf", O_RDONLY|O_CLOEXEC) = 3
fstat(3, {st_mode=S_IFREG|0644, st_size=508, ...}) = 0
read(3, "# /etc/nsswitch.conf:\n# $Header:"..., 4096) = 508
read(3, "", 4096)                       = 0
close(3)                                = 0
open("/etc/ld.so.cache", O_RDONLY|O_CLOEXEC) = 3
fstat(3, {st_mode=S_IFREG|0644, st_size=34104, ...}) = 0
mmap(NULL, 34104, PROT_READ, MAP_PRIVATE, 3, 0) = 0x7fd491584000
close(3)                                = 0
open("/lib64/libnss_db.so.2", O_RDONLY|O_CLOEXEC) = 3
read(3, "\177ELF\2\1\1\0\0\0\0\0\0\0\0\0\3\0>\0\1\0\0\0\200\27\0\0\0\0\0\0"..., 832) = \
832
fstat(3, {st_mode=S_IFREG|0755, st_size=30864, ...}) = 0
mmap(NULL, 2126880, PROT_READ|PROT_EXEC, MAP_PRIVATE|MAP_DENYWRITE, 3, 0) = 0x7fd48eb68\
000
mprotect(0x7fd48eb6e000, 2097152, PROT_NONE) = 0
mmap(0x7fd48ed6e000, 8192, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_FIXED|MAP_DENYWRITE, 3\
, 0x6000) = 0x7fd48ed6e000
close(3)                                = 0
open("/lib64/libnss_files.so.2", O_RDONLY|O_CLOEXEC) = 3
read(3, "\177ELF\2\1\1\0\0\0\0\0\0\0\0\0\3\0>\0\1\0\0\0`!\0\0\0\0\0\0"..., 832) = 832
fstat(3, {st_mode=S_IFREG|0755, st_size=43248, ...}) = 0
mmap(NULL, 2139528, PROT_READ|PROT_EXEC, MAP_PRIVATE|MAP_DENYWRITE, 3, 0) = 0x7fd48e95d\
000
mprotect(0x7fd48e967000, 2093056, PROT_NONE) = 0
mmap(0x7fd48eb66000, 8192, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_FIXED|MAP_DENYWRITE, 3\
, 0x9000) = 0x7fd48eb66000
close(3)                                = 0
mprotect(0x7fd48eb66000, 4096, PROT_READ) = 0
mprotect(0x7fd48ed6e000, 4096, PROT_READ) = 0
munmap(0x7fd491584000, 34104)           = 0
open("/var/db/services.db", O_RDONLY|O_CLOEXEC) = -1 ENOENT (No such file or directory)
open("/etc/services", O_RDONLY|O_CLOEXEC) = 3
fstat(3, {st_mode=S_IFREG|0644, st_size=36475, ...}) = 0
read(3, "# /etc/services\n#\n# Network serv"..., 4096) = 4096
read(3, " private\t77/tcp\t\t\t\t# any private"..., 4096) = 4096
read(3, "e\nemfis-cntl\t141/udp\nimap\t\t143/t"..., 4096) = 4096
read(3, "dialog\t360/tcp\t\t\t\t# scoi2odialog"..., 4096) = 4096
read(3, "\t\tdqs313_intercell\ncryptoadmin\t6"..., 4096) = 4096
read(3, "# Citrix ICA Client\nica\t\t1494/ud"..., 4096) = 4096
read(3, "erPlay Control\nppcontrol\t2505/ud"..., 4096) = 4096
read(3, "cp\t\t\t# CFengine\ncfengine\t5308/ud"..., 4096) = 4096
read(3, "ce\t\t11111/udp\nsmsqp\t\t11201/tcp\t\t"..., 4096) = 3707
read(3, "", 4096)                       = 0
close(3)                                = 0
getuid()                                = 0
open("/etc/ld.so.cache", O_RDONLY|O_CLOEXEC) = 3
fstat(3, {st_mode=S_IFREG|0644, st_size=34104, ...}) = 0
mmap(NULL, 34104, PROT_READ, MAP_PRIVATE, 3, 0) = 0x7fd491584000
close(3)                                = 0
open("/lib64/libnss_compat.so.2", O_RDONLY|O_CLOEXEC) = 3
read(3, "\177ELF\2\1\1\0\0\0\0\0\0\0\0\0\3\0>\0\1\0\0\0\300\22\0\0\0\0\0\0"..., 832) = \
832
fstat(3, {st_mode=S_IFREG|0755, st_size=31336, ...}) = 0
mmap(NULL, 2126944, PROT_READ|PROT_EXEC, MAP_PRIVATE|MAP_DENYWRITE, 3, 0) = 0x7fd48e755\
000
mprotect(0x7fd48e75c000, 2093056, PROT_NONE) = 0
mmap(0x7fd48e95b000, 8192, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_FIXED|MAP_DENYWRITE, 3\
, 0x6000) = 0x7fd48e95b000
close(3)                                = 0
open("/lib64/libnsl.so.1", O_RDONLY|O_CLOEXEC) = 3
read(3, "\177ELF\2\1\1\0\0\0\0\0\0\0\0\0\3\0>\0\1\0\0\0\340?\0\0\0\0\0\0"..., 832) = 83\
2
fstat(3, {st_mode=S_IFREG|0755, st_size=84688, ...}) = 0
mmap(NULL, 2190232, PROT_READ|PROT_EXEC, MAP_PRIVATE|MAP_DENYWRITE, 3, 0) = 0x7fd48e53e\
000
mprotect(0x7fd48e552000, 2093056, PROT_NONE) = 0
mmap(0x7fd48e751000, 8192, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_FIXED|MAP_DENYWRITE, 3\
, 0x13000) = 0x7fd48e751000
mmap(0x7fd48e753000, 7064, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_FIXED|MAP_ANONYMOUS, -\
1, 0) = 0x7fd48e753000
close(3)                                = 0
mprotect(0x7fd48e751000, 4096, PROT_READ) = 0
mprotect(0x7fd48e95b000, 4096, PROT_READ) = 0
munmap(0x7fd491584000, 34104)           = 0
open("/etc/ld.so.cache", O_RDONLY|O_CLOEXEC) = 3
fstat(3, {st_mode=S_IFREG|0644, st_size=34104, ...}) = 0
mmap(NULL, 34104, PROT_READ, MAP_PRIVATE, 3, 0) = 0x7fd491584000
close(3)                                = 0
open("/lib64/libnss_nis.so.2", O_RDONLY|O_CLOEXEC) = 3
read(3, "\177ELF\2\1\1\0\0\0\0\0\0\0\0\0\3\0>\0\1\0\0\0\240 \0\0\0\0\0\0"..., 832) = 83\
2
fstat(3, {st_mode=S_IFREG|0755, st_size=47400, ...}) = 0
mmap(NULL, 2143624, PROT_READ|PROT_EXEC, MAP_PRIVATE|MAP_DENYWRITE, 3, 0) = 0x7fd48e332\
000
mprotect(0x7fd48e33c000, 2097152, PROT_NONE) = 0
mmap(0x7fd48e53c000, 8192, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_FIXED|MAP_DENYWRITE, 3\
, 0xa000) = 0x7fd48e53c000
close(3)                                = 0
mprotect(0x7fd48e53c000, 4096, PROT_READ) = 0
munmap(0x7fd491584000, 34104)           = 0
open("/etc/passwd", O_RDONLY|O_CLOEXEC) = 3
lseek(3, 0, SEEK_CUR)                   = 0
fstat(3, {st_mode=S_IFREG|0644, st_size=830, ...}) = 0
mmap(NULL, 830, PROT_READ, MAP_SHARED, 3, 0) = 0x7fd49158c000
lseek(3, 830, SEEK_SET)                 = 830
munmap(0x7fd49158c000, 830)             = 0
close(3)                                = 0
socket(AF_INET, SOCK_STREAM, IPPROTO_IP) = 3
stat("/var/spool/torque/torque.cfg", 0x7ffff28798d0) = -1 ENOENT (No such file or direc\
tory)
socket(AF_NETLINK, SOCK_RAW, NETLINK_ROUTE) = 4
bind(4, {sa_family=AF_NETLINK, nl_pid=0, nl_groups=00000000}, 12) = 0
getsockname(4, {sa_family=AF_NETLINK, nl_pid=12224, nl_groups=00000000}, [12]) = 0
sendto(4, {{len=20, type=0x16 /* NLMSG_??? */, flags=NLM_F_REQUEST|0x300, seq=149305206\
1, pid=0}, "\0\0\0\0"}, 20, 0, {sa_family=AF_NETLINK, nl_pid=0, nl_groups=00000000}, 12\
) = 20
recvmsg(4, {msg_name={sa_family=AF_NETLINK, nl_pid=0, nl_groups=00000000}, msg_namelen=\
12, msg_iov=[{iov_base=[{{len=84, type=0x14 /* NLMSG_??? */, flags=NLM_F_MULTI, seq=149\
3052061, pid=12224}, "\2\10\200\376\1\0\0\0\10\0\1\0\177\0\0\1\10\0\2\0\177\0\0\1\10\0\\
4\0\177\377\377\377"...}, {{len=88, type=0x14 /* NLMSG_??? */, flags=NLM_F_MULTI, seq=1\
493052061, pid=12224}, "\2\30\200\0\2\0\0\0\10\0\1\0\300\250\0\23\10\0\2\0\300\250\0\23\
\10\0\4\0\300\250\0\377"...}, {{len=0, type=0 /* NLMSG_??? */, flags=0, seq=0, pid=0}}]\
, iov_len=4096}], msg_iovlen=1, msg_controllen=0, msg_flags=0}, 0) = 172
recvmsg(4, {msg_name={sa_family=AF_NETLINK, nl_pid=0, nl_groups=00000000}, msg_namelen=\
12, msg_iov=[{iov_base=[{{len=72, type=0x14 /* NLMSG_??? */, flags=NLM_F_MULTI, seq=149\
3052061, pid=12224}, "\n\200\200\376\1\0\0\0\24\0\1\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\1\2\
4\0\6\0"...}, {{len=72, type=0x14 /* NLMSG_??? */, flags=NLM_F_MULTI, seq=1493052061, p\
id=12224}, "\n@\200\375\2\0\0\0\24\0\1\0\376\200\0\0\0\0\0\0\32\251\5\377\376i\rX\24\0\\
6\0"...}, {{len=524296, type=0x80 /* NLMSG_??? */, flags=0, seq=393236, pid=4294967295}\
, "\377\377\377\377\32\t\0\0\32\t\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0"...}], io\
v_len=4096}], msg_iovlen=1, msg_controllen=0, msg_flags=0}, 0) = 144
recvmsg(4, {msg_name={sa_family=AF_NETLINK, nl_pid=0, nl_groups=00000000}, msg_namelen=\
12, msg_iov=[{iov_base=[{{len=20, type=NLMSG_DONE, flags=NLM_F_MULTI, seq=1493052061, p\
id=12224}, "\0\0\0\0"}, {{len=1, type=0x14 /* NLMSG_??? */, flags=NLM_F_REQUEST, seq=0,\
 pid=0}}], iov_len=4096}], msg_iovlen=1, msg_controllen=0, msg_flags=0}, 0) = 20
close(4)                                = 0
open("/etc/host.conf", O_RDONLY|O_CLOEXEC) = 4
fstat(4, {st_mode=S_IFREG|0644, st_size=935, ...}) = 0
read(4, "# /etc/host.conf:\n# $Header: /va"..., 4096) = 935
read(4, "", 4096)                       = 0
close(4)                                = 0
futex(0x7fd490841e04, FUTEX_WAKE_PRIVATE, 2147483647) = 0
open("/etc/resolv.conf", O_RDONLY|O_CLOEXEC) = 4
fstat(4, {st_mode=S_IFREG|0644, st_size=179, ...}) = 0
read(4, "# Generated by dhcpcd from eth0."..., 4096) = 179
read(4, "", 4096)                       = 0
close(4)                                = 0
uname({sysname="Linux", nodename="hp24", ...}) = 0
open("/etc/hosts", O_RDONLY|O_CLOEXEC)  = 4
fstat(4, {st_mode=S_IFREG|0644, st_size=3207, ...}) = 0
read(4, "# /etc/hosts: Local Host Databas"..., 4096) = 3207
read(4, "", 4096)                       = 0
close(4)                                = 0
fcntl(3, F_GETFL)                       = 0x2 (flags O_RDWR)
fcntl(3, F_SETFL, O_RDWR|O_NONBLOCK)    = 0
connect(3, {sa_family=AF_INET, sin_port=htons(15001), sin_addr=inet_addr("192.168.0.19"\
)}, 16) = -1 EINPROGRESS (Operation now in progress)
select(4, NULL, [3], NULL, {10800, 0})  = 1 (out [3], left {10799, 999997})
getsockopt(3, SOL_SOCKET, SO_ERROR, [111], [4]) = 0
close(3)                                = 0
nanosleep({0, 1000000}, NULL)           = 0
socket(AF_INET, SOCK_STREAM, IPPROTO_IP) = 3
stat("/var/spool/torque/torque.cfg", 0x7ffff28798d0) = -1 ENOENT (No such file or direc\
tory)
socket(AF_NETLINK, SOCK_RAW, NETLINK_ROUTE) = 4
bind(4, {sa_family=AF_NETLINK, nl_pid=0, nl_groups=00000000}, 12) = 0
getsockname(4, {sa_family=AF_NETLINK, nl_pid=12224, nl_groups=00000000}, [12]) = 0
sendto(4, {{len=20, type=0x16 /* NLMSG_??? */, flags=NLM_F_REQUEST|0x300, seq=149305206\
1, pid=0}, "\0\0\0\0"}, 20, 0, {sa_family=AF_NETLINK, nl_pid=0, nl_groups=00000000}, 12\
) = 20
recvmsg(4, {msg_name={sa_family=AF_NETLINK, nl_pid=0, nl_groups=00000000}, msg_namelen=\
12, msg_iov=[{iov_base=[{{len=84, type=0x14 /* NLMSG_??? */, flags=NLM_F_MULTI, seq=149\
3052061, pid=12224}, "\2\10\200\376\1\0\0\0\10\0\1\0\177\0\0\1\10\0\2\0\177\0\0\1\10\0\\
4\0\177\377\377\377"...}, {{len=88, type=0x14 /* NLMSG_??? */, flags=NLM_F_MULTI, seq=1\
493052061, pid=12224}, "\2\30\200\0\2\0\0\0\10\0\1\0\300\250\0\23\10\0\2\0\300\250\0\23\
\10\0\4\0\300\250\0\377"...}, {{len=0, type=0 /* NLMSG_??? */, flags=0, seq=0, pid=0}}]\
, iov_len=4096}], msg_iovlen=1, msg_controllen=0, msg_flags=0}, 0) = 172
recvmsg(4, {msg_name={sa_family=AF_NETLINK, nl_pid=0, nl_groups=00000000}, msg_namelen=\
12, msg_iov=[{iov_base=[{{len=72, type=0x14 /* NLMSG_??? */, flags=NLM_F_MULTI, seq=149\
3052061, pid=12224}, "\n\200\200\376\1\0\0\0\24\0\1\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\1\2\
4\0\6\0"...}, {{len=72, type=0x14 /* NLMSG_??? */, flags=NLM_F_MULTI, seq=1493052061, p\
id=12224}, "\n@\200\375\2\0\0\0\24\0\1\0\376\200\0\0\0\0\0\0\32\251\5\377\376i\rX\24\0\\
6\0"...}, {{len=524296, type=0x80 /* NLMSG_??? */, flags=0, seq=393236, pid=4294967295}\
, "\377\377\377\377\32\t\0\0\32\t\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0"...}], io\
v_len=4096}], msg_iovlen=1, msg_controllen=0, msg_flags=0}, 0) = 144
recvmsg(4, {msg_name={sa_family=AF_NETLINK, nl_pid=0, nl_groups=00000000}, msg_namelen=\
12, msg_iov=[{iov_base=[{{len=20, type=NLMSG_DONE, flags=NLM_F_MULTI, seq=1493052061, p\
id=12224}, "\0\0\0\0"}, {{len=1, type=0x14 /* NLMSG_??? */, flags=NLM_F_REQUEST, seq=0,\
 pid=0}}], iov_len=4096}], msg_iovlen=1, msg_controllen=0, msg_flags=0}, 0) = 20
close(4)                                = 0
stat("/etc/resolv.conf", {st_mode=S_IFREG|0644, st_size=179, ...}) = 0
open("/etc/resolv.conf", O_RDONLY|O_CLOEXEC) = 4
fstat(4, {st_mode=S_IFREG|0644, st_size=179, ...}) = 0
read(4, "# Generated by dhcpcd from eth0."..., 4096) = 179
read(4, "", 4096)                       = 0
close(4)                                = 0
uname({sysname="Linux", nodename="hp24", ...}) = 0
open("/etc/hosts", O_RDONLY|O_CLOEXEC)  = 4
fstat(4, {st_mode=S_IFREG|0644, st_size=3207, ...}) = 0
read(4, "# /etc/hosts: Local Host Databas"..., 4096) = 3207
read(4, "", 4096)                       = 0
close(4)                                = 0
fcntl(3, F_GETFL)                       = 0x2 (flags O_RDWR)
fcntl(3, F_SETFL, O_RDWR|O_NONBLOCK)    = 0
connect(3, {sa_family=AF_INET, sin_port=htons(15001), sin_addr=inet_addr("192.168.0.19"\
)}, 16) = -1 EINPROGRESS (Operation now in progress)
select(4, NULL, [3], NULL, {10800, 0})  = 1 (out [3], left {10799, 999997})
getsockopt(3, SOL_SOCKET, SO_ERROR, [111], [4]) = 0
close(3)                                = 0
nanosleep({0, 1000000}, NULL)           = 0
socket(AF_INET, SOCK_STREAM, IPPROTO_IP) = 3
stat("/var/spool/torque/torque.cfg", 0x7ffff28798d0) = -1 ENOENT (No such file or direc\
tory)
socket(AF_NETLINK, SOCK_RAW, NETLINK_ROUTE) = 4
bind(4, {sa_family=AF_NETLINK, nl_pid=0, nl_groups=00000000}, 12) = 0
getsockname(4, {sa_family=AF_NETLINK, nl_pid=12224, nl_groups=00000000}, [12]) = 0
sendto(4, {{len=20, type=0x16 /* NLMSG_??? */, flags=NLM_F_REQUEST|0x300, seq=149305206\
1, pid=0}, "\0\0\0\0"}, 20, 0, {sa_family=AF_NETLINK, nl_pid=0, nl_groups=00000000}, 12\
) = 20
recvmsg(4, {msg_name={sa_family=AF_NETLINK, nl_pid=0, nl_groups=00000000}, msg_namelen=\
12, msg_iov=[{iov_base=[{{len=84, type=0x14 /* NLMSG_??? */, flags=NLM_F_MULTI, seq=149\
3052061, pid=12224}, "\2\10\200\376\1\0\0\0\10\0\1\0\177\0\0\1\10\0\2\0\177\0\0\1\10\0\\
4\0\177\377\377\377"...}, {{len=88, type=0x14 /* NLMSG_??? */, flags=NLM_F_MULTI, seq=1\
493052061, pid=12224}, "\2\30\200\0\2\0\0\0\10\0\1\0\300\250\0\23\10\0\2\0\300\250\0\23\
\10\0\4\0\300\250\0\377"...}, {{len=0, type=0 /* NLMSG_??? */, flags=0, seq=0, pid=0}}]\
, iov_len=4096}], msg_iovlen=1, msg_controllen=0, msg_flags=0}, 0) = 172
recvmsg(4, {msg_name={sa_family=AF_NETLINK, nl_pid=0, nl_groups=00000000}, msg_namelen=\
12, msg_iov=[{iov_base=[{{len=72, type=0x14 /* NLMSG_??? */, flags=NLM_F_MULTI, seq=149\
3052061, pid=12224}, "\n\200\200\376\1\0\0\0\24\0\1\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\1\2\
4\0\6\0"...}, {{len=72, type=0x14 /* NLMSG_??? */, flags=NLM_F_MULTI, seq=1493052061, p\
id=12224}, "\n@\200\375\2\0\0\0\24\0\1\0\376\200\0\0\0\0\0\0\32\251\5\377\376i\rX\24\0\\
6\0"...}, {{len=524296, type=0x80 /* NLMSG_??? */, flags=0, seq=393236, pid=4294967295}\
, "\377\377\377\377\32\t\0\0\32\t\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0"...}], io\
v_len=4096}], msg_iovlen=1, msg_controllen=0, msg_flags=0}, 0) = 144
recvmsg(4, {msg_name={sa_family=AF_NETLINK, nl_pid=0, nl_groups=00000000}, msg_namelen=\
12, msg_iov=[{iov_base=[{{len=20, type=NLMSG_DONE, flags=NLM_F_MULTI, seq=1493052061, p\
id=12224}, "\0\0\0\0"}, {{len=1, type=0x14 /* NLMSG_??? */, flags=NLM_F_REQUEST, seq=0,\
 pid=0}}], iov_len=4096}], msg_iovlen=1, msg_controllen=0, msg_flags=0}, 0) = 20
close(4)                                = 0
open("/etc/hosts", O_RDONLY|O_CLOEXEC)  = 4
fstat(4, {st_mode=S_IFREG|0644, st_size=3207, ...}) = 0
read(4, "# /etc/hosts: Local Host Databas"..., 4096) = 3207
read(4, "", 4096)                       = 0
close(4)                                = 0
fcntl(3, F_GETFL)                       = 0x2 (flags O_RDWR)
fcntl(3, F_SETFL, O_RDWR|O_NONBLOCK)    = 0
connect(3, {sa_family=AF_INET, sin_port=htons(15001), sin_addr=inet_addr("192.168.0.19"\
)}, 16) = -1 EINPROGRESS (Operation now in progress)
select(4, NULL, [3], NULL, {10800, 0})  = 1 (out [3], left {10799, 999997})
getsockopt(3, SOL_SOCKET, SO_ERROR, [111], [4]) = 0
close(3)                                = 0
nanosleep({0, 1000000}, NULL)           = 0
socket(AF_INET, SOCK_STREAM, IPPROTO_IP) = 3
stat("/var/spool/torque/torque.cfg", 0x7ffff28798d0) = -1 ENOENT (No such file or direc\
tory)
socket(AF_NETLINK, SOCK_RAW, NETLINK_ROUTE) = 4
bind(4, {sa_family=AF_NETLINK, nl_pid=0, nl_groups=00000000}, 12) = 0
getsockname(4, {sa_family=AF_NETLINK, nl_pid=12224, nl_groups=00000000}, [12]) = 0
sendto(4, {{len=20, type=0x16 /* NLMSG_??? */, flags=NLM_F_REQUEST|0x300, seq=149305206\
1, pid=0}, "\0\0\0\0"}, 20, 0, {sa_family=AF_NETLINK, nl_pid=0, nl_groups=00000000}, 12\
) = 20
recvmsg(4, {msg_name={sa_family=AF_NETLINK, nl_pid=0, nl_groups=00000000}, msg_namelen=\
12, msg_iov=[{iov_base=[{{len=84, type=0x14 /* NLMSG_??? */, flags=NLM_F_MULTI, seq=149\
3052061, pid=12224}, "\2\10\200\376\1\0\0\0\10\0\1\0\177\0\0\1\10\0\2\0\177\0\0\1\10\0\\
4\0\177\377\377\377"...}, {{len=88, type=0x14 /* NLMSG_??? */, flags=NLM_F_MULTI, seq=1\
493052061, pid=12224}, "\2\30\200\0\2\0\0\0\10\0\1\0\300\250\0\23\10\0\2\0\300\250\0\23\
\10\0\4\0\300\250\0\377"...}, {{len=0, type=0 /* NLMSG_??? */, flags=0, seq=0, pid=0}}]\
, iov_len=4096}], msg_iovlen=1, msg_controllen=0, msg_flags=0}, 0) = 172
recvmsg(4, {msg_name={sa_family=AF_NETLINK, nl_pid=0, nl_groups=00000000}, msg_namelen=\
12, msg_iov=[{iov_base=[{{len=72, type=0x14 /* NLMSG_??? */, flags=NLM_F_MULTI, seq=149\
3052061, pid=12224}, "\n\200\200\376\1\0\0\0\24\0\1\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\1\2\
4\0\6\0"...}, {{len=72, type=0x14 /* NLMSG_??? */, flags=NLM_F_MULTI, seq=1493052061, p\
id=12224}, "\n@\200\375\2\0\0\0\24\0\1\0\376\200\0\0\0\0\0\0\32\251\5\377\376i\rX\24\0\\
6\0"...}, {{len=524296, type=0x80 /* NLMSG_??? */, flags=0, seq=393236, pid=4294967295}\
, "\377\377\377\377\32\t\0\0\32\t\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0"...}], io\
v_len=4096}], msg_iovlen=1, msg_controllen=0, msg_flags=0}, 0) = 144
recvmsg(4, {msg_name={sa_family=AF_NETLINK, nl_pid=0, nl_groups=00000000}, msg_namelen=\
12, msg_iov=[{iov_base=[{{len=20, type=NLMSG_DONE, flags=NLM_F_MULTI, seq=1493052061, p\
id=12224}, "\0\0\0\0"}, {{len=1, type=0x14 /* NLMSG_??? */, flags=NLM_F_REQUEST, seq=0,\
 pid=0}}], iov_len=4096}], msg_iovlen=1, msg_controllen=0, msg_flags=0}, 0) = 20
close(4)                                = 0
open("/etc/hosts", O_RDONLY|O_CLOEXEC)  = 4
fstat(4, {st_mode=S_IFREG|0644, st_size=3207, ...}) = 0
read(4, "# /etc/hosts: Local Host Databas"..., 4096) = 3207
read(4, "", 4096)                       = 0
close(4)                                = 0
fcntl(3, F_GETFL)                       = 0x2 (flags O_RDWR)
fcntl(3, F_SETFL, O_RDWR|O_NONBLOCK)    = 0
connect(3, {sa_family=AF_INET, sin_port=htons(15001), sin_addr=inet_addr("192.168.0.19"\
)}, 16) = -1 EINPROGRESS (Operation now in progress)
select(4, NULL, [3], NULL, {10800, 0})  = 1 (out [3], left {10799, 999997})
getsockopt(3, SOL_SOCKET, SO_ERROR, [111], [4]) = 0
close(3)                                = 0
nanosleep({0, 1000000}, NULL)           = 0
socket(AF_INET, SOCK_STREAM, IPPROTO_IP) = 3
stat("/var/spool/torque/torque.cfg", 0x7ffff28798d0) = -1 ENOENT (No such file or direc\
tory)
socket(AF_NETLINK, SOCK_RAW, NETLINK_ROUTE) = 4
bind(4, {sa_family=AF_NETLINK, nl_pid=0, nl_groups=00000000}, 12) = 0
getsockname(4, {sa_family=AF_NETLINK, nl_pid=12224, nl_groups=00000000}, [12]) = 0
sendto(4, {{len=20, type=0x16 /* NLMSG_??? */, flags=NLM_F_REQUEST|0x300, seq=149305206\
1, pid=0}, "\0\0\0\0"}, 20, 0, {sa_family=AF_NETLINK, nl_pid=0, nl_groups=00000000}, 12\
) = 20
recvmsg(4, {msg_name={sa_family=AF_NETLINK, nl_pid=0, nl_groups=00000000}, msg_namelen=\
12, msg_iov=[{iov_base=[{{len=84, type=0x14 /* NLMSG_??? */, flags=NLM_F_MULTI, seq=149\
3052061, pid=12224}, "\2\10\200\376\1\0\0\0\10\0\1\0\177\0\0\1\10\0\2\0\177\0\0\1\10\0\\
4\0\177\377\377\377"...}, {{len=88, type=0x14 /* NLMSG_??? */, flags=NLM_F_MULTI, seq=1\
493052061, pid=12224}, "\2\30\200\0\2\0\0\0\10\0\1\0\300\250\0\23\10\0\2\0\300\250\0\23\
\10\0\4\0\300\250\0\377"...}, {{len=0, type=0 /* NLMSG_??? */, flags=0, seq=0, pid=0}}]\
, iov_len=4096}], msg_iovlen=1, msg_controllen=0, msg_flags=0}, 0) = 172
recvmsg(4, {msg_name={sa_family=AF_NETLINK, nl_pid=0, nl_groups=00000000}, msg_namelen=\
12, msg_iov=[{iov_base=[{{len=72, type=0x14 /* NLMSG_??? */, flags=NLM_F_MULTI, seq=149\
3052061, pid=12224}, "\n\200\200\376\1\0\0\0\24\0\1\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\1\2\
4\0\6\0"...}, {{len=72, type=0x14 /* NLMSG_??? */, flags=NLM_F_MULTI, seq=1493052061, p\
id=12224}, "\n@\200\375\2\0\0\0\24\0\1\0\376\200\0\0\0\0\0\0\32\251\5\377\376i\rX\24\0\\
6\0"...}, {{len=524296, type=0x80 /* NLMSG_??? */, flags=0, seq=393236, pid=4294967295}\
, "\377\377\377\377\32\t\0\0\32\t\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0"...}], io\
v_len=4096}], msg_iovlen=1, msg_controllen=0, msg_flags=0}, 0) = 144
recvmsg(4, {msg_name={sa_family=AF_NETLINK, nl_pid=0, nl_groups=00000000}, msg_namelen=\
12, msg_iov=[{iov_base=[{{len=20, type=NLMSG_DONE, flags=NLM_F_MULTI, seq=1493052061, p\
id=12224}, "\0\0\0\0"}, {{len=1, type=0x14 /* NLMSG_??? */, flags=NLM_F_REQUEST, seq=0,\
 pid=0}}], iov_len=4096}], msg_iovlen=1, msg_controllen=0, msg_flags=0}, 0) = 20
close(4)                                = 0
open("/etc/hosts", O_RDONLY|O_CLOEXEC)  = 4
fstat(4, {st_mode=S_IFREG|0644, st_size=3207, ...}) = 0
read(4, "# /etc/hosts: Local Host Databas"..., 4096) = 3207
read(4, "", 4096)                       = 0
close(4)                                = 0
fcntl(3, F_GETFL)                       = 0x2 (flags O_RDWR)
fcntl(3, F_SETFL, O_RDWR|O_NONBLOCK)    = 0
connect(3, {sa_family=AF_INET, sin_port=htons(15001), sin_addr=inet_addr("192.168.0.19"\
)}, 16) = -1 EINPROGRESS (Operation now in progress)
select(4, NULL, [3], NULL, {10800, 0})  = 1 (out [3], left {10799, 999997})
getsockopt(3, SOL_SOCKET, SO_ERROR, [111], [4]) = 0
close(3)                                = 0
nanosleep({0, 1000000}, NULL)           = 0
socket(AF_INET, SOCK_STREAM, IPPROTO_IP) = 3
stat("/var/spool/torque/torque.cfg", 0x7ffff28798d0) = -1 ENOENT (No such file or direc\
tory)
socket(AF_NETLINK, SOCK_RAW, NETLINK_ROUTE) = 4
bind(4, {sa_family=AF_NETLINK, nl_pid=0, nl_groups=00000000}, 12) = 0
getsockname(4, {sa_family=AF_NETLINK, nl_pid=12224, nl_groups=00000000}, [12]) = 0
sendto(4, {{len=20, type=0x16 /* NLMSG_??? */, flags=NLM_F_REQUEST|0x300, seq=149305206\
1, pid=0}, "\0\0\0\0"}, 20, 0, {sa_family=AF_NETLINK, nl_pid=0, nl_groups=00000000}, 12\
) = 20
recvmsg(4, {msg_name={sa_family=AF_NETLINK, nl_pid=0, nl_groups=00000000}, msg_namelen=\
12, msg_iov=[{iov_base=[{{len=84, type=0x14 /* NLMSG_??? */, flags=NLM_F_MULTI, seq=149\
3052061, pid=12224}, "\2\10\200\376\1\0\0\0\10\0\1\0\177\0\0\1\10\0\2\0\177\0\0\1\10\0\\
4\0\177\377\377\377"...}, {{len=88, type=0x14 /* NLMSG_??? */, flags=NLM_F_MULTI, seq=1\
493052061, pid=12224}, "\2\30\200\0\2\0\0\0\10\0\1\0\300\250\0\23\10\0\2\0\300\250\0\23\
\10\0\4\0\300\250\0\377"...}, {{len=0, type=0 /* NLMSG_??? */, flags=0, seq=0, pid=0}}]\
, iov_len=4096}], msg_iovlen=1, msg_controllen=0, msg_flags=0}, 0) = 172
recvmsg(4, {msg_name={sa_family=AF_NETLINK, nl_pid=0, nl_groups=00000000}, msg_namelen=\
12, msg_iov=[{iov_base=[{{len=72, type=0x14 /* NLMSG_??? */, flags=NLM_F_MULTI, seq=149\
3052061, pid=12224}, "\n\200\200\376\1\0\0\0\24\0\1\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\1\2\
4\0\6\0"...}, {{len=72, type=0x14 /* NLMSG_??? */, flags=NLM_F_MULTI, seq=1493052061, p\
id=12224}, "\n@\200\375\2\0\0\0\24\0\1\0\376\200\0\0\0\0\0\0\32\251\5\377\376i\rX\24\0\\
6\0"...}, {{len=524296, type=0x80 /* NLMSG_??? */, flags=0, seq=393236, pid=4294967295}\
, "\377\377\377\377\32\t\0\0\32\t\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0"...}], io\
v_len=4096}], msg_iovlen=1, msg_controllen=0, msg_flags=0}, 0) = 144
recvmsg(4, {msg_name={sa_family=AF_NETLINK, nl_pid=0, nl_groups=00000000}, msg_namelen=\
12, msg_iov=[{iov_base=[{{len=20, type=NLMSG_DONE, flags=NLM_F_MULTI, seq=1493052061, p\
id=12224}, "\0\0\0\0"}, {{len=1, type=0x14 /* NLMSG_??? */, flags=NLM_F_REQUEST, seq=0,\
 pid=0}}], iov_len=4096}], msg_iovlen=1, msg_controllen=0, msg_flags=0}, 0) = 20
close(4)                                = 0
open("/etc/hosts", O_RDONLY|O_CLOEXEC)  = 4
fstat(4, {st_mode=S_IFREG|0644, st_size=3207, ...}) = 0
read(4, "# /etc/hosts: Local Host Databas"..., 4096) = 3207
read(4, "", 4096)                       = 0
close(4)                                = 0
fcntl(3, F_GETFL)                       = 0x2 (flags O_RDWR)
fcntl(3, F_SETFL, O_RDWR|O_NONBLOCK)    = 0
connect(3, {sa_family=AF_INET, sin_port=htons(15001), sin_addr=inet_addr("192.168.0.19"\
)}, 16) = -1 EINPROGRESS (Operation now in progress)
select(4, NULL, [3], NULL, {10800, 0})  = 1 (out [3], left {10799, 999997})
getsockopt(3, SOL_SOCKET, SO_ERROR, [111], [4]) = 0
close(3)                                = 0
open("/etc/hosts", O_RDONLY|O_CLOEXEC)  = 3
fstat(3, {st_mode=S_IFREG|0644, st_size=3207, ...}) = 0
read(3, "# /etc/hosts: Local Host Databas"..., 4096) = 3207
read(3, "", 4096)                       = 0
close(3)                                = 0
write(2, "Unable to communicate with hp24("..., 46) = 46
write(2, "Communication failure.\n", 23) = 23
write(2, "qmgr: cannot connect to server  "..., 81) = 81
exit_group(3)                           = ?
+++ exited with 3 +++

I have often had much better luck with starting over than with "fixing" things I don't understand... so if anyone can walk me through how to get torque running (from the start, as in: kernel settings, packages and use flags, permissions, configuration files/commands, etc.) then I'ld be willing to give that a go.

Anyhow, any help would be appreciated.


Last edited by dpaddy on Tue Apr 25, 2017 3:25 pm; edited 1 time in total
Back to top
View user's profile Send private message
dpaddy
Tux's lil' helper
Tux's lil' helper


Joined: 25 Jun 2008
Posts: 117

PostPosted: Mon Apr 24, 2017 10:03 pm    Post subject: Reply with quote

I can't claim to know anything about this, but
Code:
Nmap scan report for hp24 (192.168.0.19)
Host is up (0.0000090s latency).
Not shown: 998 closed ports
PORT    STATE SERVICE VERSION
22/tcp  open  ssh     OpenSSH 7.3 (protocol 2.0)
111/tcp open  rpcbind
Device type: general purpose
Running: Linux 3.X|4.X
OS CPE: cpe:/o:linux:linux_kernel:3 cpe:/o:linux:linux_kernel:4
OS details: Linux 3.8 - 4.1
Network Distance: 0 hops
I don't know what a "closed port" is, but I suspect it means that nothing happens to be listening on the ports described by "Not shown: 998 closed ports" (if not, please enlighten).

If that is the case, then perhaps the problem with qmgr (i.e., "Unable to communicate with hp24(192.168.0.19)") is simply that nothing happens to be listening on port 15001 (strace above makes reference to 15001). And if so, then might the problem be solved by having some appropriate something listen on port 15001? What might it be... more specifically, what EXACTLY should be typed before executing qmgr -c "set server acl_hosts = hp24" :?:

Then again, I probably don't know what I'm talking about.
Back to top
View user's profile Send private message
cboldt
l33t
l33t


Joined: 24 Aug 2005
Posts: 829

PostPosted: Tue Apr 25, 2017 11:52 am    Post subject: Reply with quote

Does the directory /var/spool/torque exist? What are the permissions on that directory?

Code:
ls -l /var/spool/torque


My guesses are that either torque is not starting, or that "turning off" nftables has the effect of a closed firewall. Your nmap result shows the firewall is not entirely closed or locked down.

On 192.168.0.19, you can see what ports are "listening" (independent of the firewall)

Code:
netstat -tunlp


Based on suggestions from your strace, my guess is that torque is not running at all, and there is nothing listening on port 15001

Edit to add: I don't run torque, so have no experience with setting it up. Just out of curiosity I clicked on the link that accompanies installing torque. Link not found. Sheesh. It gets better ...

Using some google-fu, I found a webpage that contains instructions for setting up torque

http://www.adaptivecomputing.com/support/documentation-index/torque-resource-manager-documentation/

There is NO DOCUMENTATION for version 4.1.x, that version is no longer serviced! I could view documentation for version 4.2, which is called to be at "End of Life." Long story short, I think you will have an uphill chore here. If I wanted to run torque, I'd unmask the 6.0.1 version by adding an entry to /etc/portage/package.keywords/package.accept_keywords (portage accepts many options for the location and name of the keywords file(s), that is where I keep my keywords).[/code]


Last edited by cboldt on Tue Apr 25, 2017 12:12 pm; edited 1 time in total
Back to top
View user's profile Send private message
dpaddy
Tux's lil' helper
Tux's lil' helper


Joined: 25 Jun 2008
Posts: 117

PostPosted: Tue Apr 25, 2017 12:10 pm    Post subject: Reply with quote

I checked basic communication over socket 15001 with the following server
Code:
// simple TCP echo server : args: cpserver <port>                                       
                                                                                       
#include <stdio.h>                                                                     
#include <unistd.h>                                                                     
#include <stdlib.h>                                                                     
#include <string.h>                                                                     
#include <netdb.h>                                                                     
#include <sys/types.h>                                                                 
#include <sys/socket.h>                                                                 
#include <netinet/in.h>                                                                 
#include <arpa/inet.h>                                                                 
                                                                                       
#define BUFSIZE 1024                                                                   
                                                                                       
#if 0                                                                                   
                                                                                       
struct in_addr  // Internet address                                                     
{                                                                                       
  unsigned int s_addr;                                                                 
};                                                                                     
                                                                                       
struct sockaddr_in // Internet style socket address                                     
{                                                                                       
  unsigned short int sin_family; // Address family                                     
  unsigned short int sin_port;   // Port number                                         
  struct in_addr     sin_addr;   // IP address                                         
  unsigned char sin_zero[...];   // Pad to size of 'struct sockaddr'                   
};                                                                                     
                                                                                       
struct hostent // DNS host entry                                                       
{                                                                                       
  char    *h_name;                // host name                                         
  char    **h_aliases;            // alias list                                         
  int     h_addrtype;             // host address type                                 
  int     h_length;               // length of address                                 
  char    **h_addr_list;          // list of addresses                                 
}                                                                                       
                                                                                       
#endif                                                                                 
                                                                                       
void error(char *msg)                                                                   
{                                                                                       
  perror(msg);                                                                         
  exit(1);                                                                             
}                                                                                       
                                                                                       
int main(int argc, char **argv)                                                         
{                                                                                       
  int parentfd, childfd;            // parent, child socket                             
  int portno;                       // port to listen on                               
  int clientlen;                    // byte size of client's address                   
  struct sockaddr_in serveraddr;    // server's addr                                   
  struct sockaddr_in clientaddr;    // client addr                                     
  struct hostent *hostp;            // client host info                                 
  char buf[BUFSIZE];                // message buffer                                   
  char *hostaddrp;                  // dotted decimal host addr string                 
  int optval;                       // flag for setsockopt                             
  int n;                            // message byte size                               
                                                                                       
  if (argc != 2) {                                                                     
    fprintf(stderr, "usage: %s <port>\n", argv[0]);                                     
    exit(1);                                                                           
  }                                                                                     
  portno = atoi(argv[1]);                                                               
  parentfd = socket(AF_INET, SOCK_STREAM, 0);                                           
  if (parentfd < 0)                                                                     
    error("ERROR opening socket");                                                     
                                                                                       
  // setsockopt: can restart server immediately after killing it                       
  optval = 1;                                                                           
  setsockopt(parentfd, SOL_SOCKET, SO_REUSEADDR, (const void *)&optval , sizeof(int)); 
                                                                                       
  // build the server's Internet address                                               
  bzero((char *) &serveraddr, sizeof(serveraddr));                                     
  serveraddr.sin_family = AF_INET;                                                     
  serveraddr.sin_addr.s_addr = htonl(INADDR_ANY); // system figures out IP address     
                                                                                       
  serveraddr.sin_port = htons((unsigned short)portno); // port to listen on             
                                                                                       
  // associate the parent socket with a port                                           
  if (bind(parentfd, (struct sockaddr *) &serveraddr, sizeof(serveraddr)) < 0)         
    error("ERROR on binding");                                                         
                                                                                       
  //make socket ready to accept connection requests (allow 5 requests)                 
  if (listen(parentfd, 5) < 0)                                                         
    error("ERROR on listen");
                                                                                       
  //make socket ready to accept connection requests (allow 5 requests)                 
  if (listen(parentfd, 5) < 0)                                                         
    error("ERROR on listen");                                                           
                                                                                       
  clientlen = sizeof(clientaddr);                                                       
  while (1) {                                                                           
    // wait for a connection request                                                   
    childfd = accept(parentfd, (struct sockaddr *) &clientaddr, &clientlen);           
    if (childfd < 0)                                                                   
      error("ERROR on accept");                                                         
                                                                                       
    // determine who sent the message                                                   
    hostp = gethostbyaddr((const char *)&clientaddr.sin_addr.s_addr,                   
                          sizeof(clientaddr.sin_addr.s_addr), AF_INET);                 
    if (hostp == NULL)                                                                 
      error("ERROR on gethostbyaddr");                                                 
                                                                                       
    hostaddrp = inet_ntoa(clientaddr.sin_addr);                                         
                                                                                       
    if (hostaddrp == NULL)                                                             
      error("ERROR on inet_ntoa\n");                                                   
    printf("server established connection with %s (%s)\n", hostp->h_name, hostaddrp);   
                                                                                       
    // read input string from the client                                               
    bzero(buf, BUFSIZE);                                                               
    if ((n = read(childfd, buf, BUFSIZE)) < 0)                                         
      error("ERROR reading from socket");                                               
                                                                                       
    printf("server received %d bytes: %s", n, buf);                                     
                                                                                       
    // echo the input back to client                                                   
    if ((n = write(childfd, buf, strlen(buf))) < 0)                                     
      error("ERROR writing to socket");                                                 
                                                                                       
    close(childfd);                                                                     
  }                                                                                     
  return 0;                                                                             
}                                                                                       
                                                                                       
/*                                                                                     
gcc -o tcps tcpserver.c                                                                 
./tcps 15001                                                                           
                                                                                       
*/
and client
Code:
 // simple TCP client : args : <host> <port>                                             
#include <stdio.h>                                                                     
#include <stdlib.h>                                                                     
#include <string.h>                                                                     
#include <unistd.h>                                                                     
#include <sys/types.h>                                                                 
#include <sys/socket.h>                                                                 
#include <netinet/in.h>                                                                 
#include <netdb.h>                                                                     
                                                                                       
#define BUFSIZE 1024                                                                   
                                                                                       
void error(char *msg)                                                                   
{                                                                                       
  perror(msg);                                                                         
  exit(0);                                                                             
}                                                                                       
                                                                                       
int main(int argc, char **argv)                                                         
{                                                                                       
  int sockfd, portno, n;                                                               
  struct sockaddr_in serveraddr;                                                       
  struct hostent *server;                                                               
  char *hostname;                                                                       
  char buf[BUFSIZE];                                                                   
                                                                                       
  if (argc != 3) {                                                                     
    fprintf(stderr,"usage: %s <hostname> <port>\n", argv[0]);                           
    exit(0);                                                                           
  }                                                                                     
  hostname = argv[1];                                                                   
  portno = atoi(argv[2]);                                                               
                                                                                       
  sockfd = socket(AF_INET, SOCK_STREAM, 0); // create socket                           
                                                                                       
  if (sockfd < 0)                                                                       
    error("ERROR opening socket");                                                     
                                                                                       
  server = gethostbyname(hostname); // get server's DNS entry                           
                                                                                       
  if (server == NULL) {                                                                 
    fprintf(stderr,"ERROR, no such host as %s\n", hostname);                           
    exit(0);                                                                           
  }                                                                                     
                                                                                       
  // server's internet address                                                         
  bzero((char *) &serveraddr, sizeof(serveraddr));                                     
  serveraddr.sin_family = AF_INET;                                                     
  bcopy((char *)server->h_addr,                                                         
        (char *)&serveraddr.sin_addr.s_addr, server->h_length);                         
  serveraddr.sin_port = htons(portno);                                                 
                                                                                       
  // create connection with server                                                     
  if (connect(sockfd, (const struct sockaddr *) &serveraddr, sizeof(serveraddr)) < 0)   
    error("ERROR connecting");                                                         
                                                                                       
  // get message line from user                                                         
  printf("Please enter msg: ");                                                         
  bzero(buf, BUFSIZE);                                                                 
  fgets(buf, BUFSIZE, stdin);                                                           
                                                                                       
  // send message to server                                                             
  n = write(sockfd, buf, strlen(buf));                                                 
  if (n < 0)                                                                           
    error("ERROR writing to socket");                                                   
                                                                                       
  // print server's reply                                                               
  bzero(buf, BUFSIZE);                                                                 
  n = read(sockfd, buf, BUFSIZE);                                                       
  if (n < 0)                                                                           
    error("ERROR reading from socket");                                                 
  printf("Echo from server: %s", buf);                                                 
  close(sockfd);                                                                       
  return 0;                                                                             
}                                                                                       
/*                                                                                     
gcc -o tcpc tcpclient.c                                                                 
./tcpc hp24 15001                                                                       
                                                                                       
*/
Communication works fine
Code:
hp24 ~/c $ ./tcps 15001
server established connection with hp24 (192.168.0.19)
server received 4 bytes: foo
server established connection with hp24 (192.168.0.19)
server received 4 bytes: bar
server established connection with hp24 (192.168.0.19)
server received 4 bytes: baz
^C

I conjecture it may be permissions / kernel settings (I wish someone could tell me what is needed or point me to documentation which actually does explicitly describe what is required) or perhaps the problem is borked configuration files...

I have reinstalled torque but no change in behavior, which leads me to ask: does anyone know the complete list of files/directories that should be removed to get me to a clean pre-torque state? Certainly emerge -C torque does not clean up the crap sprinkled about the file system by the torque install.

I have a spare machine on which I could do a fresh gentoo istall, clone the disk, emerge torque, and then diff the file structures (pre vs post torque) but there certainly must be a method of which I am unaware that is commonly used to really uninstall -- as in remove all the crap sprinkled about the file system by emerge --config ... :roll:
Back to top
View user's profile Send private message
cboldt
l33t
l33t


Joined: 24 Aug 2005
Posts: 829

PostPosted: Tue Apr 25, 2017 12:24 pm    Post subject: Reply with quote

There is no need to do a complete gentoo install just to deal with a single package or a few packages.

I suspect the collection of files installed by that emerge --config invocation are in fairly localized areas, which should be pretty easy to find and wipe out.

Code:
ls -rtl /etc
ls -rtl /var/spool


I'm guessing there is a /etc/torque (or similar), and it will be one of the newer directories
Based on the strace output, there is supposed to be a /var/spool/torque

Also, look at the contents of /usr/portage/sys-cluster/torque/torque-4.1.7-r1.ebuild
especially the "pkg_config()" subroutine. It appears to me that the --config operation works in ${PBS_SERVER_HOME}, and there is enough detain in the work there, that if I didn't know what that location was, I could find it.[/code]
Back to top
View user's profile Send private message
dpaddy
Tux's lil' helper
Tux's lil' helper


Joined: 25 Jun 2008
Posts: 117

PostPosted: Tue Apr 25, 2017 12:34 pm    Post subject: Reply with quote

Cboldt:

My communication check was with the firewall up (I included rules to rules allow torque related communication).
With the firewall down, my communication check works as well.

Code:
# ls -l /var/spool/torque
total 52
drwxr-xr-x  2 root root 4096 Apr 24 08:28 aux
drwxr-xr-x  2 root root 4096 Apr 24 08:28 checkpoint
drwxr-xr-x  2 root root 4096 Apr 24 07:36 client_logs
drwxr-xr-x  2 root root 4096 Apr 24 08:28 job_logs
drwxr-xr-x  2 root root 4096 Apr 24 08:28 mom_logs
drwxr-xr-x  3 root root 4096 Apr 24 08:28 mom_priv
drwxr-xr-x  2 root root 4096 Apr 24 08:28 sched_logs
drwxr-xr-x  3 root root 4096 Apr 24 08:28 sched_priv
drwxr-xr-x  2 root root 4096 Apr 24 08:28 server_logs
-rw-r--r--  1 root root    5 Apr 24 08:28 server_name
drwxr-xr-x 13 root root 4096 Apr 24 08:28 server_priv
drwxr-xr-x  2 root root 4096 Apr 24 08:28 spool
drwxr-xr-x  2 root root 4096 Apr 24 08:28 undelivered

Code:
# netstat -tunlp
Active Internet connections (only servers)
Proto Recv-Q Send-Q Local Address           Foreign Address         State       PID/Program name   
tcp        0      0 0.0.0.0:22              0.0.0.0:*               LISTEN      1527/sshd           
tcp        0      0 0.0.0.0:54407           0.0.0.0:*               LISTEN      1500/rpc.statd     
tcp        0      0 0.0.0.0:111             0.0.0.0:*               LISTEN      1475/rpcbind       
tcp6       0      0 :::22                   :::*                    LISTEN      1527/sshd           
tcp6       0      0 :::111                  :::*                    LISTEN      1475/rpcbind       
udp        0      0 0.0.0.0:68              0.0.0.0:*                           1381/dhcpcd         
udp        0      0 0.0.0.0:111             0.0.0.0:*                           1475/rpcbind       
udp        0      0 0.0.0.0:33298           0.0.0.0:*                           1500/rpc.statd     
udp        0      0 0.0.0.0:802             0.0.0.0:*                           1475/rpcbind       
udp        0      0 127.0.0.1:828           0.0.0.0:*                           1500/rpc.statd     
udp6       0      0 :::111                  :::*                                1475/rpcbind       
udp6       0      0 :::802                  :::*                                1475/rpcbind

I agree that it seems something needs to be listening on port 15001 ... when I run the simple client test code without the server test code running I get "ERROR connecting: Connection refused" (i.e., the connect call fails).

Yes, I noticed early on that the link was missing, there is no documentation for version 4.1.x, and 4.2 is "End of Life." I had considered 6.0.1 but one aspect of this is to achieve security / stability, so I was saving the unmask experiment as a last resort.
Back to top
View user's profile Send private message
cboldt
l33t
l33t


Joined: 24 Aug 2005
Posts: 829

PostPosted: Tue Apr 25, 2017 12:57 pm    Post subject: Reply with quote

The issue is just getting torque to start, which entails at least some sort of minimal configuration. That --config routine looks like it is doing that minimalist configuration work, one server, one client, and those two are the same machine. I believe the ONLY place that --config works is in the /var/spool/torque branch.

Aside from the --config routine, the basic install puts material in /etc/pbs_environment and adds a /etc/env.d/25torque file. If your USEflag collection includes "munge," there will also be an /etc/conf.d/torque file

Torque installs a collection of /etc/init.d files too -- I'd try to get configuration done with a combination of reading instructions and attempting to start the services.

/etc/init.d/pbs_server # if USEflag "server" was set
/etc/init.d/pbs_sched # if USEflag "server" was set
/etc/init.d/pbs_mom
/etc/init.d/trqauthd

You are installing torque on two machines, from what I gather (you tested communications between two machines to confirm the firewall was not interfering with communications), so the task includes configuration for a client, and for a server.

There is likely some sort of "cooperation" among the init.d files, where one depends on another, so when you get around to automating the startup of torque (at boot), you won't have to add all four services.
Back to top
View user's profile Send private message
cboldt
l33t
l33t


Joined: 24 Aug 2005
Posts: 829

PostPosted: Tue Apr 25, 2017 1:10 pm    Post subject: Reply with quote

Security and stability usually improve as a package is advanced. The typical downside is increase in bloat. One problem with sticking to the 4.1 version is total absence of instructions. At least the authors have instructions on the web for the 4.2 version, and later.

Even the 6.0.1 version is not the latest, last revised about a year ago, with action moving on to a 6.1 series of releases.

The gentoo maintainer isn't keeping up with the upstream action. I'm not throwing stones, just making the observation. The fact that the site of install instructions for the 4.1 version does not exist, and that upstream doesn't support this version should result in the 4.1 ebuild being removed from the tree. But there may be issues that make the 4.2 and 6.0 versions "slow to stabilize." I believe that is pretty common with packages that don't have a substantial number of users.
Back to top
View user's profile Send private message
dpaddy
Tux's lil' helper
Tux's lil' helper


Joined: 25 Jun 2008
Posts: 117

PostPosted: Tue Apr 25, 2017 1:33 pm    Post subject: Reply with quote

w00t !

Changing permissions
Code:
# ls -l /var/spool/torque/
total 52
drwxr-xr-x  2 root root 4096 Apr 24 08:28 aux
drwxr-xr-x  2 root root 4096 Apr 24 08:28 checkpoint
drwxr-xr-x  2 root root 4096 Apr 25 05:26 client_logs
drwxr-xr-x  2 root root 4096 Apr 24 08:28 job_logs
drwxr-xr-x  2 root root 4096 Apr 24 08:28 mom_logs
drwxr-xr-x  3 root root 4096 Apr 24 08:28 mom_priv
drwxr-xr-x  2 root root 4096 Apr 24 08:28 sched_logs
drwxr-xr-x  3 root root 4096 Apr 24 08:28 sched_priv
drwxr-xr-x  2 root root 4096 Apr 25 05:15 server_logs
-rw-r--r--  1 root root    5 Apr 24 08:28 server_name
drwxr-xr-x 13 root root 4096 Apr 25 05:26 server_priv
drwxrwxrwt  2 root root 4096 Apr 24 08:28 spool
drwxrwxrwt  2 root root 4096 Apr 24 08:28 undelivered

and now
Code:
  # emerge --config torque


Configuring pkg...

 * Configuring Torque ...
 * Using /var/spool/torque as the pbs homedir
 * Using hp24 as the pbs_server
 * Previous Torque configuration detected.  Press Enter to
 * continue or Control-C to abort now
                                                                                 [ ok ]
Seems config worked (well, at least did not complain).

Don't know if I'm out of the weeds, but so far so good. :wink:
Back to top
View user's profile Send private message
cboldt
l33t
l33t


Joined: 24 Aug 2005
Posts: 829

PostPosted: Tue Apr 25, 2017 1:45 pm    Post subject: Reply with quote

IIRC, --config tries to write to /var/spool/torque/spool, and the previous permissions allowed only the user root to do that.

FWIW, if you want to check permissions on a directory (e.g., /var/spool/torque/spool), and the file list for the parent directory is lengthy (that is, if there are many entries in `ls /var/spool/torque`), use the "stat" command against the target instead of "ls -l" against the parent.

I was initially wondering about the permissions of /var/spool/torque itself (and I still don't know what those are, but I don't care any more, based on seeing a mess of files and directories in /var/spool/torque), and if I still though those permissions could be a problem, I'd ask to see the output of `stat /var/spool/torque`

What caused the change, by the way? Did you unmask a later version of torque?
Back to top
View user's profile Send private message
dpaddy
Tux's lil' helper
Tux's lil' helper


Joined: 25 Jun 2008
Posts: 117

PostPosted: Tue Apr 25, 2017 1:56 pm    Post subject: Reply with quote

I made the change, based on errors reported in /var/log/messages ... I was suspicious that permissions were somehow related. :idea:
Back to top
View user's profile Send private message
cboldt
l33t
l33t


Joined: 24 Aug 2005
Posts: 829

PostPosted: Tue Apr 25, 2017 2:00 pm    Post subject: Reply with quote

Worth observing that PBS_Server writes messages to /var/log/messages

There may be a way to change the log_level, in case there is a problem getting some other part of torque to run the way you want it to.
Back to top
View user's profile Send private message
Display posts from previous:   
Reply to topic    Gentoo Forums Forum Index Portage & Programming All times are GMT
Page 1 of 1

 
Jump to:  
You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot vote in polls in this forum