According to IBM's BlueGene development manual that Andrew Siegel posting
a link to recently, client TCP calls are supported in CNK. I've been
trying to get a CNK to connect to a socket service on another machine, but
I've had no luck. The goal is to interact with a running job on BGL.
I've been using a simple test client/server setup (the code's in
bgl:/home/chad/src/commtest/). Node 0 on BGL (the client) creates a socket
and tries to connect to my server process to send a "hello world" message.
The server process just listens on a particular TCP port and prints to
stdout whatever it hears. I've been using TCP port 65002.
Here's what I've tried:
-----
1. I tried connecting directly to my server running on terra.mcs.anl.gov,
but it didn't work. Since I have no idea what the network topology
involved looks like, I haven't spent a lot of time on this case. When I
try this, I get "Connection timed out" error.
-----
2. I tried using SSH to forward port 65002 from login2.bgl to
terra.mcs.anl.gov:65002, but it didn't work. When the node tries to
connect to port 65002 on the login node, it gets "Connection refused."
I established the tunnel from login2 like this: 'ssh -Ax -L
65002:terra.mcs.anl.gov:65002 -N terra.mcs.anl.gov'
I tried the following from a shell on login2 after starting the tunnel:
2.a. login2 localhost: 'telnet 127.0.0.1 65002' works fine.
The message shows up on the terra.mcs server.
2.b. login2 eth1: 'telnet 140.221.80.5 65002' fails. Connection refused.
2.c. login2 eth0: 'telnet 172.30.1.102 65002' fails. Connection refused.
So SSH port forwarding is apparently useless because connections to the
login node's externally-visible IP addresses are refused, even on a
non-privileged port. 127.0.0.1 is obviously not very useful.
-----
3. I can't try connecting directly to the server process running on a bgl
login node, because apparently I'm not allowed to create a socket:
chad@login2:~/src/commtest> ./daemon
wrappers.c: Socket(): socket(): Operation not permitted
-----
4. I wanted to try talking to an existing TCP service on a login node, but
echo, telnet, and http are all blocked.
-----
If anybody has any suggestions for something else I can try, please let me
know.
Thanks,
ccg
- --------------------------------------------------------------------
To add or remove yourself from this mailing list, use the 'notifyme'
command on any BGL machine. To remove: notifyme -n, to add: notifyme -y.