Hi Ken and All,<br><br>i've solved the problem with my openmpi build. <br>below are an example of it.<br><br>yewyong@vrc1:~/installer/mpi_test> mpirun -np 14 -machinefile quad ./hello_parallel<br>hello_parallel.cc: Number of tasks=14 My rank=5 My name=vrcwr220.<br>
hello_parallel.cc: Number of tasks=14 My rank=7 My name=pc249.<br>hello_parallel.cc: Number of tasks=14 My rank=9 My name=pc205.<br>hello_parallel.cc: Number of tasks=14 My rank=1 My name=vrc1.<br>hello_parallel.cc: Number of tasks=14 My rank=3 My name=vrc1.<br>
hello_parallel.cc: Number of tasks=14 My rank=10 My name=pc226.<br>hello_parallel.cc: Number of tasks=14 My rank=13 My name=vrcwr228.<br>hello_parallel.cc: Number of tasks=14 My rank=0 My name=vrc1.<br>hello_parallel.cc: Number of tasks=14 My rank=4 My name=vrcwr220.<br>
hello_parallel.cc: Number of tasks=14 My rank=2 My name=vrc1.<br>hello_parallel.cc: Number of tasks=14 My rank=8 My name=pc205.<br>hello_parallel.cc: Number of tasks=14 My rank=6 My name=pc249.<br>hello_parallel.cc: Number of tasks=14 My rank=12 My name=vrcwr228.<br>
hello_parallel.cc: Number of tasks=14 My rank=11 My name=pc226.<br><br>however, my problem still exist. my paraview doesn't seems to give any response when i'm trying to connect<br>to the pvserver.<br><u><br>Starts the pvserver</u><br>
yewyong@vrc1:~/installer/mpi_test> mpirun -np 14 -machinefile quad pvserver --use-offscreen-rendering<br>Listen on port: 11111<br>Waiting for client...<br>Client connected.<br><br><u>And the client shows....</u><br>yewyong@pc226:~> paraview<br>
Waiting for server...<br><br>and the paraview "hangs" with a "choose server" window. (no buttons are functioning)<br>i had to go to the konsole to "ctrl+c" to actually exit the paraview.<br><br>
I tried having different sequence in machinefile to check whether it's the problem with different IP domain<br><br><u>my machine file (quad):</u><br>202.185.77.110 slots=4 max-slots=4 (master node with 2 ethernet port; another is 192.168 seires)<br>
192.168.0.220 slots=2 max-slots=2<br>192.168.0.205 slots=2 max-slots=2<br>192.168.0.249 slots=2 max-slots=2<br>202.185.77.219 slots=2 max-slots=2<br>202.185.77.220 slots=2 max-slots=2<br><br>as per my findings, paraview will have problem connecting to pvserver whenever pvserver is build from nodes <br>
from different IP domain.<br><br>ps, even with hamachi (vpn), when each nodes are having 5.xx.xx.xx series as IP, the problem persists.<br><br>sorry for the lengthy "story telling" mail.<br><br>appreciate all the help i can get.<br>
<br>Thanks in advance.<br><br>yewyong<br><br><br><br><br><div class="gmail_quote">On Thu, Oct 8, 2009 at 1:22 AM, yewyong <span dir="ltr"><<a href="mailto:uyong81@gmail.com">uyong81@gmail.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">
Hi Ken,<br><br>Thanks for the swift reply, and for the reminder.<br>forgot to try out the mpirun before trying it out in pvserver.<br>tried a very simple "hello world" mpi job today. (to no avail)<br><br>at first, i thought it was because of the difference of 192 and 202 domain which causes the pause, then i applied hamachi (vpn tool) to all the nodes, giving them a uniform LAN domain IPs, but still the pauses happens when reaching np > 10. <br>
<br>that all boils down to my openmpi build? but i am able to run >10 nps pvserver with the same openmpi build when having the nodes when in a LAN environment (not using hamachi)..<br><br>no worries, i'm using the cvs version of the paraview build. (it states "paraview 3.7 development" at the title window)<br>
<br><br>thanks again for all your advice in advance.<div><div></div><div class="h5"><br><br><div class="gmail_quote">On Tue, Oct 6, 2009 at 10:06 PM, Moreland, Kenneth <span dir="ltr"><<a href="mailto:kmorel@sandia.gov" target="_blank">kmorel@sandia.gov</a>></span> wrote:<br>
<blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">
<div>
<font face="Calibri, Verdana, Helvetica, Arial"><span style="font-size: 11pt;">I count 10 cores in the 192.168 domain. My suspicion is that MPI is having a problem when running with more than 10 processes and therefore having to run on nodes in both 192.168 and 202.185. <br>
<br>
My advice is to first make sure that you can run any MPI job across all the nodes in your makeshift cluster. Do this using a test MPI program that is simpler than pvserver. Most MPI implementations come with a set of tests. If you can’t find any on your system, there are many available for download; here is a good site: <a href="http://www.mcs.anl.gov/research/projects/mpi/mpi-test/tsuite.html" target="_blank">http://www.mcs.anl.gov/research/projects/mpi/mpi-test/tsuite.html</a><br>
<br>
By the way, are you really running a program called pvclient? There has been no such executable since ParaView 2.6, which was released 2 years ago. If that is the case, I recommend updated to the latest version of ParaView.<br>
<br>
-Ken<div><div></div><div><br>
<br>
<br>
On 10/6/09 6:22 AM, "yewyong" <<a href="http://uyong81@gmail.com" target="_blank">uyong81@gmail.com</a>> wrote:<br>
<br>
</div></div></span></font><div><div></div><div><blockquote><font face="Calibri, Verdana, Helvetica, Arial"><span style="font-size: 11pt;">Hi all,<br>
<br>
am trying to utilize some desktops in our lab (say "lab A") to visualize some virtual heart.<br>
on top of that, there are also some unused desktop from other lab ("lab B") which i plan to add in as the server.<br>
below are my "hostfile" used during mpirun.<br>
<br>
192.168.0.10 slots=4 max-slots=4<br>
192.168.0.220 slots=2 max-slots=2<br>
192.168.0.205 slots=2 max-slots=2<br>
192.168.0.249 slots=2 max-slots=2<br>
202.185.77.219 slots=2 max-slots=2<br>
202.185.77.220 slots=2 max-slots=2<br>
202.185.77.221 slots=2 max-slots=2<br>
<br>
as we can see, the 192.168... series are from lab A and the 202.185.. series are from lab B.<br>
the experiment started off fine as i'm also using a desktop from lab A (192.168.. series) to connect to the server.<br>
when we reach np 11, the paraview (pvclient) fails to response when we try to establish a connection with the server.<br>
<br>
are we missing out anything here?<br>
<br>
appreciate all your response and answer in advance <br>
<br>
Thanks<br>
<br>
yewyong<br>
<br>
<br>
<br>
</span></font></blockquote><font face="Calibri, Verdana, Helvetica, Arial"><span style="font-size: 11pt;"><br>
</span></font></div></div><font size="2"><font face="Consolas, Courier New, Courier"><span style="font-size: 10pt;"><br>
**** Kenneth Moreland<br>
*** Sandia National Laboratories<br>
*********** <br>
*** *** *** email: <a href="http://kmorel@sandia.gov" target="_blank">kmorel@sandia.gov</a><br>
** *** ** phone: (505) 844-8919<br>
*** web: <a href="http://www.cs.unm.edu/%7Ekmorel" target="_blank">http://www.cs.unm.edu/~kmorel</a><br>
</span></font></font><font face="Calibri, Verdana, Helvetica, Arial"><span style="font-size: 11pt;"><br>
</span></font>
</div>
</blockquote></div><br>
</div></div></blockquote></div><br>