<div dir="ltr">Hi Hong,<div><br></div><div>> 1. It appears IceT-based image compositing for 18k cores takes such a long time that it becomes unpractical to output images in-situ. </div><div>> Specifically, in our case, it takes about 14 minutes for coprocessing for one time point that output a composited image while simulation </div>
<div>> alone for one time point only takes about 7 seconds. I have also done a simulation run with in-situ visualization on Titan with 64 cores on a </div><div>> much lower resolution mesh (10 million element mesh as opposed on 167 million element mesh for 18k core run), in which case </div>
<div>> coprocessing with image output for 64 cores takes about 25 seconds. Question: is there any way to improve performance of image </div><div>> compositing for 18k cores for in-situ visualization?<br></div><div>
<br>
</div><div>This doesn't make a lot of sense. Image compositing performance is not strongly tied to the number of polygons; it depends much more on the number of cores and the image size. So 64 cores with small data should not perform so much better than 18K cores with large data. Since IceT takes bounding boxes into account when compositing, there may be performance gains when rendering less geometry, but not to the extent that you are describing.</div>
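<div><br></div><div>To make that scaling argument a bit more concrete, here is a rough back-of-the-envelope sketch. It assumes an idealized binary-swap compositing scheme with no compression and no bounding-box culling, which is not what IceT actually does internally. The function name, the 1024x1024 image size, and the 8 bytes per pixel are illustrative assumptions, and 16384 stands in for the 18K-core run only because the idealized scheme needs a power of two. The point is only that the estimate has no polygon-count term at all.</div>

```python
def binary_swap_cost(num_ranks, width, height, bytes_per_pixel=8):
    """Estimate (rounds, bytes sent per rank) for an idealized binary swap.

    Hypothetical helper for illustration only: assumes num_ranks is a
    power of two and ignores compression and empty-pixel culling.
    """
    rounds = max(num_ranks, 1).bit_length() - 1
    if num_ranks != 2 ** rounds:
        raise ValueError("num_ranks must be a power of two")
    pixels = width * height
    # In round k (1-based) each rank sends half of the piece it still owns,
    # i.e. pixels / 2**k, and keeps the other half.
    sent_pixels = sum(pixels / 2 ** k for k in range(1, rounds + 1))
    return rounds, sent_pixels * bytes_per_pixel


# Illustrative image size; the real image size of the run is not known here.
for ranks in (64, 16384):
    rounds, sent = binary_swap_cost(ranks, 1024, 1024)
    print("%6d ranks: %2d rounds, %.2f MiB sent per rank"
          % (ranks, rounds, sent / 2 ** 20))
```

<div>Under these assumptions the per-rank traffic is nearly the same for both rank counts and only the number of rounds grows, which is why a jump from seconds to minutes is hard to attribute to compositing alone.</div>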
<div><br></div><div>On the other hand, I can see Mesa rendering performance being an issue. The 18K run probably has significantly more polygons per MPI rank, especially if the polygons are not distributed somewhat evenly. This is definitely worth investigating. Do you have cycles to run a few more cases? We can instrument things a bit better to see what is taking so much time.</div>
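<div><br></div><div>As a starting point for that kind of instrumentation, here is a minimal sketch of per-stage timers that could be wrapped around the phases of a coprocessing step. The class name, the stage names, and the <code>time.sleep</code> calls are placeholders and not part of the Catalyst API. In a real run each rank's totals would also need to be reduced across MPI ranks (for example, taking the maximum) to expose load imbalance.</div>

```python
import time
from collections import defaultdict
from contextlib import contextmanager


class StageTimer:
    """Accumulates wall-clock time per named stage (hypothetical helper)."""

    def __init__(self):
        self.totals = defaultdict(float)
        self.counts = defaultdict(int)

    @contextmanager
    def stage(self, name):
        start = time.perf_counter()
        try:
            yield
        finally:
            # Record the elapsed time even if the stage raised an exception.
            self.totals[name] += time.perf_counter() - start
            self.counts[name] += 1

    def report(self):
        # Slowest stages first.
        ordered = sorted(self.totals.items(), key=lambda kv: -kv[1])
        return "\n".join(
            "%-12s %8.3f s over %d call(s)" % (name, total, self.counts[name])
            for name, total in ordered
        )


timer = StageTimer()
with timer.stage("render"):
    time.sleep(0.02)   # placeholder for the local (Mesa) rendering work
with timer.stage("composite"):
    time.sleep(0.01)   # placeholder for the image compositing work
print(timer.report())
```

<div>Comparing the "render" and "composite" totals on the slowest rank would show which of the two dominates the 14 minutes.</div>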
<div><br></div><div>> 2. I also tried to avoid image output, but output polydata extracts using XMLPPolyDataWriter instead on 18k cores. In this case, in-situ </div><div>> coprocessing only takes about 20 seconds (compared to 14 minutes with image output). However, too many files are generated to a point </div>
<div>> that breaks the hard limit on maximal number of files in a directory since the parallel writer writes a vtp file from each of 18k cores. So the </div><div>> output data files have to be broken up into different directories. However, I got “cannot find file” error when I put a directory name as a <br>
> parameter in coprocessor.CreateWriter() function call in my python script. I tried initially to put “data/vorticity_%t.pvtp” as a parameter, but it </div><div>> fails with “cannot find file” error. Not sure whether this is a bug or I need to put absolute full path in rather than a relative path to the current </div>
<div>> directory. Another question is whether there are ways to composite these files generated from different cores into one single file while doing </div><div>> coprocessing so only one composite file is generated rather than a huge number of files when running on large number of cores.</div>
<div><br></div><div>We are working on ADIOS-based readers and writers that will allow writing to a single bp file. These should be ready sometime in January and should make things much better.</div><div><br></div><div>
-berk</div><div> <br></div>On Tue, Nov 26, 2013 at 10:31 AM, Hong Yi <<a href="mailto:hongyi@renci.org">hongyi@renci.org</a>> wrote:<br>><br>> I have done several simulation runs linked with ParaView Catalyst for in-situ visualization on Titan with 18k cores and have the following observations/questions hoping to seek input from this list.<br>
><br>> <br>><br>> 1. It appears IceT-based image compositing for 18k cores takes such a long time that it becomes unpractical to output images in-situ. Specifically, in our case, it takes about 14 minutes for coprocessing for one time point that output a composited image while simulation alone for one time point only takes about 7 seconds. I have also done a simulation run with in-situ visualization on Titan with 64 cores on a much lower resolution mesh (10 million element mesh as opposed on 167 million element mesh for 18k core run), in which case coprocessing with image output for 64 cores takes about 25 seconds. Question: is there any way to improve performance of image compositing for 18k cores for in-situ visualization?<br>
><br>> 2. I also tried to avoid image output, but output polydata extracts using XMLPPolyDataWriter instead on 18k cores. In this case, in-situ coprocessing only takes about 20 seconds (compared to 14 minutes with image output). However, too many files are generated to a point that breaks the hard limit on maximal number of files in a directory since the parallel writer writes a vtp file from each of 18k cores. So the output data files have to be broken up into different directories. However, I got “cannot find file” error when I put a directory name as a parameter in coprocessor.CreateWriter() function call in my python script. I tried initially to put “data/vorticity_%t.pvtp” as a parameter, but it fails with “cannot find file” error. Not sure whether this is a bug or I need to put absolute full path in rather than a relative path to the current directory. Another question is whether there are ways to composite these files generated from different cores into one single file while doing coprocessing so only one composite file is generated rather than a huge number of files when running on large number of cores. <br>
><br>> Thanks for any input, suggestions, and comments!<br>><br>> <br>><br>> Regards,<br>><br>> Hong<br></div>