how to use mpich2 on multicore PC under MSWin
Posted: Mon Aug 31, 2009 11:19 pm
Hardware: standalone quad core PC.
Software:
(1) NEURON 7.1 (315:dbb6f14221c5) 2009-06-08 (most recent alpha installer for MSWin) running under Vista.
(2) Modified test0.hoc (anything to do with hostname has been omitted, since system("hostname", s) doesn't work on this machine under Vista, even though hostname works properly when called directly from the cygwin command line).
Code: Select all
objref pc
pc = new ParallelContext()
{
printf("There are %d processes. My rank is %d but I don't know where I am.\n", pc.nhost, pc.id)
}
{
pc.runworker()
pc.done()
}
quit()
In this setting, mpdboot seems to be irrelevant: it complains "invalid port info: no_port" and doesn't start mpd. But
mpd &
succeeds as indicated by the fact that mpdtrace -l returns
loki_49384 (192.168.1.7)
which satisfies the minimal criterion according to
viewtopic.php?f=31&t=1761#p6240
and mpiexec with np=1 does indeed work:
Code: Select all
c:\nrn71\bin\bash-3.2$ mpiexec -np 1 nrniv -mpi test0.hoc
numprocs=1
NEURON -- VERSION 7.1 (315:dbb6f14221c5) 2009-06-08
Duke, Yale, and the BlueBrain Project -- Copyright 1984-2008
See http://www.neuron.yale.edu/credits.html
There are 1 processes. My rank is 0 but I don't know where I am.
However, np>1 doesn't work:
Code: Select all
c:\nrn71\bin\bash-3.2$ mpiexec -np 2 nrniv -mpi test0.hoc
Fatal error in MPI_Init: Other MPI error, error stack:
MPIR_Init_thread(294).............: Initialization failed
MPID_Init(93).....................: channel initialization failed
MPIDI_CH3_Init(43)................:
MPID_nem_init(176)................:
MPID_nem_seg_create(103)..........:
MPID_nem_remove_shared_memory(529): unable to remove shared memory - unlink Permission denied
rank 0 in job 2 loki_49384 caused collective abort of all ranks
exit status of rank 0: return code 1
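For anyone who wants to repeat these checks, here is a minimal shell sketch of the sequence above: confirm that mpdtrace -l reports an entry of the form host_port (ip), then try mpiexec with a given np. The function names parse_mpd_entry, ring_is_up and try_np are hypothetical helpers made up for this illustration; the only external commands used are the ones already shown in this post (mpdtrace -l and mpiexec -np N nrniv -mpi test0.hoc). It assumes mpd has already been started with mpd & and it does nothing to fix the shared memory error.

```shell
# Sketch only: the helper names below are hypothetical, not part of MPICH2 or NEURON.

# Split one line of `mpdtrace -l` output into "host port ip".
# "loki_49384 (192.168.1.7)" -> "loki 49384 192.168.1.7"
# Returns non-zero if the line does not look like host_port (ip).
parse_mpd_entry() {
    parsed=$(printf '%s\n' "$1" |
        sed -n 's/^\(..*\)_\([0-9][0-9]*\) (\([0-9.][0-9.]*\))$/\1 \2 \3/p')
    [ -n "$parsed" ] || return 1
    printf '%s\n' "$parsed"
}

# Succeeds if the first line reported by mpdtrace -l is a well-formed entry.
ring_is_up() {
    first_entry=$(mpdtrace -l 2>/dev/null | head -n 1)
    parse_mpd_entry "$first_entry" >/dev/null
}

# Run the test file with the given process count and report the exit status.
# Second argument is the hoc file; it defaults to test0.hoc.
try_np() {
    np_status=0
    mpiexec -np "$1" nrniv -mpi "${2:-test0.hoc}" || np_status=$?
    echo "np=$1 exit status $np_status"
    return "$np_status"
}
```

After sourcing this file in the bash shell, something like ring_is_up && try_np 1 && try_np 2 walks through the same steps as above and stops at the first one that fails.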