forked from openkylin/openmpi
4927 lines
234 KiB
Plaintext
4927 lines
234 KiB
Plaintext
|
Copyright (c) 2004-2010 The Trustees of Indiana University and Indiana
|
||
|
University Research and Technology
|
||
|
Corporation. All rights reserved.
|
||
|
Copyright (c) 2004-2006 The University of Tennessee and The University
|
||
|
of Tennessee Research Foundation. All rights
|
||
|
reserved.
|
||
|
Copyright (c) 2004-2008 High Performance Computing Center Stuttgart,
|
||
|
University of Stuttgart. All rights reserved.
|
||
|
Copyright (c) 2004-2006 The Regents of the University of California.
|
||
|
All rights reserved.
|
||
|
Copyright (c) 2006-2022 Cisco Systems, Inc. All rights reserved.
|
||
|
Copyright (c) 2006 Voltaire, Inc. All rights reserved.
|
||
|
Copyright (c) 2006 Sun Microsystems, Inc. All rights reserved.
|
||
|
Use is subject to license terms.
|
||
|
Copyright (c) 2006-2018 Los Alamos National Security, LLC. All rights
|
||
|
reserved.
|
||
|
Copyright (c) 2010-2020 IBM Corporation. All rights reserved.
|
||
|
Copyright (c) 2012 Oak Ridge National Labs. All rights reserved.
|
||
|
Copyright (c) 2012 Sandia National Laboratories. All rights reserved.
|
||
|
Copyright (c) 2012 University of Houston. All rights reserved.
|
||
|
Copyright (c) 2013 NVIDIA Corporation. All rights reserved.
|
||
|
Copyright (c) 2013-2018 Intel, Inc. All rights reserved.
|
||
|
Copyright (c) 2018-2022 Amazon.com, Inc. or its affiliates. All Rights
|
||
|
reserved.
|
||
|
Copyright (c) 2019-2021 Triad National Security, LLC. All rights
|
||
|
reserved.
|
||
|
Copyright (c) 2021 Google, LLC. All rights reserved.
|
||
|
$COPYRIGHT$
|
||
|
|
||
|
Additional copyrights may follow
|
||
|
|
||
|
$HEADER$
|
||
|
|
||
|
===========================================================================
|
||
|
|
||
|
This file contains the main features as well as overviews of specific
|
||
|
bug fixes (and other actions) for each version of Open MPI since
|
||
|
version 1.0.
|
||
|
|
||
|
As more fully described in the "Software Version Number" section in
|
||
|
the README file, Open MPI typically releases two separate version
|
||
|
series simultaneously. Since these series have different goals and
|
||
|
are semi-independent of each other, a single NEWS-worthy item may be
|
||
|
introduced into different series at different times. For example,
|
||
|
feature F was introduced in the vA.B series at version vA.B.C, and was
|
||
|
later introduced into the vX.Y series at vX.Y.Z.
|
||
|
|
||
|
The first time feature F is released, the item will be listed in the
|
||
|
vA.B.C section, denoted as:
|
||
|
|
||
|
(** also to appear: X.Y.Z) -- indicating that this item is also
|
||
|
likely to be included in future release
|
||
|
version vX.Y.Z.
|
||
|
|
||
|
When vX.Y.Z is later released, the same NEWS-worthy item will also be
|
||
|
included in the vX.Y.Z section and be denoted as:
|
||
|
|
||
|
(** also appeared: A.B.C) -- indicating that this item was previously
|
||
|
included in release version vA.B.C.
|
||
|
|
||
|
4.1.4 -- May, 2022
|
||
|
------------------
|
||
|
|
||
|
- Fix possible length integer overflow in numerous non-blocking collective
|
||
|
operations.
|
||
|
- Fix segmentation fault in UCX if MPI Tool interface is finalized before
|
||
|
MPI_Init is called.
|
||
|
- Remove /usr/bin/python dependency in configure.
|
||
|
- Fix OMPIO issue with long double etypes.
|
||
|
- Update treematch topology component to fix numerous correctness issues.
|
||
|
- Fix memory leak in UCX MCA parameter registration.
|
||
|
- Fix long operation closing file descriptors on non-Linux systems that
|
||
|
can appear as a hang to users.
|
||
|
- Fix for attribute handling on GCC 11 due to pointer aliasing.
|
||
|
- Fix multithreaded race in UCX PML's datatype handling.
|
||
|
- Fix a correctness issue in CUDA Reduce algorithm.
|
||
|
- Fix compilation issue with CUDA GPUDirect RDMA support.
|
||
|
- Fix to make shmem_calloc(..., 0) conform to the OpenSHMEM specification.
|
||
|
- Add UCC collectives component.
|
||
|
- Fix divide by zero issue in OMPI IO component.
|
||
|
- Fix compile issue with libnl when not in standard search locations.
|
||
|
|
||
|
|
||
|
4.1.3 -- March, 2022
|
||
|
--------------------
|
||
|
|
||
|
- Fixed a seg fault in the smcuda BTL. Thanks to Moritz Kreutzer and
|
||
|
@Stadik for reporting the issue.
|
||
|
- Added support for ELEMENTAL to the MPI handle comparison functions
|
||
|
in the mpi_f08 module. Thanks to Salvatore Filippone for raising
|
||
|
the issue.
|
||
|
- Minor datatype performance improvements in the CUDA-based code paths.
|
||
|
- Fix MPI_ALLTOALLV when used with MPI_IN_PLACE.
|
||
|
- Fix MPI_BOTTOM handling for non-blocking collectives. Thanks to
|
||
|
Lisandro Dalcin for reporting the problem.
|
||
|
- Enable OPAL memory hooks by default for UCX.
|
||
|
- Many compiler warnings fixes, particularly for newer versions of
|
||
|
GCC.
|
||
|
- Fix intercommunicator overflow with large payload collectives. Also
|
||
|
fixed MPI_REDUCE_SCATTER_BLOCK for similar issues with large payload
|
||
|
collectives.
|
||
|
- Back-port ROMIO 3.3 fix to use stat64() instead of stat() on GPFS.
|
||
|
- Fixed several non-blocking MPI collectives to not round fractions
|
||
|
based on float precision.
|
||
|
- Fix compile failure for --enable-heterogeneous. Also updated the
|
||
|
README to clarify that --enable-heterogeneous is functional, but
|
||
|
still not recomended for most environments.
|
||
|
- Minor fixes to OMPIO, including:
|
||
|
- Fixing the open behavior of shared memory shared file pointers.
|
||
|
Thanks to Axel Huebl for reporting the issue
|
||
|
- Fixes to clean up lockfiles when closing files. Thanks to Eric
|
||
|
Chamberland for reporting the issue.
|
||
|
- Update LSF configure failure output to be more clear (e.g., on RHEL
|
||
|
platforms).
|
||
|
- Update if_[in|ex]clude behavior in btl_tcp and oob_tcp to select
|
||
|
*all* interfaces that fall within the specified subnet range.
|
||
|
|
||
|
|
||
|
4.1.2 -- November, 2021
|
||
|
-----------------------
|
||
|
|
||
|
- ROMIO portability fix for OpenBSD
|
||
|
- Fix handling of MPI_IN_PLACE with MPI_ALLTOALLW and improve performance
|
||
|
of MPI_ALLTOALL and MPI_ALLTOALLV for MPI_IN_PLACE.
|
||
|
- Fix one-sided issue with empty groups in Post-Start-Wait-Complete
|
||
|
synchronization mode.
|
||
|
- Fix Fortran status returns in certain use cases involving
|
||
|
Generalized Requests
|
||
|
- Romio datatype bug fixes.
|
||
|
- Fix oshmem_shmem_finalize() when main() returns non-zero value.
|
||
|
- Fix wrong affinity under LSF with the membind option.
|
||
|
- Fix count==0 cases in MPI_REDUCE and MPI_IREDUCE.
|
||
|
- Fix ssh launching on Bourne-flavored shells when the user has "set
|
||
|
-u" set in their shell startup files.
|
||
|
- Correctly process 0 slots with the mpirun --host option.
|
||
|
- Ensure to unlink and rebind socket when the Open MPI session
|
||
|
directory already exists.
|
||
|
- Fix a segv in mpirun --disable-dissable-map.
|
||
|
- Fix a potential hang in the memory hook handling.
|
||
|
- Slight performance improvement in MPI_WAITALL when running in
|
||
|
MPI_THREAD_MULTIPLE.
|
||
|
- Fix hcoll datatype mapping and rooted operation behavior.
|
||
|
- Correct some operations modifying MPI_Status.MPI_ERROR when it is
|
||
|
disallowed by the MPI standard.
|
||
|
- UCX updates:
|
||
|
- Fix datatype reference count issues.
|
||
|
- Detach dynamic window memory when freeing a window.
|
||
|
- Fix memory leak in datatype handling.
|
||
|
- Fix various atomic operations issues.
|
||
|
- mpirun: try to set the curses winsize to the pty of the spawned
|
||
|
task. Thanks to Stack Overflow user @Seriously for reporting the
|
||
|
issue.
|
||
|
- PMIx updates:
|
||
|
- Fix compatibility with external PMIx v4.x installations.
|
||
|
- Fix handling of PMIx v3.x compiler/linker flags. Thanks to Erik
|
||
|
Schnetter for reporting the issue.
|
||
|
- Skip SLURM-provided PMIx detection when appropriate. Thanks to
|
||
|
Alexander Grund for reporting the issue.
|
||
|
- Fix handling by C++ compilers when they #include the STL "<version>"
|
||
|
header file, which ends up including Open MPI's text VERSION file
|
||
|
(which is not C code). Thanks to @srpgilles for reporting the
|
||
|
issue.
|
||
|
- Fix MPI_Op support for MPI_LONG.
|
||
|
- Make the MPI C++ bindings library (libmpi_cxx) explicitly depend on
|
||
|
the OPAL internal library (libopen-pal). Thanks to Ye Luo for
|
||
|
reporting the issue.
|
||
|
- Fix configure handling of "--with-libevent=/usr".
|
||
|
- Fix memory leak when opening Lustre files. Thanks to Bert Wesarg
|
||
|
for submitting the fix.
|
||
|
- Fix MPI_SENDRECV_REPLACE to correctly process datatype errors.
|
||
|
Thanks to Lisandro Dalcin for reporting the issue.
|
||
|
- Fix MPI_SENDRECV_REPLACE to correctly handle large data. Thanks
|
||
|
Jakub Benda for reporting this issue and suggesting a fix.
|
||
|
- Add workaround for TCP "dropped connection" errors to drastically
|
||
|
reduce the possibility of this happening.
|
||
|
- OMPIO updates:
|
||
|
- Fix handling when AMODE is not set. Thanks to Rainer Keller for
|
||
|
reporting the issue and supplying the fix.
|
||
|
- Fix FBTL "posix" component linking issue. Thanks for Honggang Li
|
||
|
for reporting the issue.
|
||
|
- Fixed segv with MPI_FILE_GET_BYTE_OFFSET on 0-sized file view.
|
||
|
- Thanks to GitHub user @shanedsnyder for submitting the issue.
|
||
|
- OFI updates:
|
||
|
- Multi-plane / Multi-Nic nic selection cleanups
|
||
|
- Add support for exporting Open MPI memory monitors into
|
||
|
Libfabric.
|
||
|
- Ensure that Cisco usNIC devices are never selected by the OFI
|
||
|
MTL.
|
||
|
- Fix buffer overflow in OFI networking setup. Thanks to Alexander
|
||
|
Grund for reporting the issue and supplying the fix.
|
||
|
- Fix SSEND on tag matching networks.
|
||
|
- Fix error handling in several MPI collectives.
|
||
|
- Fix the ordering of MPI_COMM_SPLIT_TYPE. Thanks to Wolfgang
|
||
|
Bangerth for raising the issue.
|
||
|
- No longer install the orted-mpir library (it's an internal / Libtool
|
||
|
convenience library). Thanks to Andrew Hesford for the fix.
|
||
|
- PSM2 updates:
|
||
|
- Allow advanced users to disable PSM2 version checking.
|
||
|
- Fix to allow non-default installation locations of psm2.h.
|
||
|
|
||
|
|
||
|
4.1.1 -- April, 2021
|
||
|
--------------------
|
||
|
|
||
|
- Fix a number of datatype issues, including an issue with
|
||
|
improper handling of partial datatypes that could lead to
|
||
|
an unexpected application failure.
|
||
|
- Change UCX PML to not warn about MPI_Request leaks during
|
||
|
MPI_FINALIZE by default. The old behavior can be restored with
|
||
|
the mca_pml_ucx_request_leak_check MCA parameter.
|
||
|
- Reverted temporary solution that worked around launch issues in
|
||
|
SLURM v20.11.{0,1,2}. SchedMD encourages users to avoid these
|
||
|
versions and to upgrade to v20.11.3 or newer.
|
||
|
- Updated PMIx to v3.2.2.
|
||
|
- Fixed configuration issue on Apple Silicon observed with
|
||
|
Homebrew. Thanks to François-Xavier Coudert for reporting the issue.
|
||
|
- Disabled gcc built-in atomics by default on aarch64 platforms.
|
||
|
- Disabled UCX PML when UCX v1.8.0 is detected. UCX version 1.8.0 has a bug that
|
||
|
may cause data corruption when its TCP transport is used in conjunction with
|
||
|
the shared memory transport. UCX versions prior to v1.8.0 are not affected by
|
||
|
this issue. Thanks to @ksiazekm for reporting the issue.
|
||
|
- Fixed detection of available UCX transports/devices to better inform PML
|
||
|
prioritization.
|
||
|
- Fixed SLURM support to mark ORTE daemons as non-MPI tasks.
|
||
|
- Improved AVX detection to more accurately detect supported
|
||
|
platforms. Also improved the generated AVX code, and switched to
|
||
|
using word-based MCA params for the op/avx component (vs. numeric
|
||
|
big flags).
|
||
|
- Improved OFI compatibility support and fixed memory leaks in error
|
||
|
handling paths.
|
||
|
- Improved HAN collectives with support for Barrier and Scatter. Thanks
|
||
|
to @EmmanuelBRELLE for these changes and the relevant bug fixes.
|
||
|
- Fixed MPI debugger support (i.e., the MPIR_Breakpoint() symbol).
|
||
|
Thanks to @louisespellacy-arm for reporting the issue.
|
||
|
- Fixed ORTE bug that prevented debuggers from reading MPIR_Proctable.
|
||
|
- Removed PML uniformity check from the UCX PML to address performance
|
||
|
regression.
|
||
|
- Fixed MPI_Init_thread(3) statement about C++ binding and update
|
||
|
references about MPI_THREAD_MULTIPLE. Thanks to Andreas Lösel for
|
||
|
bringing the outdated docs to our attention.
|
||
|
- Added fence_nb to Flux PMIx support to address segmentation faults.
|
||
|
- Ensured progress of AIO requests in the POSIX FBTL component to
|
||
|
prevent exceeding maximum number of pending requests on MacOS.
|
||
|
- Used OPAL's mutli-thread support in the orted to leverage atomic
|
||
|
operations for object refcounting.
|
||
|
- Fixed segv when launching with static TCP ports.
|
||
|
- Fixed --debug-daemons mpirun CLI option.
|
||
|
- Fixed bug where mpirun did not honor --host in a managed job
|
||
|
allocation.
|
||
|
- Made a managed allocation filter a hostfile/hostlist.
|
||
|
- Fixed bug to marked a generalized request as pending once initiated.
|
||
|
- Fixed external PMIx v4.x check.
|
||
|
- Fixed OSHMEM build with `--enable-mem-debug`.
|
||
|
- Fixed a performance regression observed with older versions of GCC when
|
||
|
__ATOMIC_SEQ_CST is used. Thanks to @BiplabRaut for reporting the issue.
|
||
|
- Fixed buffer allocation bug in the binomial tree scatter algorithm when
|
||
|
non-contiguous datatypes are used. Thanks to @sadcat11 for reporting the issue.
|
||
|
- Fixed bugs related to the accumulate and atomics functionality in the
|
||
|
osc/rdma component.
|
||
|
- Fixed race condition in MPI group operations observed with
|
||
|
MPI_THREAD_MULTIPLE threading level.
|
||
|
- Fixed a deadlock in the TCP BTL's connection matching logic.
|
||
|
- Fixed pml/ob1 compilation error when CUDA support is enabled.
|
||
|
- Fixed a build issue with Lustre caused by unnecessary header includes.
|
||
|
- Fixed a build issue with IMB LSF workload manager.
|
||
|
- Fixed linker error with UCX SPML.
|
||
|
|
||
|
|
||
|
4.1.0 -- December, 2020
|
||
|
-----------------------
|
||
|
|
||
|
- collectives: Add HAN and ADAPT adaptive collectives components.
|
||
|
Both components are off by default and can be enabled by specifying
|
||
|
"mpirun --mca coll_adapt_priority 100 --mca coll_han_priority 100 ...".
|
||
|
We intend to enable both by default in Open MPI 5.0.
|
||
|
- OMPIO is now the default for MPI-IO on all filesystems, including
|
||
|
Lustre (prior to this, ROMIO was the default for Lustre). Many
|
||
|
thanks to Mark Dixon for identifying MPI I/O issues and providing
|
||
|
access to Lustre systems for testing.
|
||
|
- Updates for macOS Big Sur. Thanks to FX Coudert for reporting this
|
||
|
issue and pointing to a solution.
|
||
|
- Minor MPI one-sided RDMA performance improvements.
|
||
|
- Fix hcoll MPI_SCATTERV with MPI_IN_PLACE.
|
||
|
- Add AVX support for MPI collectives.
|
||
|
- Updates to mpirun(1) about "slots" and PE=x values.
|
||
|
- Fix buffer allocation for large environment variables. Thanks to
|
||
|
@zrss for reporting the issue.
|
||
|
- Upgrade the embedded OpenPMIx to v3.2.2.
|
||
|
- Take more steps towards creating fully Reproducible builds (see
|
||
|
https://reproducible-builds.org/). Thanks Bernhard M. Wiedemann for
|
||
|
bringing this to our attention.
|
||
|
- Fix issue with extra-long values in MCA files. Thanks to GitHub
|
||
|
user @zrss for bringing the issue to our attention.
|
||
|
- UCX: Fix zero-sized datatype transfers.
|
||
|
- Fix --cpu-list for non-uniform modes.
|
||
|
- Fix issue in PMIx callback caused by missing memory barrier on Arm platforms.
|
||
|
- OFI MTL: Various bug fixes.
|
||
|
- Fixed issue where MPI_TYPE_CREATE_RESIZED would create a datatype
|
||
|
with unexpected extent on oddly-aligned datatypes.
|
||
|
- collectives: Adjust default tuning thresholds for many collective
|
||
|
algorithms
|
||
|
- runtime: fix situation where rank-by argument does not work
|
||
|
- Portals4: Clean up error handling corner cases
|
||
|
- runtime: Remove --enable-install-libpmix option, which has not
|
||
|
worked since it was added
|
||
|
- opal: Disable memory patcher component on MacOS
|
||
|
- UCX: Allow UCX 1.8 to be used with the btl uct
|
||
|
- UCX: Replace usage of the deprecated NB API of UCX with NBX
|
||
|
- OMPIO: Add support for the IME file system
|
||
|
- OFI/libfabric: Added support for multiple NICs
|
||
|
- OFI/libfabric: Added support for Scalable Endpoints
|
||
|
- OFI/libfabric: Added btl for one-sided support
|
||
|
- OFI/libfabric: Multiple small bugfixes
|
||
|
- libnbc: Adding numerous performance-improving algorithms
|
||
|
|
||
|
4.0.7 -- November, 2021
|
||
|
-----------------------
|
||
|
|
||
|
- Fix an issue with MPI_IALLREDUCE_SCATTER when using large count
|
||
|
arguments.
|
||
|
- Fixed an issue with POST/START/COMPLETE/WAIT when using subsets
|
||
|
of processes. Thanks to Thomas Gilles for reporting.
|
||
|
- Numerous fixes from vendor partners.
|
||
|
- Fix a problem with a couple of MPI_IALLREDUCE algorithms. Thanks to
|
||
|
John Donners for reporting.
|
||
|
- Fix an edge case where MPI_Reduce is invoked with zero count and NULL
|
||
|
source and destination buffers.
|
||
|
- Use the mfence instruction in opal_atomic_rmb on x86_64 cpus. Thanks
|
||
|
to George Katevenis for proposing a fix.
|
||
|
- Fix an issue with the Open MPI build system using the SLURM provided
|
||
|
PMIx when not requested by the user. Thanks to Alexander Grund for
|
||
|
reporting.
|
||
|
- Fix a problem compiling Open MPI with clang on case-insensitive
|
||
|
file systems. Thanks to @srpgilles for reporting.
|
||
|
- Fix some OFI usNIC/OFI MTL interaction problems. Thanks to
|
||
|
@roguephysicist reporting this issue.
|
||
|
- Fix a problem with the Posix fbtl component failing to load.
|
||
|
Thanks to Honggang Li for reporting.
|
||
|
|
||
|
4.0.6 -- June, 2021
|
||
|
-------------------
|
||
|
|
||
|
- Update embedded PMIx to 3.2.3. This update addresses several
|
||
|
MPI_COMM_SPAWN problems.
|
||
|
- Fix an issue with MPI_FILE_GET_BYTE_OFFSET when supplying a
|
||
|
zero size file view. Thanks to @shanedsnyder for reporting.
|
||
|
- Fix an issue with MPI_COMM_SPLIT_TYPE not observing key correctly.
|
||
|
Thanks to Wolfgang Bangerth for reporting.
|
||
|
- Fix a derived datatype issue that could lead to potential data
|
||
|
corruption when using UCX. Thanks to @jayeshkrishna for reporting.
|
||
|
- Fix a problem with shared memory transport file name collisions.
|
||
|
Thanks to Moritz Kreutzer for reporting.
|
||
|
- Fix a problem when using Flux PMI and UCX. Thanks to Sami Ilvonen
|
||
|
for reporting and supplying a fix.
|
||
|
- Fix a problem with MPIR breakpoint being compiled out using PGI
|
||
|
compilers. Thanks to @louisespellacy-arm for reporting.
|
||
|
- Fix some ROMIO issues when using Lustre. Thanks to Mark Dixon for
|
||
|
reporting.
|
||
|
- Fix a problem using an external PMIx 4 to build Open MPI 4.0.x.
|
||
|
- Fix a compile problem when using the enable-timing configure option
|
||
|
and UCX. Thanks to Jan Bierbaum for reporting.
|
||
|
- Fix a symbol name collision when using the Cray compiler to build
|
||
|
Open SHMEM. Thanks to Pak Lui for reporting and fixing.
|
||
|
- Correct an issue encountered when building Open MPI under OSX Big Sur.
|
||
|
Thanks to FX Coudert for reporting.
|
||
|
- Various fixes to the OFI MTL.
|
||
|
- Fix an issue with allocation of sufficient memory for parsing long
|
||
|
environment variable values. Thanks to @zrss for reporting.
|
||
|
- Improve reproducibility of builds to assist Open MPI packages.
|
||
|
Thanks to Bernhard Wiedmann for bringing this to our attention.
|
||
|
|
||
|
4.0.5 -- August, 2020
|
||
|
---------------------
|
||
|
|
||
|
- Fix a problem with MPI RMA compare and swap operations. Thanks
|
||
|
to Wojciech Chlapek for reporting.
|
||
|
- Disable binding of MPI processes to system resources by Open MPI
|
||
|
if an application is launched using SLURM's srun command.
|
||
|
- Disable building of the Fortran mpi_f08 module when configuring
|
||
|
Open MPI with default 8 byte Fortran integer size. Thanks to
|
||
|
@ahcien for reporting.
|
||
|
- Fix a problem with mpirun when the --map-by option is used.
|
||
|
Thanks to Wenbin Lyu for reporting.
|
||
|
- Fix some issues with MPI one-sided operations uncovered using Global
|
||
|
Arrays regression test-suite. Thanks to @bjpalmer for reporting.
|
||
|
- Fix a problem with make check when using the PGI compiler. Thanks to
|
||
|
Carl Ponder for reporting.
|
||
|
- Fix a problem with MPI_FILE_READ_AT_ALL that could lead to application
|
||
|
hangs under certain circumstances. Thanks to Scot Breitenfeld for
|
||
|
reporting.
|
||
|
- Fix a problem building C++ applications with newer versions of GCC.
|
||
|
Thanks to Constantine Khrulev for reporting.
|
||
|
|
||
|
4.0.4 -- June, 2020
|
||
|
-----------------------
|
||
|
|
||
|
- Fix a memory patcher issue intercepting shmat and shmdt. This was
|
||
|
observed on RHEL 8.x ppc64le (see README for more info).
|
||
|
- Fix an illegal access issue caught using gcc's address sanitizer.
|
||
|
Thanks to Georg Geiser for reporting.
|
||
|
- Add checks to avoid conflicts with a libevent library shipped with LSF.
|
||
|
- Switch to linking against libevent_core rather than libevent, if present.
|
||
|
- Add improved support for UCX 1.9 and later.
|
||
|
- Fix an ABI compatibility issue with the Fortran 2008 bindings.
|
||
|
Thanks to Alastair McKinstry for reporting.
|
||
|
- Fix an issue with rpath of /usr/lib64 when building OMPI on
|
||
|
systems with Lustre. Thanks to David Shrader for reporting.
|
||
|
- Fix a memory leak occurring with certain MPI RMA operations.
|
||
|
- Fix an issue with ORTE's mapping of MPI processes to resources.
|
||
|
Thanks to Alex Margolin for reporting and providing a fix.
|
||
|
- Correct a problem with incorrect error codes being returned
|
||
|
by OMPI MPI_T functions.
|
||
|
- Fix an issue with debugger tools not being able to attach
|
||
|
to mpirun more than once. Thanks to Gregory Lee for reporting.
|
||
|
- Fix an issue with the Fortran compiler wrappers when using
|
||
|
NAG compilers. Thanks to Peter Brady for reporting.
|
||
|
- Fix an issue with the ORTE ssh based process launcher at scale.
|
||
|
Thanks to Benjamín Hernández for reporting.
|
||
|
- Address an issue when using shared MPI I/O operations. OMPIO will
|
||
|
now successfully return from the file open statement but will
|
||
|
raise an error if the file system does not supported shared I/O
|
||
|
operations. Thanks to Romain Hild for reporting.
|
||
|
- Fix an issue with MPI_WIN_DETACH. Thanks to Thomas Naughton for reporting.
|
||
|
|
||
|
4.0.3 -- March, 2020
|
||
|
-----------------------
|
||
|
|
||
|
- Update embedded PMIx to 3.1.5
|
||
|
- Add support for Mellanox ConnectX-6.
|
||
|
- Fix an issue in OpenMPI IO when using shared file pointers.
|
||
|
Thanks to Romain Hild for reporting.
|
||
|
- Fix a problem with Open MPI using a previously installed
|
||
|
Fortran mpi module during compilation. Thanks to Marcin
|
||
|
Mielniczuk for reporting
|
||
|
- Fix a problem with Fortran compiler wrappers ignoring use of
|
||
|
disable-wrapper-runpath configure option. Thanks to David
|
||
|
Shrader for reporting.
|
||
|
- Fixed an issue with trying to use mpirun on systems where neither
|
||
|
ssh nor rsh is installed.
|
||
|
- Address some problems found when using XPMEM for intra-node message
|
||
|
transport.
|
||
|
- Improve dimensions returned by MPI_Dims_create for certain
|
||
|
cases. Thanks to @aw32 for reporting.
|
||
|
- Fix an issue when sending messages larger than 4GB. Thanks to
|
||
|
Philip Salzmann for reporting this issue.
|
||
|
- Add ability to specify alternative module file path using
|
||
|
Open MPI's RPM spec file. Thanks to @jschwartz-cray for reporting.
|
||
|
- Clarify use of --with-hwloc configuration option in the README.
|
||
|
Thanks to Marcin Mielniczuk for raising this documentation issue.
|
||
|
- Fix an issue with shmem_atomic_set. Thanks to Sameh Sharkawi for reporting.
|
||
|
- Fix a problem with MPI_Neighbor_alltoall(v,w) for cartesian communicators
|
||
|
with cyclic boundary conditions. Thanks to Ralph Rabenseifner and
|
||
|
Tony Skjellum for reporting.
|
||
|
- Fix an issue using Open MPIO on 32 bit systems. Thanks to
|
||
|
Orion Poplawski for reporting.
|
||
|
- Fix an issue with NetCDF test deadlocking when using the vulcan
|
||
|
Open MPIO component. Thanks to Orion Poplawski for reporting.
|
||
|
- Fix an issue with the mpi_yield_when_idle parameter being ignored
|
||
|
when set in the Open MPI MCA parameter configuration file.
|
||
|
Thanks to @iassiour for reporting.
|
||
|
- Address an issue with Open MPIO when writing/reading more than 2GB
|
||
|
in an operation. Thanks to Richard Warren for reporting.
|
||
|
|
||
|
4.0.2 -- September, 2019
|
||
|
------------------------
|
||
|
|
||
|
- Update embedded PMIx to 3.1.4
|
||
|
- Enhance Open MPI to detect when processes are running in
|
||
|
different name spaces on the same node, in which case the
|
||
|
vader CMA single copy mechanism is disabled. Thanks
|
||
|
to Adrian Reber for reporting and providing a fix.
|
||
|
- Fix an issue with ORTE job tree launch mechanism. Thanks
|
||
|
to @lanyangyang for reporting.
|
||
|
- Fix an issue with env processing when running as root.
|
||
|
Thanks to Simon Byrne for reporting and providing a fix.
|
||
|
- Fix Fortran MPI_FILE_GET_POSITION return code bug.
|
||
|
Thanks to Wei-Keng Liao for reporting.
|
||
|
- Fix user defined datatypes/ops leak in nonblocking base collective
|
||
|
component. Thanks to Andrey Maslennikov for verifying fix.
|
||
|
- Fixed shared memory not working with spawned processes.
|
||
|
Thanks to @rodarima for reporting.
|
||
|
- Fix data corruption of overlapping datatypes on sends.
|
||
|
Thanks to DKRZ for reporting.
|
||
|
- Fix segfault in oob_tcp component on close with active listeners.
|
||
|
Thanks to Orivej Desh for reporting and providing a fix.
|
||
|
- Fix divide by zero segfault in ompio.
|
||
|
Thanks to @haraldkl for reporting and providing a fix.
|
||
|
- Fix finalize of flux compnents.
|
||
|
Thanks to Stephen Herbein and Jim Garlick for providing a fix.
|
||
|
- Fix osc_rdma_acc_single_intrinsic regression.
|
||
|
Thanks to Joseph Schuchart for reporting and providing a fix.
|
||
|
- Fix hostnames with large integers.
|
||
|
Thanks to @perrynzhou for reporting and providing a fix.
|
||
|
- Fix Deadlock in MPI_Fetch_and_op when using UCX
|
||
|
Thanks to Joseph Schuchart for reporting.
|
||
|
- Fix the SLURM plm for mpirun-based launching.
|
||
|
Thanks to Jordon Hayes for reporting and providing a fix.
|
||
|
- Prevent grep failure in rpmbuild from aborting.
|
||
|
Thanks to Daniel Letai for reporting.
|
||
|
- Fix btl/vader finalize sequence.
|
||
|
Thanks to Daniel Vollmer for reporting.
|
||
|
- Fix pml/ob1 local handle sent during PUT control message.
|
||
|
Thanks to @EmmanuelBRELLE for reporting and providing a fix.
|
||
|
- Fix Memory leak with persistent MPI sends and the ob1 "get" protocol.
|
||
|
Thanks to @s-kuberski for reporting.
|
||
|
- v4.0.x: mpi: mark MPI_COMBINER_{HVECTOR,HINDEXED,STRUCT}_INTEGER
|
||
|
removed unless configured with --enable-mpi1-compatibility
|
||
|
- Fix make-authors.pl when run in a git submodule.
|
||
|
Thanks to Michael Heinz for reporting and providing a fix.
|
||
|
- Fix deadlock with mpi_assert_allow_overtaking in MPI_Issend.
|
||
|
Thanks to Joseph Schuchart and George Bosilca for reporting.
|
||
|
- Add compilation flag to allow unwinding through files that are
|
||
|
present in the stack when attaching with MPIR.
|
||
|
Thanks to James A Clark for reporting and providing a fix.
|
||
|
|
||
|
Known issues:
|
||
|
|
||
|
- There is a known issue with the OFI libfabric and PSM2 MTLs when trying to send
|
||
|
very long (> 4 GBytes) messages. In this release, these MTLs will catch
|
||
|
this case and abort the transfer. A future release will provide a
|
||
|
better solution to this issue.
|
||
|
|
||
|
4.0.1 -- March, 2019
|
||
|
--------------------
|
||
|
|
||
|
- Update embedded PMIx to 3.1.2.
|
||
|
- Fix an issue with Vader (shared-memory) transport on OS-X. Thanks
|
||
|
to Daniel Vollmer for reporting.
|
||
|
- Fix a problem with the usNIC BTL Makefile. Thanks to George Marselis
|
||
|
for reporting.
|
||
|
- Fix an issue when using --enable-visibility configure option
|
||
|
and older versions of hwloc. Thanks to Ben Menadue for reporting
|
||
|
and providing a fix.
|
||
|
- Fix an issue with MPI_WIN_CREATE_DYNAMIC and MPI_GET from self.
|
||
|
Thanks to Bart Janssens for reporting.
|
||
|
- Fix an issue of excessive compiler warning messages from mpi.h
|
||
|
when using newer C++ compilers. Thanks to @Shadow-fax for
|
||
|
reporting.
|
||
|
- Fix a problem when building Open MPI using clang 5.0.
|
||
|
- Fix a problem with MPI_WIN_CREATE when using UCX. Thanks
|
||
|
to Adam Simpson for reporting.
|
||
|
- Fix a memory leak encountered for certain MPI datatype
|
||
|
destructor operations. Thanks to Axel Huebl for reporting.
|
||
|
- Fix several problems with MPI RMA accumulate operations.
|
||
|
Thanks to Jeff Hammond for reporting.
|
||
|
- Fix possible race condition in closing some file descriptors
|
||
|
during job launch using mpirun. Thanks to Jason Williams
|
||
|
for reporting and providing a fix.
|
||
|
- Fix a problem in OMPIO for large individual write operations.
|
||
|
Thanks to Axel Huebl for reporting.
|
||
|
- Fix a problem with parsing of map-by ppr options to mpirun.
|
||
|
Thanks to David Rich for reporting.
|
||
|
- Fix a problem observed when using the mpool hugepage component. Thanks
|
||
|
to Hunter Easterday for reporting and fixing.
|
||
|
- Fix valgrind warning generated when invoking certain MPI Fortran
|
||
|
data type creation functions. Thanks to @rtoijala for reporting.
|
||
|
- Fix a problem when trying to build with a PMIX 3.1 or newer
|
||
|
release. Thanks to Alastair McKinstry for reporting.
|
||
|
- Fix a problem encountered with building MPI F08 module files.
|
||
|
Thanks to Igor Andriyash and Axel Huebl for reporting.
|
||
|
- Fix two memory leaks encountered for certain MPI-RMA usage patterns.
|
||
|
Thanks to Joseph Schuchart for reporting and fixing.
|
||
|
- Fix a problem with the ORTE rmaps_base_oversubscribe MCA paramater.
|
||
|
Thanks to @iassiour for reporting.
|
||
|
- Fix a problem with UCX PML default error handler for MPI communicators.
|
||
|
Thanks to Marcin Krotkiewski for reporting.
|
||
|
- Fix various issues with OMPIO uncovered by the testmpio test suite.
|
||
|
|
||
|
4.0.0 -- September, 2018
|
||
|
------------------------
|
||
|
|
||
|
- OSHMEM updated to the OpenSHMEM 1.4 API.
|
||
|
- Do not build OpenSHMEM layer when there are no SPMLs available.
|
||
|
Currently, this means the OpenSHMEM layer will only build if
|
||
|
a MXM or UCX library is found.
|
||
|
- A UCX BTL was added for enhanced MPI RMA support using UCX
|
||
|
- With this release, OpenIB BTL now only supports iWarp and RoCE by default.
|
||
|
- Updated internal HWLOC to 2.0.2
|
||
|
- Updated internal PMIx to 3.0.2
|
||
|
- Change the priority for selecting external verses internal HWLOC
|
||
|
and PMIx packages to build. Starting with this release, configure
|
||
|
by default selects available external HWLOC and PMIx packages over
|
||
|
the internal ones.
|
||
|
- Updated internal ROMIO to 3.2.1.
|
||
|
- Removed support for the MXM MTL.
|
||
|
- Removed support for SCIF.
|
||
|
- Improved CUDA support when using UCX.
|
||
|
- Enable use of CUDA allocated buffers for OMPIO.
|
||
|
- Improved support for two phase MPI I/O operations when using OMPIO.
|
||
|
- Added support for Software-based Performance Counters, see
|
||
|
https://github.com/davideberius/ompi/wiki/How-to-Use-Software-Based-Performance-Counters-(SPCs)-in-Open-MPI
|
||
|
- Change MTL OFI from opting-IN on "psm,psm2,gni" to opting-OUT on
|
||
|
"shm,sockets,tcp,udp,rstream"
|
||
|
- Various improvements to MPI RMA performance when using RDMA
|
||
|
capable interconnects.
|
||
|
- Update memkind component to use the memkind 1.6 public API.
|
||
|
- Fix a problem with javadoc builds using OpenJDK 11. Thanks to
|
||
|
Siegmar Gross for reporting.
|
||
|
- Fix a memory leak using UCX. Thanks to Charles Taylor for reporting.
|
||
|
- Fix hangs in MPI_FINALIZE when using UCX.
|
||
|
- Fix a problem with building Open MPI using an external PMIx 2.1.2
|
||
|
library. Thanks to Marcin Krotkiewski for reporting.
|
||
|
- Fix race conditions in Vader (shared memory) transport.
|
||
|
- Fix problems with use of newer map-by mpirun options. Thanks to
|
||
|
Tony Reina for reporting.
|
||
|
- Fix rank-by algorithms to properly rank by object and span
|
||
|
- Allow for running as root of two environment variables are set.
|
||
|
Requested by Axel Huebl.
|
||
|
- Fix a problem with building the Java bindings when using Java 10.
|
||
|
Thanks to Bryce Glover for reporting.
|
||
|
- Fix a problem with ORTE not reporting error messages if an application
|
||
|
terminated normally but exited with non-zero error code. Thanks to
|
||
|
Emre Brookes for reporting.
|
||
|
|
||
|
3.1.6 -- March, 2020
|
||
|
--------------------
|
||
|
|
||
|
- Fix one-sided shared memory window configuration bug.
|
||
|
- Fix support for PGI'18 compiler.
|
||
|
- Fix issue with zero-length blockLength in MPI_TYPE_INDEXED.
|
||
|
- Fix run-time linker issues with OMPIO on newer Linux distros.
|
||
|
- Fix PMIX dstore locking compilation issue. Thanks to Marco Atzeri
|
||
|
for reporting the issue.
|
||
|
- Allow the user to override modulefile_path in the Open MPI SRPM,
|
||
|
even if install_in_opt is set to 1.
|
||
|
- Properly detect ConnectX-6 HCAs in the openib BTL.
|
||
|
- Fix segfault in the MTL/OFI initialization for large jobs.
|
||
|
- Fix issue to guarantee to properly release MPI one-sided lock when
|
||
|
using UCX transports to avoid a deadlock.
|
||
|
- Fix potential deadlock when processing outstanding transfers with
|
||
|
uGNI transports.
|
||
|
- Fix various portals4 control flow bugs.
|
||
|
- Fix communications ordering for alltoall and Cartesian neighborhood
|
||
|
collectives.
|
||
|
- Fix an infinite recursion crash in the memory patcher on systems
|
||
|
with glibc v2.26 or later (e.g., Ubuntu 18.04) when using certain
|
||
|
OS-bypass interconnects.
|
||
|
|
||
|
3.1.5 -- November, 2019
|
||
|
-----------------------
|
||
|
|
||
|
- Fix OMPIO issue limiting file reads/writes to 2GB. Thanks to
|
||
|
Richard Warren for reporting the issue.
|
||
|
- At run time, automatically disable Linux cross-memory attach (CMA)
|
||
|
for vader BTL (shared memory) copies when running in user namespaces
|
||
|
(i.e., containers). Many thanks to Adrian Reber for raising the
|
||
|
issue and providing the fix.
|
||
|
- Sending very large MPI messages using the ofi MTL will fail with
|
||
|
some of the underlying Libfabric transports (e.g., PSM2 with
|
||
|
messages >=4GB, verbs with messages >=2GB). Prior version of Open
|
||
|
MPI failed silently; this version of Open MPI invokes the
|
||
|
appropriate MPI error handler upon failure. See
|
||
|
https://github.com/open-mpi/ompi/issues/7058 for more details.
|
||
|
Thanks to Emmanuel Thomé for raising the issue.
|
||
|
- Fix case where 0-extent datatypes might be eliminated during
|
||
|
optimization. Thanks to Github user @tjahns for raising the issue.
|
||
|
- Ensure that the MPIR_Breakpoint symbol is not optimized out on
|
||
|
problematic platforms.
|
||
|
- Fix MPI one-sided 32 bit atomic support.
|
||
|
- Fix OMPIO offset calculations with SEEK_END and SEEK_CUR in
|
||
|
MPI_FILE_GET_POSITION. Thanks to Wei-keng Liao for raising the
|
||
|
issue.
|
||
|
- Add "naive" regx component that will never fail, no matter how
|
||
|
esoteric the hostnames are.
|
||
|
- Fix corner case for datatype extent computations. Thanks to David
|
||
|
Dickenson for raising the issue.
|
||
|
- Allow individual jobs to set their map/rank/bind policies when
|
||
|
running LSF. Thanks to Nick R. Papior for assistance in solving the
|
||
|
issue.
|
||
|
- Fix MPI buffered sends with the "cm" PML.
|
||
|
- Properly propagate errors to avoid deadlocks in MPI one-sided operations.
|
||
|
- Update to PMIx v2.2.3.
|
||
|
- Fix data corruption in non-contiguous MPI accumulates over UCX.
|
||
|
- Fix ssh-based tree-based spawning at scale. Many thanks to Github
|
||
|
user @zrss for the report and diagnosis.
|
||
|
- Fix the Open MPI RPM spec file to not abort when grep fails. Thanks
|
||
|
to Daniel Letai for bringing this to our attention.
|
||
|
- Handle new SLURM CLI options (SLURM 19 deprecated some options that
|
||
|
Open MPI was using). Thanks to Jordan Hayes for the report and the
|
||
|
initial fix.
|
||
|
- OMPI: fix division by zero with an empty file view.
|
||
|
- Also handle shmat()/shmdt() memory patching with OS-bypass networks.
|
||
|
- Add support for unwinding info to all files that are present in the
|
||
|
stack starting from MPI_Init, which is helpful with parallel
|
||
|
debuggers. Thanks to James Clark for the report and initial fix.
|
||
|
- Fixed inadvertant use of bitwise operators in the MPI C++ bindings
|
||
|
header files. Thanks to Bert Wesarg for the report and the fix.
|
||
|
|
||
|
3.1.4 -- April, 2019
|
||
|
--------------------
|
||
|
|
||
|
- Fix compile error when configured with --enable-mpi-java and
|
||
|
--with-devel-headers. Thanks to @g-raffy for reporting the issue
|
||
|
(** also appeared: v3.0.4).
|
||
|
- Only use hugepages with appropriate permissions. Thanks to Hunter
|
||
|
Easterday for the fix.
|
||
|
- Fix possible floating point rounding and division issues in OMPIO
|
||
|
which led to crashes and/or data corruption with very large data.
|
||
|
Thanks to Axel Huebl and René Widera for identifing the issue,
|
||
|
supplying and testing the fix (** also appeared: v3.0.4).
|
||
|
- Use static_cast<> in mpi.h where appropriate. Thanks to @shadow-fx
|
||
|
for identifying the issue (** also appeared: v3.0.4).
|
||
|
- Fix RMA accumulate of non-predefined datatypes with predefined
|
||
|
operators. Thanks to Jeff Hammond for raising the issue (** also
|
||
|
appeared: v3.0.4).
|
||
|
- Fix race condition when closing open file descriptors when launching
|
||
|
MPI processes. Thanks to Jason Williams for identifying the issue and
|
||
|
supplying the fix (** also appeared: v3.0.4).
|
||
|
- Fix support for external PMIx v3.1.x.
|
||
|
- Fix Valgrind warnings for some MPI_TYPE_CREATE_* functions. Thanks
|
||
|
to Risto Toijala for identifying the issue and supplying the fix (**
|
||
|
also appeared: v3.0.4).
|
||
|
- Fix MPI_TYPE_CREATE_F90_{REAL,COMPLEX} for r=38 and r=308 (** also
|
||
|
appeared: v3.0.4).
|
||
|
- Fix assembly issues with old versions of gcc (<6.0.0) that affected
|
||
|
the stability of shared memory communications (e.g., with the vader
|
||
|
BTL) (** also appeared: v3.0.4).
|
||
|
- Fix MPI_Allreduce crashes with some cases in the coll/spacc module.
|
||
|
- Fix the OFI MTL handling of MPI_ANY_SOURCE (** also appeared:
|
||
|
v3.0.4).
|
||
|
- Fix noisy errors in the openib BTL with regards to
|
||
|
ibv_exp_query_device(). Thanks to Angel Beltre and others who
|
||
|
reported the issue (** also appeared: v3.0.4).
|
||
|
- Fix zero-size MPI one-sided windows with UCX.
|
||
|
|
||
|
3.1.3 -- October, 2018
|
||
|
----------------------
|
||
|
|
||
|
- Fix race condition in MPI_THREAD_MULTIPLE support of non-blocking
|
||
|
send/receive path.
|
||
|
- Fix error handling SIGCHLD forwarding.
|
||
|
- Add support for CHARACTER and LOGICAL Fortran datatypes for MPI_SIZEOF.
|
||
|
- Fix compile error when using OpenJDK 11 to compile the Java bindings.
|
||
|
- Fix crash when using a hostfile with a 'user@host' line.
|
||
|
- Numerous Fortran '08 interface fixes.
|
||
|
- TCP BTL error message fixes.
|
||
|
- OFI MTL now will use any provider other than shm, sockets, tcp, udp, or
|
||
|
rstream, rather than only supporting gni, psm, and psm2.
|
||
|
- Disable async receive of CUDA buffers by default, fixing a hang
|
||
|
on large transfers.
|
||
|
- Support the BCM57XXX and BCM58XXX Broadcomm adapters.
|
||
|
- Fix minmax datatype support in ROMIO.
|
||
|
- Bug fixes in vader shared memory transport.
|
||
|
- Support very large buffers with MPI_TYPE_VECTOR.
|
||
|
- Fix hang when launching with mpirun on Cray systems.
|
||
|
|
||
|
3.1.2 -- August, 2018
|
||
|
------------------------
|
||
|
|
||
|
- A subtle race condition bug was discovered in the "vader" BTL
|
||
|
(shared memory communications) that, in rare instances, can cause
|
||
|
MPI processes to crash or incorrectly classify (or effectively drop)
|
||
|
an MPI message sent via shared memory. If you are using the "ob1"
|
||
|
PML with "vader" for shared memory communication (note that vader is
|
||
|
the default for shared memory communication with ob1), you need to
|
||
|
upgrade to v3.1.2 or later to fix this issue. You may also upgrade
|
||
|
to the following versions to fix this issue:
|
||
|
- Open MPI v2.1.5 (expected end of August, 2018) or later in the
|
||
|
v2.1.x series
|
||
|
- Open MPI v3.0.1 (released March, 2018) or later in the v3.0.x
|
||
|
series
|
||
|
- Assorted Portals 4.0 bug fixes.
|
||
|
- Fix for possible data corruption in MPI_BSEND.
|
||
|
- Move shared memory file for vader btl into /dev/shm on Linux.
|
||
|
- Fix for MPI_ISCATTER/MPI_ISCATTERV Fortran interfaces with MPI_IN_PLACE.
|
||
|
- Upgrade PMIx to v2.1.3.
|
||
|
- Numerous One-sided bug fixes.
|
||
|
- Fix for race condition in uGNI BTL.
|
||
|
- Improve handling of large number of interfaces with TCP BTL.
|
||
|
- Numerous UCX bug fixes.
|
||
|
|
||
|
3.1.1 -- June, 2018
|
||
|
-------------------
|
||
|
|
||
|
- Fix potential hang in UCX PML during MPI_FINALIZE
|
||
|
- Update internal PMIx to v2.1.2rc2 to fix forward version compatibility.
|
||
|
- Add new MCA parameter osc_sm_backing_store to allow users to specify
|
||
|
where in the filesystem the backing file for the shared memory
|
||
|
one-sided component should live. Defaults to /dev/shm on Linux.
|
||
|
- Fix potential hang on non-x86 platforms when using builds with
|
||
|
optimization flags turned off.
|
||
|
- Disable osc/pt2pt when using MPI_THREAD_MULTIPLE due to numerous
|
||
|
race conditions in the component.
|
||
|
- Fix dummy variable names for the mpi and mpi_f08 Fortran bindings to
|
||
|
match the MPI standard. This may break applications which use
|
||
|
name-based parameters in Fortran which used our internal names
|
||
|
rather than those documented in the MPI standard.
|
||
|
- Revamp Java detection to properly handle new Java versions which do
|
||
|
not provide a javah wrapper.
|
||
|
- Fix RMA function signatures for use-mpi-f08 bindings to have the
|
||
|
asynchonous property on all buffers.
|
||
|
- Improved configure logic for finding the UCX library.
|
||
|
|
||
|
3.1.0 -- May, 2018
|
||
|
------------------
|
||
|
|
||
|
- Various OpenSHMEM bug fixes.
|
||
|
- Properly handle array_of_commands argument to Fortran version of
|
||
|
MPI_COMM_SPAWN_MULTIPLE.
|
||
|
- Fix bug with MODE_SEQUENTIAL and the sharedfp MPI-IO component.
|
||
|
- Use "javac -h" instead of "javah" when building the Java bindings
|
||
|
with a recent version of Java.
|
||
|
- Fix mis-handling of jostepid under SLURM that could cause problems
|
||
|
with PathScale/OmniPath NICs.
|
||
|
- Disable the POWER 7/BE block in configure. Note that POWER 7/BE is
|
||
|
still not a supported platform, but it is no longer automatically
|
||
|
disabled. See
|
||
|
https://github.com/open-mpi/ompi/issues/4349#issuecomment-374970982
|
||
|
for more information.
|
||
|
- The output-filename option for mpirun is now converted to an
|
||
|
absolute path before being passed to other nodes.
|
||
|
- Add monitoring component for PML, OSC, and COLL to track data
|
||
|
movement of MPI applications. See
|
||
|
ompi/mca/commmon/monitoring/HowTo_pml_monitoring.tex for more
|
||
|
information about the monitoring framework.
|
||
|
- Add support for communicator assertions: mpi_assert_no_any_tag,
|
||
|
mpi_assert_no_any_source, mpi_assert_exact_length, and
|
||
|
mpi_assert_allow_overtaking.
|
||
|
- Update PMIx to version 2.1.1.
|
||
|
- Update hwloc to 1.11.7.
|
||
|
- Many one-sided behavior fixes.
|
||
|
- Improved performance for Reduce and Allreduce using Rabenseifner's algorithm.
|
||
|
- Revamped mpirun --help output to make it a bit more manageable.
|
||
|
- Portals4 MTL improvements: Fix race condition in rendezvous protocol and
|
||
|
retry logic.
|
||
|
- UCX OSC: initial implementation.
|
||
|
- UCX PML improvements: add multi-threading support.
|
||
|
- Yalla PML improvements: Fix error with irregular contiguous datatypes.
|
||
|
- Openib BTL: disable XRC support by default.
|
||
|
- TCP BTL: Add check to detect and ignore connections from processes
|
||
|
that aren't MPI (such as IDS probes) and verify that source and
|
||
|
destination are using the same version of Open MPI, fix issue with very
|
||
|
large message transfer.
|
||
|
- ompi_info parsable output now escapes double quotes in values, and
|
||
|
also quotes values can contains colons. Thanks to Lev Givon for the
|
||
|
suggestion.
|
||
|
- CUDA-aware support can now handle GPUs within a node that do not
|
||
|
support CUDA IPC. Earlier versions would get error and abort.
|
||
|
- Add a mca parameter ras_base_launch_orted_on_hn to allow for launching
|
||
|
MPI processes on the same node where mpirun is executing using a separate
|
||
|
orte daemon, rather than the mpirun process. This may be useful to set to
|
||
|
true when using SLURM, as it improves interoperability with SLURM's signal
|
||
|
propagation tools. By default it is set to false, except for Cray XC systems.
|
||
|
- Remove LoadLeveler RAS support.
|
||
|
- Remove IB XRC support from the OpenIB BTL due to lack of support.
|
||
|
- Add functionality for IBM s390 platforms. Note that regular
|
||
|
regression testing does not occur on the s390 and it is not
|
||
|
considered a supported platform.
|
||
|
- Remove support for big endian PowerPC.
|
||
|
- Remove support for XL compilers older than v13.1.
|
||
|
- Remove support for atomic operations using MacOS atomics library.
|
||
|
|
||
|
3.0.6 -- March, 2020
|
||
|
--------------------
|
||
|
|
||
|
- Fix one-sided shared memory window configuration bug.
|
||
|
- Fix support for PGI'18 compiler.
|
||
|
- Fix run-time linker issues with OMPIO on newer Linux distros.
|
||
|
- Allow the user to override modulefile_path in the Open MPI SRPM,
|
||
|
even if install_in_opt is set to 1.
|
||
|
- Properly detect ConnectX-6 HCAs in the openib BTL.
|
||
|
- Fix segfault in the MTL/OFI initialization for large jobs.
|
||
|
- Fix various portals4 control flow bugs.
|
||
|
- Fix communications ordering for alltoall and Cartesian neighborhood
|
||
|
collectives.
|
||
|
- Fix an infinite recursion crash in the memory patcher on systems
|
||
|
with glibc v2.26 or later (e.g., Ubuntu 18.04) when using certain
|
||
|
OS-bypass interconnects.
|
||
|
|
||
|
3.0.5 -- November, 2019
|
||
|
-----------------------
|
||
|
|
||
|
- Fix OMPIO issue limiting file reads/writes to 2GB. Thanks to
|
||
|
Richard Warren for reporting the issue.
|
||
|
- At run time, automatically disable Linux cross-memory attach (CMA)
|
||
|
for vader BTL (shared memory) copies when running in user namespaces
|
||
|
(i.e., containers). Many thanks to Adrian Reber for raising the
|
||
|
issue and providing the fix.
|
||
|
- Sending very large MPI messages using the ofi MTL will fail with
|
||
|
some of the underlying Libfabric transports (e.g., PSM2 with
|
||
|
messages >=4GB, verbs with messages >=2GB). Prior version of Open
|
||
|
MPI failed silently; this version of Open MPI invokes the
|
||
|
appropriate MPI error handler upon failure. See
|
||
|
https://github.com/open-mpi/ompi/issues/7058 for more details.
|
||
|
Thanks to Emmanuel Thomé for raising the issue.
|
||
|
- Fix case where 0-extent datatypes might be eliminated during
|
||
|
optimization. Thanks to Github user @tjahns for raising the issue.
|
||
|
- Ensure that the MPIR_Breakpoint symbol is not optimized out on
|
||
|
problematic platforms.
|
||
|
- Fix OMPIO offset calculations with SEEK_END and SEEK_CUR in
|
||
|
MPI_FILE_GET_POSITION. Thanks to Wei-keng Liao for raising the
|
||
|
issue.
|
||
|
- Fix corner case for datatype extent computations. Thanks to David
|
||
|
Dickenson for raising the issue.
|
||
|
- Fix MPI buffered sends with the "cm" PML.
|
||
|
- Update to PMIx v2.2.3.
|
||
|
- Fix ssh-based tree-based spawning at scale. Many thanks to Github
|
||
|
user @zrss for the report and diagnosis.
|
||
|
- Fix the Open MPI RPM spec file to not abort when grep fails. Thanks
|
||
|
to Daniel Letai for bringing this to our attention.
|
||
|
- Handle new SLURM CLI options (SLURM 19 deprecated some options that
|
||
|
Open MPI was using). Thanks to Jordan Hayes for the report and the
|
||
|
initial fix.
|
||
|
- OMPI: fix division by zero with an empty file view.
|
||
|
- Also handle shmat()/shmdt() memory patching with OS-bypass networks.
|
||
|
- Add support for unwinding info to all files that are present in the
|
||
|
stack starting from MPI_Init, which is helpful with parallel
|
||
|
debuggers. Thanks to James Clark for the report and initial fix.
|
||
|
- Fixed inadvertant use of bitwise operators in the MPI C++ bindings
|
||
|
header files. Thanks to Bert Wesarg for the report and the fix.
|
||
|
- Added configure option --disable-wrappers-runpath (alongside the
|
||
|
already-existing --disable-wrappers-rpath option) to prevent Open
|
||
|
MPI's configure script from automatically adding runpath CLI options
|
||
|
to the wrapper compilers.
|
||
|
|
||
|
3.0.4 -- April, 2019
|
||
|
--------------------
|
||
|
|
||
|
- Fix compile error when configured with --enable-mpi-java and
|
||
|
--with-devel-headers. Thanks to @g-raffy for reporting the issue.
|
||
|
- Fix possible floating point rounding and division issues in OMPIO
|
||
|
which led to crashes and/or data corruption with very large data.
|
||
|
Thanks to Axel Huebl and René Widera for identifing the issue,
|
||
|
supplying and testing the fix (** also appeared: v3.0.4).
|
||
|
- Use static_cast<> in mpi.h where appropriate. Thanks to @shadow-fx
|
||
|
for identifying the issue.
|
||
|
- Fix datatype issue with RMA accumulate. Thanks to Jeff Hammond for
|
||
|
raising the issue.
|
||
|
- Fix RMA accumulate of non-predefined datatypes with predefined
|
||
|
operators. Thanks to Jeff Hammond for raising the issue.
|
||
|
- Fix race condition when closing open file descriptors when launching
|
||
|
MPI processes. Thanks to Jason Williams for identifying the issue and
|
||
|
supplying the fix.
|
||
|
- Fix Valgrind warnings for some MPI_TYPE_CREATE_* functions. Thanks
|
||
|
to Risto Toijala for identifying the issue and supplying the fix.
|
||
|
- Fix MPI_TYPE_CREATE_F90_{REAL,COMPLEX} for r=38 and r=308.
|
||
|
- Fix assembly issues with old versions of gcc (<6.0.0) that affected
|
||
|
the stability of shared memory communications (e.g., with the vader
|
||
|
BTL).
|
||
|
- Fix the OFI MTL handling of MPI_ANY_SOURCE.
|
||
|
- Fix noisy errors in the openib BTL with regards to
|
||
|
ibv_exp_query_device(). Thanks to Angel Beltre and others who
|
||
|
reported the issue.
|
||
|
|
||
|
3.0.3 -- October, 2018
|
||
|
----------------------
|
||
|
|
||
|
- Fix race condition in MPI_THREAD_MULTIPLE support of non-blocking
|
||
|
send/receive path.
|
||
|
- Fix error handling SIGCHLD forwarding.
|
||
|
- Add support for CHARACTER and LOGICAL Fortran datatypes for MPI_SIZEOF.
|
||
|
- Fix compile error when using OpenJDK 11 to compile the Java bindings.
|
||
|
- Fix crash when using a hostfile with a 'user@host' line.
|
||
|
- Numerous Fortran '08 interface fixes.
|
||
|
- TCP BTL error message fixes.
|
||
|
- OFI MTL now will use any provider other than shm, sockets, tcp, udp, or
|
||
|
rstream, rather than only supporting gni, psm, and psm2.
|
||
|
- Disable async receive of CUDA buffers by default, fixing a hang
|
||
|
on large transfers.
|
||
|
- Support the BCM57XXX and BCM58XXX Broadcomm adapters.
|
||
|
- Fix minmax datatype support in ROMIO.
|
||
|
- Bug fixes in vader shared memory transport.
|
||
|
- Support very large buffers with MPI_TYPE_VECTOR.
|
||
|
- Fix hang when launching with mpirun on Cray systems.
|
||
|
- Bug fixes in OFI MTL.
|
||
|
- Assorted Portals 4.0 bug fixes.
|
||
|
- Fix for possible data corruption in MPI_BSEND.
|
||
|
- Move shared memory file for vader btl into /dev/shm on Linux.
|
||
|
- Fix for MPI_ISCATTER/MPI_ISCATTERV Fortran interfaces with MPI_IN_PLACE.
|
||
|
- Upgrade PMIx to v2.1.4.
|
||
|
- Fix for Power9 built-in atomics.
|
||
|
- Numerous One-sided bug fixes.
|
||
|
- Fix for race condition in uGNI BTL.
|
||
|
- Improve handling of large number of interfaces with TCP BTL.
|
||
|
- Numerous UCX bug fixes.
|
||
|
- Add support for QLogic and Broadcom Cumulus RoCE HCAs to Open IB BTL.
|
||
|
- Add patcher support for aarch64.
|
||
|
- Fix hang on Power and ARM when Open MPI was built with low compiler
|
||
|
optimization settings.
|
||
|
|
||
|
3.0.2 -- June, 2018
|
||
|
-------------------
|
||
|
|
||
|
- Disable osc/pt2pt when using MPI_THREAD_MULTIPLE due to numerous
|
||
|
race conditions in the component.
|
||
|
- Fix dummy variable names for the mpi and mpi_f08 Fortran bindings to
|
||
|
match the MPI standard. This may break applications which use
|
||
|
name-based parameters in Fortran which used our internal names
|
||
|
rather than those documented in the MPI standard.
|
||
|
- Fixed MPI_SIZEOF in the "mpi" Fortran module for the NAG compiler.
|
||
|
- Fix RMA function signatures for use-mpi-f08 bindings to have the
|
||
|
asynchonous property on all buffers.
|
||
|
- Fix Fortran MPI_COMM_SPAWN_MULTIPLE to properly follow the count
|
||
|
length argument when parsing the array_of_commands variable.
|
||
|
- Revamp Java detection to properly handle new Java versions which do
|
||
|
not provide a javah wrapper.
|
||
|
- Improved configure logic for finding the UCX library.
|
||
|
- Add support for HDR InfiniBand link speeds.
|
||
|
- Disable the POWER 7/BE block in configure. Note that POWER 7/BE is
|
||
|
still not a supported platform, but it is no longer automatically
|
||
|
disabled. See
|
||
|
https://github.com/open-mpi/ompi/issues/4349#issuecomment-374970982
|
||
|
for more information.
|
||
|
|
||
|
3.0.1 -- March, 2018
|
||
|
----------------------
|
||
|
|
||
|
- Fix ability to attach parallel debuggers to MPI processes.
|
||
|
- Fix a number of issues in MPI I/O found by the HDF5 test suite.
|
||
|
- Fix (extremely) large message transfers with shared memory.
|
||
|
- Fix out of sequence bug in multi-NIC configurations.
|
||
|
- Fix stdin redirection bug that could result in lost input.
|
||
|
- Disable the LSF launcher if CSM is detected.
|
||
|
- Plug a memory leak in MPI_Mem_free(). Thanks to Philip Blakely for reporting.
|
||
|
- Fix the tree spawn operation when the number of nodes is larger than the radix.
|
||
|
Thanks to Carlos Eduardo de Andrade for reporting.
|
||
|
- Fix Fortran 2008 macro in MPI extensions. Thanks to Nathan T. Weeks for
|
||
|
reporting.
|
||
|
- Add UCX to list of interfaces that OpenSHMEM will use by default.
|
||
|
- Add --{enable|disable}-show-load-errors-by-default to control
|
||
|
default behavior of the load errors option.
|
||
|
- OFI MTL improvements: handle empty completion queues properly, fix
|
||
|
incorrect error message around fi_getinfo(), use default progress
|
||
|
option for provider by default, Add support for reading multiple
|
||
|
CQ events in ofi_progress.
|
||
|
- PSM2 MTL improvements: Allow use of GPU buffers, thread fixes.
|
||
|
- Numerous corrections to memchecker behavior.
|
||
|
- Add a mca parameter ras_base_launch_orted_on_hn to allow for launching
|
||
|
MPI processes on the same node where mpirun is executing using a separate
|
||
|
orte daemon, rather than the mpirun process. This may be useful to set to
|
||
|
true when using SLURM, as it improves interoperability with SLURM's signal
|
||
|
propagation tools. By default it is set to false, except for Cray XC systems.
|
||
|
- Fix a problem reported on the mailing separately by Kevin McGrattan and Stephen
|
||
|
Guzik about consistency issues on NFS file systems when using OMPIO. This fix
|
||
|
also introduces a new mca parameter fs_ufs_lock_algorithm which allows to
|
||
|
control the locking algorithm used by ompio for read/write operations. By
|
||
|
default, ompio does not perfom locking on local UNIX file systems, locks the
|
||
|
entire file per operation on NFS file systems, and selective byte-range
|
||
|
locking on other distributed file systems.
|
||
|
- Add an mca parameter pmix_server_usock_connections to allow mpirun to
|
||
|
support applications statically built against the Open MPI v2.x release,
|
||
|
or installed in a container along with the Open MPI v2.x libraries. It is
|
||
|
set to false by default.
|
||
|
|
||
|
3.0.0 -- September, 2017
|
||
|
------------------------
|
||
|
|
||
|
Major new features:
|
||
|
|
||
|
- Use UCX allocator for OSHMEM symmetric heap allocations to optimize intra-node
|
||
|
data transfers. UCX SPML only.
|
||
|
- Use UCX multi-threaded API in the UCX PML. Requires UCX 1.0 or later.
|
||
|
- Added support for Flux PMI
|
||
|
- Update embedded PMIx to version 2.1.0
|
||
|
- Update embedded hwloc to version 1.11.7
|
||
|
|
||
|
Changes in behavior compared to prior versions:
|
||
|
|
||
|
- Per Open MPI's versioning scheme (see the README), increasing the
|
||
|
major version number to 3 indicates that this version is not
|
||
|
ABI-compatible with prior versions of Open MPI. In addition, there may
|
||
|
be differences in MCA parameter names and defaults from previous releases.
|
||
|
Command line options for mpirun and other commands may also differ from
|
||
|
previous versions. You will need to recompile MPI and OpenSHMEM applications
|
||
|
to work with this version of Open MPI.
|
||
|
- With this release, Open MPI supports MPI_THREAD_MULTIPLE by default.
|
||
|
- New configure options have been added to specify the locations of libnl
|
||
|
and zlib.
|
||
|
- A new configure option has been added to request Flux PMI support.
|
||
|
- The help menu for mpirun and related commands is now context based.
|
||
|
"mpirun --help compatibility" generates the help menu in the same format
|
||
|
as previous releases.
|
||
|
|
||
|
Removed legacy support:
|
||
|
- AIX is no longer supported.
|
||
|
- Loadlever is no longer supported.
|
||
|
- OpenSHMEM currently supports the UCX and MXM transports via the ucx and ikrit
|
||
|
SPMLs respectively.
|
||
|
- Remove IB XRC support from the OpenIB BTL due to lack of support.
|
||
|
- Remove support for big endian PowerPC.
|
||
|
- Remove support for XL compilers older than v13.1
|
||
|
|
||
|
Known issues:
|
||
|
|
||
|
- MPI_Connect/accept between applications started by different mpirun
|
||
|
commands will fail, even if ompi-server is running.
|
||
|
|
||
|
2.1.5 -- August 2018
|
||
|
--------------------
|
||
|
|
||
|
- A subtle race condition bug was discovered in the "vader" BTL
|
||
|
(shared memory communications) that, in rare instances, can cause
|
||
|
MPI processes to crash or incorrectly classify (or effectively drop)
|
||
|
an MPI message sent via shared memory. If you are using the "ob1"
|
||
|
PML with "vader" for shared memory communication (note that vader is
|
||
|
the default for shared memory communication with ob1), you need to
|
||
|
upgrade to v2.1.5 to fix this issue. You may also upgrade to the
|
||
|
following versions to fix this issue:
|
||
|
- Open MPI v3.0.1 (released March, 2018) or later in the v3.0.x
|
||
|
series
|
||
|
- Open MPI v3.1.2 (expected end of August, 2018) or later
|
||
|
- A link issue was fixed when the UCX library was not located in the
|
||
|
linker-default search paths.
|
||
|
|
||
|
2.1.4 -- August, 2018
|
||
|
---------------------
|
||
|
|
||
|
Bug fixes/minor improvements:
|
||
|
- Disable the POWER 7/BE block in configure. Note that POWER 7/BE is
|
||
|
still not a supported platform, but it is no longer automatically
|
||
|
disabled. See
|
||
|
https://github.com/open-mpi/ompi/issues/4349#issuecomment-374970982
|
||
|
for more information.
|
||
|
- Fix bug with request-based one-sided MPI operations when using the
|
||
|
"rdma" component.
|
||
|
- Fix issue with large data structure in the TCP BTL causing problems
|
||
|
in some environments. Thanks to @lgarithm for reporting the issue.
|
||
|
- Minor Cygwin build fixes.
|
||
|
- Minor fixes for the openib BTL:
|
||
|
- Support for the QLogic RoCE HCA
|
||
|
- Support for the Boradcom Cumulus RoCE HCA
|
||
|
- Enable support for HDR link speeds
|
||
|
- Fix MPI_FINALIZED hang if invoked from an attribute destructor
|
||
|
during the MPI_COMM_SELF destruction in MPI_FINALIZE. Thanks to
|
||
|
@AndrewGaspar for reporting the issue.
|
||
|
- Java fixes:
|
||
|
- Modernize Java framework detection, especially on OS X/MacOS.
|
||
|
Thanks to Bryce Glover for reporting and submitting the fixes.
|
||
|
- Prefer "javac -h" to "javah" to support newer Java frameworks.
|
||
|
- Fortran fixes:
|
||
|
- Use conformant dummy parameter names for Fortran bindings. Thanks
|
||
|
to Themos Tsikas for reporting and submitting the fixes.
|
||
|
- Build the MPI_SIZEOF() interfaces in the "TKR"-style "mpi" module
|
||
|
whenever possible. Thanks to Themos Tsikas for reporting the
|
||
|
issue.
|
||
|
- Fix array of argv handling for the Fortran bindings of
|
||
|
MPI_COMM_SPAWN_MULTIPLE (and its associated man page).
|
||
|
- Make NAG Fortran compiler support more robust in configure.
|
||
|
- Disable the "pt2pt" one-sided MPI component when MPI_THREAD_MULTIPLE
|
||
|
is used. This component is simply not safe in MPI_THREAD_MULTIPLE
|
||
|
scenarios, and will not be fixed in the v2.1.x series.
|
||
|
- Make the "external" hwloc component fail gracefully if it is tries
|
||
|
to use an hwloc v2.x.y installation. hwloc v2.x.y will not be
|
||
|
supported in the Open MPI v2.1.x series.
|
||
|
- Fix "vader" shared memory support for messages larger than 2GB.
|
||
|
Thanks to Heiko Bauke for the bug report.
|
||
|
- Configure fixes for external PMI directory detection. Thanks to
|
||
|
Davide Vanzo for the report.
|
||
|
|
||
|
2.1.3 -- March, 2018
|
||
|
--------------------
|
||
|
|
||
|
Bug fixes/minor improvements:
|
||
|
- Update internal PMIx version to 1.2.5.
|
||
|
- Fix a problem with ompi_info reporting using param option.
|
||
|
Thanks to Alexander Pozdneev for reporting.
|
||
|
- Correct PMPI_Aint_{add|diff} to be functions (not subroutines)
|
||
|
in the Fortran mpi_f08 module.
|
||
|
- Fix a problem when doing MPI I/O using data types with large
|
||
|
extents in conjunction with MPI_TYPE_CREATE_SUBARRAY. Thanks to
|
||
|
Christopher Brady for reporting.
|
||
|
- Fix a problem when opening many files using MPI_FILE_OPEN.
|
||
|
Thanks to William Dawson for reporting.
|
||
|
- Fix a problem with debuggers failing to attach to a running job.
|
||
|
Thanks to Dirk Schubert for reporting.
|
||
|
- Fix a problem when using madvise and the OpenIB BTL. Thanks to
|
||
|
Timo Bingmann for reporting.
|
||
|
- Fix a problem in the Vader BTL that resulted in failures of
|
||
|
IMB under certain circumstances. Thanks to Nicolas Morey-
|
||
|
Chaisemartin for reporting.
|
||
|
- Fix a problem preventing Open MPI from working under Cygwin.
|
||
|
Thanks to Marco Atzeri for reporting.
|
||
|
- Reduce some verbosity being emitted by the USNIC BTL under certain
|
||
|
circumstances. Thanks to Peter Forai for reporting.
|
||
|
- Fix a problem with misdirection of SIGKILL. Thanks to Michael Fern
|
||
|
for reporting.
|
||
|
- Replace use of posix_memalign with malloc for small allocations. Thanks
|
||
|
to Ben Menaude for reporting.
|
||
|
- Fix a problem with Open MPI's out of band TCP network for file descriptors
|
||
|
greater than 32767. Thanks to Wojtek Wasko for reporting and fixing.
|
||
|
- Plug a memory leak in MPI_Mem_free(). Thanks to Philip Blakely for reporting.
|
||
|
|
||
|
2.1.2 -- September, 2017
|
||
|
------------------------
|
||
|
|
||
|
Bug fixes/minor improvements:
|
||
|
- Update internal PMIx version to 1.2.3.
|
||
|
- Fix some problems when using the NAG Fortran compiler to build Open MPI
|
||
|
and when using the compiler wrappers. Thanks to Neil Carlson for reporting.
|
||
|
- Fix a compilation problem with the SM BTL. Thanks to Paul Hargrove for
|
||
|
reporting.
|
||
|
- Fix a problem with MPI_IALLTOALLW when using zero-length messages.
|
||
|
Thanks to Dahai Guo for reporting.
|
||
|
- Fix a problem with C11 generic type interface for SHMEM_G. Thanks
|
||
|
to Nick Park for reporting.
|
||
|
- Switch to using the lustreapi.h include file when building Open MPI
|
||
|
with Lustre support.
|
||
|
- Fix a problem in the OB1 PML that led to hangs with OSU collective tests.
|
||
|
- Fix a progression issue with MPI_WIN_FLUSH_LOCAL. Thanks to
|
||
|
Joseph Schuchart for reporting.
|
||
|
- Fix an issue with recent versions of PBSPro requiring libcrypto.
|
||
|
Thanks to Petr Hanousek for reporting.
|
||
|
- Fix a problem when using MPI_ANY_SOURCE with MPI_SENDRECV.
|
||
|
- Fix an issue that prevented signals from being propagated to ORTE
|
||
|
daemons.
|
||
|
- Ensure that signals are forwarded from ORTE daemons to all processes
|
||
|
in the process group created by the daemons. Thanks to Ted Sussman
|
||
|
for reporting.
|
||
|
- Fix a problem with launching a job under a debugger. Thanks to
|
||
|
Greg Lee for reporting.
|
||
|
- Fix a problem with Open MPI native I/O MPI_FILE_OPEN when using
|
||
|
a communicator having an associated topology. Thanks to
|
||
|
Wei-keng Liao for reporting.
|
||
|
- Fix an issue when using MPI_ACCUMULATE with derived datatypes.
|
||
|
- Fix a problem with Fortran bindings that led to compilation errors
|
||
|
for user defined reduction operations. Thanks to Nathan Weeks for
|
||
|
reporting.
|
||
|
- Fix ROMIO issues with large writes/reads when using NFS file systems.
|
||
|
- Fix definition of Fortran MPI_ARGV_NULL and MPI_ARGVS_NULL.
|
||
|
- Enable use of the head node of a SLURM allocation on Cray XC systems.
|
||
|
- Fix a problem with synchronous sends when using the UCX PML.
|
||
|
- Use default socket buffer size to improve TCP BTL performance.
|
||
|
- Add a mca parameter ras_base_launch_orted_on_hn to allow for launching
|
||
|
MPI processes on the same node where mpirun is executing using a separate
|
||
|
orte daemon, rather than the mpirun process. This may be useful to set to
|
||
|
true when using SLURM, as it improves interoperability with SLURM's signal
|
||
|
propagation tools. By default it is set to false, except for Cray XC systems.
|
||
|
- Fix --without-lsf when lsf is installed in the default search path.
|
||
|
- Remove support for big endian PowerPC.
|
||
|
- Remove support for XL compilers older than v13.1
|
||
|
- Remove IB XRC support from the OpenIB BTL due to loss of maintainer.
|
||
|
|
||
|
2.1.1 -- April, 2017
|
||
|
--------------------
|
||
|
|
||
|
Bug fixes/minor improvements:
|
||
|
|
||
|
- Fix a problem with one of Open MPI's fifo data structures which led to
|
||
|
hangs in a make check test. Thanks to Nicolas Morey-Chaisemartin for
|
||
|
reporting.
|
||
|
- Add missing MPI_AINT_ADD/MPI_AINT_DIFF function definitions to mpif.h.
|
||
|
Thanks to Aboorva Devarajan for reporting.
|
||
|
- Fix the error return from MPI_WIN_LOCK when rank argument is invalid.
|
||
|
Thanks to Jeff Hammond for reporting and fixing this issue.
|
||
|
- Fix a problem with mpirun/orterun when started under a debugger. Thanks
|
||
|
to Gregory Leff for reporting.
|
||
|
- Add configury option to disable use of CMA by the vader BTL. Thanks
|
||
|
to Sascha Hunold for reporting.
|
||
|
- Add configury check for MPI_DOUBLE_COMPLEX datatype support.
|
||
|
Thanks to Alexander Klein for reporting.
|
||
|
- Fix memory allocated by MPI_WIN_ALLOCATE_SHARED to
|
||
|
be 64 bit aligned. Thanks to Joseph Schuchart for
|
||
|
reporting.
|
||
|
- Update MPI_WTICK man page to reflect possibly higher
|
||
|
resolution than 10e-6. Thanks to Mark Dixon for
|
||
|
reporting
|
||
|
- Add missing MPI_T_PVAR_SESSION_NULL definition to mpi.h
|
||
|
include file. Thanks to Omri Mor for this contribution.
|
||
|
- Enhance the Open MPI spec file to install modulefile in /opt
|
||
|
if installed in a non-default location. Thanks to Kevin
|
||
|
Buckley for reporting and supplying a fix.
|
||
|
- Fix a problem with conflicting PMI symbols when linking statically.
|
||
|
Thanks to Kilian Cavalotti for reporting.
|
||
|
|
||
|
Known issues (to be addressed in v2.1.2):
|
||
|
|
||
|
- See the list of fixes slated for v2.1.2 here:
|
||
|
https://github.com/open-mpi/ompi/milestone/28
|
||
|
|
||
|
2.1.0 -- March, 2017
|
||
|
--------------------
|
||
|
|
||
|
Major new features:
|
||
|
|
||
|
- The main focus of the Open MPI v2.1.0 release was to update to PMIx
|
||
|
v1.2.1. When using PMIx (e.g., via mpirun-based launches, or via
|
||
|
direct launches with recent versions of popular resource managers),
|
||
|
launch time scalability is improved, and the run time memory
|
||
|
footprint is greatly decreased when launching large numbers of MPI /
|
||
|
OpenSHMEM processes.
|
||
|
- Update OpenSHMEM API conformance to v1.3.
|
||
|
- The usnic BTL now supports MPI_THREAD_MULTIPLE.
|
||
|
- General/overall performance improvements to MPI_THREAD_MULTIPLE.
|
||
|
- Add a summary message at the bottom of configure that tells you many
|
||
|
of the configuration options specified and/or discovered by Open
|
||
|
MPI.
|
||
|
|
||
|
Changes in behavior compared to prior versions:
|
||
|
|
||
|
- None.
|
||
|
|
||
|
Removed legacy support:
|
||
|
|
||
|
- The ptmalloc2 hooks have been removed from the Open MPI code base.
|
||
|
This is not really a user-noticable change; it is only mentioned
|
||
|
here because there was much rejoycing in the Open MPI developer
|
||
|
community.
|
||
|
|
||
|
Bug fixes/minor improvements:
|
||
|
|
||
|
- New MCA parameters:
|
||
|
- iof_base_redirect_app_stderr_to_stdout: as its name implies, it
|
||
|
combines MPI / OpenSHMEM applications' stderr into its stdout
|
||
|
stream.
|
||
|
- opal_event_include: allow the user to specify which FD selection
|
||
|
mechanism is used by the underlying event engine.
|
||
|
- opal_stacktrace_output: indicate where stacktraces should be sent
|
||
|
upon MPI / OpenSHMEM process crashes ("none", "stdout", "stderr",
|
||
|
"file:filename").
|
||
|
- orte_timeout_for_stack_trace: number of seconds to wait for stack
|
||
|
traces to be reported (or <=0 to wait forever).
|
||
|
- mtl_ofi_control_prog_type/mtl_ofi_data_prog_type: specify libfabric
|
||
|
progress model to be used for control and data.
|
||
|
- Fix MPI_WTICK regression where the time reported may be inaccurate
|
||
|
on systems with processor frequency scalaing enabled.
|
||
|
- Fix regression that lowered the memory maximum message bandwidth for
|
||
|
large messages on some BTL network transports, such as openib, sm,
|
||
|
and vader.
|
||
|
- Fix a name collision in the shared file pointer MPI IO file locking
|
||
|
scheme. Thanks to Nicolas Joly for reporting the issue.
|
||
|
- Fix datatype extent/offset errors in MPI_PUT and MPI_RACCUMULATE
|
||
|
when using the Portals 4 one-sided component.
|
||
|
- Add support for non-contiguous datatypes to the Portals 4 one-sided
|
||
|
component.
|
||
|
- Various updates for the UCX PML.
|
||
|
- Updates to the following man pages:
|
||
|
- mpirun(1)
|
||
|
- MPI_COMM_CONNECT(3)
|
||
|
- MPI_WIN_GET_NAME(3). Thanks to Nicolas Joly for reporting the
|
||
|
typo.
|
||
|
- MPI_INFO_GET_[NKEYS|NTHKEY](3). Thanks to Nicolas Joly for
|
||
|
reporting the typo.
|
||
|
- Fixed a problem in the TCP BTL when using MPI_THREAD_MULTIPLE.
|
||
|
Thanks to Evgueni Petrov for reporting.
|
||
|
- Fixed external32 representation in the romio314 module. Note that
|
||
|
for now, external32 representation is not correctly supported by the
|
||
|
ompio module. Thanks to Thomas Gastine for bringing this to our
|
||
|
attention.
|
||
|
- Add note how to disable a warning message about when a high-speed
|
||
|
MPI transport is not found. Thanks to Susan Schwarz for reporting
|
||
|
the issue.
|
||
|
- Ensure that sending SIGINT when using the rsh/ssh launcher does not
|
||
|
orphan children nodes in the launch tree.
|
||
|
- Fix the help message when showing deprecated MCA param names to show
|
||
|
the correct (i.e., deprecated) name.
|
||
|
- Enable support for the openib BTL to use multiple different
|
||
|
InfiniBand subnets.
|
||
|
- Fix a minor error in MPI_AINT_DIFF.
|
||
|
- Fix bugs with MPI_IN_PLACE handling in:
|
||
|
- MPI_ALLGATHER[V]
|
||
|
- MPI_[I][GATHER|SCATTER][V]
|
||
|
- MPI_IREDUCE[_SCATTER]
|
||
|
- Thanks to all the users who helped diagnose these issues.
|
||
|
- Allow qrsh to tree spawn (if the back-end system supports it).
|
||
|
- Fix MPI_T_PVAR_GET_INDEX to return the correct index.
|
||
|
- Correctly position the shared file pointer in append mode in the
|
||
|
OMPIO component.
|
||
|
- Add some deprecated names into shmem.h for backwards compatibility
|
||
|
with legacy codes.
|
||
|
- Fix MPI_MODE_NOCHECK support.
|
||
|
- Fix a regression in PowerPC atomics support. Thanks to Orion
|
||
|
Poplawski for reporting the issue.
|
||
|
- Fixes for assembly code with aggressively-optimized compilers on
|
||
|
x86_64/AMD64 platforms.
|
||
|
- Fix one more place where configure was mangling custom CFLAGS.
|
||
|
Thanks to Phil Tooley (@Telemin) for reporting the issue.
|
||
|
- Better handle builds with external installations of hwloc.
|
||
|
- Fixed a hang with MPI_PUT and MPI_WIN_LOCK_ALL.
|
||
|
- Fixed a bug when using MPI_GET on non-contiguous datatypes and
|
||
|
MPI_LOCK/MPI_UNLOCK.
|
||
|
- Fixed a bug when using POST/START/COMPLETE/WAIT after a fence.
|
||
|
- Fix configure portability by cleaning up a few uses of "==" with
|
||
|
"test". Thanks to Kevin Buckley for pointing out the issue.
|
||
|
- Fix bug when using darrays with lib and extent of darray datatypes.
|
||
|
- Updates to make Open MPI binary builds more bit-for-bit
|
||
|
reproducable. Thanks to Alastair McKinstry for the suggestion.
|
||
|
- Fix issues regarding persistent request handling.
|
||
|
- Ensure that shmemx.h is a standalone OpenSHMEM header file. Thanks
|
||
|
to Nick Park (@nspark) for the report.
|
||
|
- Ensure that we always send SIGTERM prior to SIGKILL. Thanks to Noel
|
||
|
Rycroft for the report.
|
||
|
- Added ConnectX-5 and Chelsio T6 device defaults for the openib BTL.
|
||
|
- OpenSHMEM no longer supports MXM less than v2.0.
|
||
|
- Plug a memory leak in ompi_osc_sm_free. Thanks to Joseph Schuchart
|
||
|
for the report.
|
||
|
- The "self" BTL now uses less memory.
|
||
|
- The vader BTL is now more efficient in terms of memory usage when
|
||
|
using XPMEM.
|
||
|
- Removed the --enable-openib-failover configure option. This is not
|
||
|
considered backwards-incompatible because this option was stale and
|
||
|
had long-since stopped working, anyway.
|
||
|
- Allow jobs launched under Cray aprun to use hyperthreads if
|
||
|
opal_hwloc_base_hwthreads_as_cpus MCA parameter is set.
|
||
|
- Add support for 32-bit and floating point Cray Aries atomic
|
||
|
operations.
|
||
|
- Add support for network AMOs for MPI_ACCUMULATE, MPI_FETCH_AND_OP,
|
||
|
and MPI_COMPARE_AND_SWAP if the "ompi_single_intrinsic" info key is
|
||
|
set on the window or the "acc_single_intrinsic" MCA param is set.
|
||
|
- Automatically disqualify RDMA CM support in the openib BTL if
|
||
|
MPI_THREAD_MULTIPLE is used.
|
||
|
- Make configure smarter/better about auto-detecting Linux CMA
|
||
|
support.
|
||
|
- Improve the scalability of MPI_COMM_SPLIT_TYPE.
|
||
|
- Fix the mixing of C99 and C++ header files with the MPI C++
|
||
|
bindings. Thanks to Alastair McKinstry for the bug report.
|
||
|
- Add support for ARM v8.
|
||
|
- Several MCA parameters now directly support MPI_T enumerator
|
||
|
semantics (i.e., they accept a limited set of values -- e.g., MCA
|
||
|
parameters that accept boolean values).
|
||
|
- Added --with-libmpi-name=STRING configure option for vendor releases
|
||
|
of Open MPI. See the README for more detail.
|
||
|
- Fix a problem with Open MPI's internal memory checker. Thanks to Yvan
|
||
|
Fournier for reporting.
|
||
|
- Fix a multi-threaded issue with MPI_WAIT. Thanks to Pascal Deveze for
|
||
|
reporting.
|
||
|
|
||
|
Known issues (to be addressed in v2.1.1):
|
||
|
|
||
|
- See the list of fixes slated for v2.1.1 here:
|
||
|
https://github.com/open-mpi/ompi/milestone/26
|
||
|
|
||
|
2.0.4 -- November, 2017
|
||
|
-----------------------
|
||
|
|
||
|
Bug fixes/minor improvements:
|
||
|
- Fix an issue with visibility of functions defined in the built-in PMIx.
|
||
|
Thanks to Siegmar Gross for reporting this issue.
|
||
|
- Add configure check to prevent trying to build this release of
|
||
|
Open MPI with an external hwloc 2.0 or newer release.
|
||
|
- Add ability to specify layered providers for OFI MTL.
|
||
|
- Fix a correctness issue with Open MPI's memory manager code
|
||
|
that could result in corrupted message data. Thanks to
|
||
|
Valentin Petrov for reporting.
|
||
|
- Fix issues encountered when using newer versions of PBS Pro.
|
||
|
Thanks to Petr Hanousek for reporting.
|
||
|
- Fix a problem with MPI_GET when using the vader BTL. Thanks
|
||
|
to Dahai Guo for reporting.
|
||
|
- Fix a problem when using MPI_ANY_SOURCE with MPI_SENDRECV_REPLACE.
|
||
|
Thanks to Dahai Guo for reporting.
|
||
|
- Fix a problem using MPI_FILE_OPEN with a communicator with an
|
||
|
attached cartesian topology. Thanks to Wei-keng Liao for reporting.
|
||
|
- Remove IB XRC support from the OpenIB BTL due to lack of support.
|
||
|
- Remove support for big endian PowerPC.
|
||
|
- Remove support for XL compilers older than v13.1
|
||
|
|
||
|
2.0.3 -- June 2017
|
||
|
------------------
|
||
|
|
||
|
Bug fixes/minor improvements:
|
||
|
|
||
|
- Fix a problem with MPI_IALLTOALLW when zero size messages are present.
|
||
|
Thanks to @mathbird for reporting.
|
||
|
- Add missing MPI_USER_FUNCTION definition to the mpi_f08 module.
|
||
|
Thanks to Nathan Weeks for reporting this issue.
|
||
|
- Fix a problem with MPI_WIN_LOCK not returning an error code when
|
||
|
a negative rank is supplied. Thanks to Jeff Hammond for reporting and
|
||
|
providing a fix.
|
||
|
- Fix a problem with make check that could lead to hangs. Thanks to
|
||
|
Nicolas Morey-Chaisemartin for reporting.
|
||
|
- Resolve a symbol conflict problem with PMI-1 and PMI-2 PMIx components.
|
||
|
Thanks to Kilian Cavalotti for reporting this issue.
|
||
|
- Insure that memory allocations returned from MPI_WIN_ALLOCATE_SHARED are
|
||
|
64 byte aligned. Thanks to Joseph Schuchart for reporting this issue.
|
||
|
- Make use of DOUBLE_COMPLEX, if available, for Fortran bindings. Thanks
|
||
|
to Alexander Klein for reporting this issue.
|
||
|
- Add missing MPI_T_PVAR_SESSION_NULL definition to Open MPI mpi.h include
|
||
|
file. Thanks to Omri Mor for reporting and fixing.
|
||
|
- Fix a problem with use of MPI shared file pointers when accessing
|
||
|
a file from independent jobs. Thanks to Nicolas Joly for reporting
|
||
|
this issue.
|
||
|
- Optimize zero size MPI_IALLTOALL{V,W} with MPI_IN_PLACE. Thanks to
|
||
|
Lisandro Dalcín for the report.
|
||
|
- Fix a ROMIO buffer overflow problem for large transfers when using NFS
|
||
|
filesystems.
|
||
|
- Fix type of MPI_ARGV[S]_NULL which prevented it from being used
|
||
|
properly with MPI_COMM_SPAWN[_MULTIPLE] in the mpi_f08 module.
|
||
|
- Ensure to add proper linker flags to the wrapper compilers for
|
||
|
dynamic libraries on platforms that need it (e.g., RHEL 7.3 and
|
||
|
later).
|
||
|
- Get better performance on TCP-based networks 10Gbps and higher by
|
||
|
using OS defaults for buffer sizing.
|
||
|
- Fix a bug with MPI_[R][GET_]ACCUMULATE when using DARRAY datatypes.
|
||
|
- Fix handling of --with-lustre configure command line argument.
|
||
|
Thanks to Prentice Bisbal and Tim Mattox for reporting the issue.
|
||
|
- Added MPI_AINT_ADD and MPI_AINT_DIFF declarations to mpif.h. Thanks
|
||
|
to Aboorva Devarajan (@AboorvaDevarajan) for the bug report.
|
||
|
- Fix a problem in the TCP BTL when Open MPI is initialized with
|
||
|
MPI_THREAD_MULTIPLE support. Thanks to Evgueni Petro for analyzing and
|
||
|
reporting this issue.
|
||
|
- Fix yalla PML to properly handle underflow errors, and fixed a
|
||
|
memory leak with blocking non-contiguous sends.
|
||
|
- Restored ability to run autogen.pl on official distribution tarballs
|
||
|
(although this is still not recommended for most users!).
|
||
|
- Fix accuracy problems with MPI_WTIME on some systems by always using
|
||
|
either clock_gettime(3) or gettimeofday(3).
|
||
|
- Fix a problem where MPI_WTICK was not returning a higher time resolution
|
||
|
when available. Thanks to Mark Dixon for reporting this issue.
|
||
|
- Restore SGE functionality. Thanks to Kevin Buckley for the initial
|
||
|
report.
|
||
|
- Fix external hwloc compilation issues, and extend support to allow
|
||
|
using external hwloc installations as far back as v1.5.0. Thanks to
|
||
|
Orion Poplawski for raising the issue.
|
||
|
- Added latest Mellanox Connect-X and Chelsio T-6 adapter part IDs to
|
||
|
the openib list of default values.
|
||
|
- Do a better job of cleaning up session directories (e.g., in /tmp).
|
||
|
- Update a help message to indicate how to suppress a warning about
|
||
|
no high performance networks being detected by Open MPI. Thanks to
|
||
|
Susan Schwarz for reporting this issue.
|
||
|
- Fix a problem with mangling of custom CFLAGS when configuring Open MPI.
|
||
|
Thanks to Phil Tooley for reporting.
|
||
|
- Fix some minor memory leaks and remove some unused variables.
|
||
|
Thanks to Joshua Gerrard for reporting.
|
||
|
- Fix MPI_ALLGATHERV bug with MPI_IN_PLACE.
|
||
|
|
||
|
Known issues (to be addressed in v2.0.4):
|
||
|
|
||
|
- See the list of fixes slated for v2.0.4 here:
|
||
|
https://github.com/open-mpi/ompi/milestone/29
|
||
|
|
||
|
2.0.2 -- 26 January 2017
|
||
|
-------------------------
|
||
|
|
||
|
Bug fixes/minor improvements:
|
||
|
|
||
|
- Fix a problem with MPI_FILE_WRITE_SHARED when using MPI_MODE_APPEND and
|
||
|
Open MPI's native MPI-IO implementation. Thanks to Nicolas Joly for
|
||
|
reporting.
|
||
|
- Fix a typo in the MPI_WIN_GET_NAME man page. Thanks to Nicolas Joly
|
||
|
for reporting.
|
||
|
- Fix a race condition with ORTE's session directory setup. Thanks to
|
||
|
@tbj900 for reporting this issue.
|
||
|
- Fix a deadlock issue arising from Open MPI's approach to catching calls to
|
||
|
munmap. Thanks to Paul Hargrove for reporting and helping to analyze this
|
||
|
problem.
|
||
|
- Fix a problem with PPC atomics which caused make check to fail unless builtin
|
||
|
atomics configure option was enabled. Thanks to Orion Poplawski for reporting.
|
||
|
- Fix a problem with use of x86_64 cpuid instruction which led to segmentation
|
||
|
faults when Open MPI was configured with -O3 optimization. Thanks to Mark
|
||
|
Santcroos for reporting this problem.
|
||
|
- Fix a problem when using built in atomics configure options on PPC platforms
|
||
|
when building 32 bit applications. Thanks to Paul Hargrove for reporting.
|
||
|
- Fix a problem with building Open MPI against an external hwloc installation.
|
||
|
Thanks to Orion Poplawski for reporting this issue.
|
||
|
- Remove use of DATE in the message queue version string reported to debuggers to
|
||
|
insure bit-wise reproducibility of binaries. Thanks to Alastair McKinstry
|
||
|
for help in fixing this problem.
|
||
|
- Fix a problem with early exit of a MPI process without calling MPI_FINALIZE
|
||
|
or MPI_ABORT that could lead to job hangs. Thanks to Christof Koehler for
|
||
|
reporting.
|
||
|
- Fix a problem with forwarding of SIGTERM signal from mpirun to MPI processes
|
||
|
in a job. Thanks to Noel Rycroft for reporting this problem
|
||
|
- Plug some memory leaks in MPI_WIN_FREE discovered using Valgrind. Thanks
|
||
|
to Joseph Schuchart for reporting.
|
||
|
- Fix a problems MPI_NEIGHOR_ALLTOALL when using a communicator with an empty topology
|
||
|
graph. Thanks to Daniel Ibanez for reporting.
|
||
|
- Fix a typo in a PMIx component help file. Thanks to @njoly for reporting this.
|
||
|
- Fix a problem with Valgrind false positives when using Open MPI's internal memchecker.
|
||
|
Thanks to Yvan Fournier for reporting.
|
||
|
- Fix a problem with MPI_FILE_DELETE returning MPI_SUCCESS when
|
||
|
deleting a non-existent file. Thanks to Wei-keng Liao for reporting.
|
||
|
- Fix a problem with MPI_IMPROBE that could lead to hangs in subsequent MPI
|
||
|
point to point or collective calls. Thanks to Chris Pattison for reporting.
|
||
|
- Fix a problem when configure Open MPI for powerpc with --enable-mpi-cxx
|
||
|
enabled. Thanks to Alastair McKinstry for reporting.
|
||
|
- Fix a problem using MPI_IALLTOALL with MPI_IN_PLACE argument. Thanks to
|
||
|
Chris Ward for reporting.
|
||
|
- Fix a problem using MPI_RACCUMULATE with the Portals4 transport. Thanks to
|
||
|
@PDeveze for reporting.
|
||
|
- Fix an issue with static linking and duplicate symbols arising from PMIx
|
||
|
Slurm components. Thanks to Limin Gu for reporting.
|
||
|
- Fix a problem when using MPI dynamic memory windows. Thanks to
|
||
|
Christoph Niethammer for reporting.
|
||
|
- Fix a problem with Open MPI's pkgconfig files. Thanks to Alastair McKinstry
|
||
|
for reporting.
|
||
|
- Fix a problem with MPI_IREDUCE when the same buffer is supplied for the
|
||
|
send and recv buffer arguments. Thanks to Valentin Petrov for reporting.
|
||
|
- Fix a problem with atomic operations on PowerPC. Thanks to Paul
|
||
|
Hargrove for reporting.
|
||
|
|
||
|
Known issues (to be addressed in v2.0.3):
|
||
|
|
||
|
- See the list of fixes slated for v2.0.3 here:
|
||
|
https://github.com/open-mpi/ompi/milestone/23
|
||
|
|
||
|
2.0.1 -- 2 September 2016
|
||
|
-----------------------
|
||
|
|
||
|
Bug fixes/minor improvements:
|
||
|
|
||
|
- Short message latency and message rate performance improvements for
|
||
|
all transports.
|
||
|
- Fix shared memory performance when using RDMA-capable networks.
|
||
|
Thanks to Tetsuya Mishima and Christoph Niethammer for reporting.
|
||
|
- Fix bandwith performance degredation in the yalla (MXM) PML. Thanks
|
||
|
to Andreas Kempf for reporting the issue.
|
||
|
- Fix OpenSHMEM crash when running on non-Mellanox MXM-based networks.
|
||
|
Thanks to Debendra Das for reporting the issue.
|
||
|
- Fix a crash occuring after repeated calls to MPI_FILE_SET_VIEW with
|
||
|
predefined datatypes. Thanks to Eric Chamberland and Matthew
|
||
|
Knepley for reporting and helping chase down this issue.
|
||
|
- Fix stdin propagation to MPI processes. Thanks to Jingchao Zhang
|
||
|
for reporting the issue.
|
||
|
- Fix various runtime and portability issues by updating the PMIx
|
||
|
internal component to v1.1.5.
|
||
|
- Fix process startup failures on Intel MIC platforms due to very
|
||
|
large entries in /proc/mounts.
|
||
|
- Fix a problem with use of relative path for specifing executables to
|
||
|
mpirun/oshrun. Thanks to David Schneider for reporting.
|
||
|
- Various improvements when running over portals-based networks.
|
||
|
- Fix thread-based race conditions with GNI-based networks.
|
||
|
- Fix a problem with MPI_FILE_CLOSE and MPI_FILE_SET_SIZE. Thanks
|
||
|
to Cihan Altinay for reporting.
|
||
|
- Remove all use of rand(3) from within Open MPI so as not to perturb
|
||
|
applications use of it. Thanks to Matias Cabral and Noel Rycroft
|
||
|
for reporting.
|
||
|
- Fix crash in MPI_COMM_SPAWN.
|
||
|
- Fix types for MPI_UNWEIGHTED and MPI_WEIGHTS_EMPTY. Thanks to
|
||
|
Lisandro Dalcín for reporting.
|
||
|
- Correctly report the name of MPI_INTEGER16.
|
||
|
- Add some missing MPI constants to the Fortran bindings.
|
||
|
- Fixed compile error when configuring Open MPI with --enable-timing.
|
||
|
- Correctly set the shared library version of libompitrace.so. Thanks
|
||
|
to Alastair McKinstry for reporting.
|
||
|
- Fix errors in the MPI_RPUT, MPI_RGET, MPI_RACCUMULATE, and
|
||
|
MPI_RGET_ACCUMULATE Fortran bindings. Thanks to Alfio Lazzaro and
|
||
|
Joost VandeVondele for tracking this down.
|
||
|
- Fix problems with use of derived datatypes in non-blocking
|
||
|
collectives. Thanks to Yuki Matsumoto for reporting.
|
||
|
- Fix problems with OpenSHMEM header files when using CMake. Thanks to
|
||
|
Paul Kapinos for reporting the issue.
|
||
|
- Fix problem with use use of non-zero lower bound datatypes in
|
||
|
collectives. Thanks to Hristo Iliev for reporting.
|
||
|
- Fix a problem with memory allocation within MPI_GROUP_INTERSECTION.
|
||
|
Thanks to Lisandro Dalcín for reporting.
|
||
|
- Fix an issue with MPI_ALLGATHER for communicators that don't consist
|
||
|
of two ranks. Thanks to David Love for reporting.
|
||
|
- Various fixes for collectives when used with esoteric MPI datatypes.
|
||
|
- Fixed corner cases of handling DARRAY and HINDEXED_BLOCK datatypes.
|
||
|
- Fix a problem with filesystem type check for OpenBSD.
|
||
|
Thanks to Paul Hargrove for reporting.
|
||
|
- Fix some debug input within Open MPI internal functions. Thanks to
|
||
|
Durga Choudhury for reporting.
|
||
|
- Fix a typo in a configury help message. Thanks to Paul Hargrove for
|
||
|
reporting.
|
||
|
- Correctly support MPI_IN_PLACE in MPI_[I]ALLTOALL[V|W] and
|
||
|
MPI_[I]EXSCAN.
|
||
|
- Fix alignment issues on SPARC platforms.
|
||
|
|
||
|
Known issues (to be addressed in v2.0.2):
|
||
|
|
||
|
- See the list of fixes slated for v2.0.2 here:
|
||
|
https://github.com/open-mpi/ompi/milestone/20, and
|
||
|
https://github.com/open-mpi/ompi-release/milestone/19
|
||
|
(note that the "ompi-release" Github repo will be folded/absorbed
|
||
|
into the "ompi" Github repo at some point in the future)
|
||
|
|
||
|
|
||
|
2.0.0 -- 12 July 2016
|
||
|
---------------------
|
||
|
|
||
|
**********************************************************************
|
||
|
* Open MPI is now fully MPI-3.1 compliant
|
||
|
**********************************************************************
|
||
|
|
||
|
Major new features:
|
||
|
|
||
|
- Many enhancements to MPI RMA. Open MPI now maps MPI RMA operations
|
||
|
on to native RMA operations for those networks which support this
|
||
|
capability.
|
||
|
- Greatly improved support for MPI_THREAD_MULTIPLE (when configured
|
||
|
with --enable-mpi-thread-multiple).
|
||
|
- Enhancements to reduce the memory footprint for jobs at scale. A
|
||
|
new MCA parameter, "mpi_add_procs_cutoff", is available to set the
|
||
|
threshold for using this feature.
|
||
|
- Completely revamped support for memory registration hooks when using
|
||
|
OS-bypass network transports.
|
||
|
- Significant OMPIO performance improvements and many bug fixes.
|
||
|
- Add support for PMIx - Process Management Interface for Exascale.
|
||
|
Version 1.1.2 of PMIx is included internally in this release.
|
||
|
- Add support for PLFS file systems in Open MPI I/O.
|
||
|
- Add support for UCX transport.
|
||
|
- Simplify build process for Cray XC systems. Add support for
|
||
|
using native SLURM.
|
||
|
- Add a --tune mpirun command line option to simplify setting many
|
||
|
environment variables and MCA parameters.
|
||
|
- Add a new MCA parameter "orte_default_dash_host" to offer an analogue
|
||
|
to the existing "orte_default_hostfile" MCA parameter.
|
||
|
- Add the ability to specify the number of desired slots in the mpirun
|
||
|
--host option.
|
||
|
|
||
|
Changes in behavior compared to prior versions:
|
||
|
|
||
|
- In environments where mpirun cannot automatically determine the
|
||
|
number of slots available (e.g., when using a hostfile that does not
|
||
|
specify "slots", or when using --host without specifying a ":N"
|
||
|
suffix to hostnames), mpirun now requires the use of "-np N" to
|
||
|
specify how many MPI processes to launch.
|
||
|
- The MPI C++ bindings -- which were removed from the MPI standard in
|
||
|
v3.0 -- are no longer built by default and will be removed in some
|
||
|
future version of Open MPI. Use the --enable-mpi-cxx-bindings
|
||
|
configure option to build the deprecated/removed MPI C++ bindings.
|
||
|
- ompi_info now shows all components, even if they do not have MCA
|
||
|
parameters. The prettyprint output now separates groups with a
|
||
|
dashed line.
|
||
|
- OMPIO is now the default implementation of parallel I/O, with the
|
||
|
exception for Lustre parallel filesystems (where ROMIO is still the
|
||
|
default). The default selection of OMPI vs. ROMIO can be controlled
|
||
|
via the "--mca io ompi|romio" command line switch to mpirun.
|
||
|
- Per Open MPI's versioning scheme (see the README), increasing the
|
||
|
major version number to 2 indicates that this version is not
|
||
|
ABI-compatible with prior versions of Open MPI. You will need to
|
||
|
recompile MPI and OpenSHMEM applications to work with this version
|
||
|
of Open MPI.
|
||
|
- Removed checkpoint/restart code due to loss of maintainer. :-(
|
||
|
- Change the behavior for handling certain signals when using PSM and
|
||
|
PSM2 libraries. Previously, the PSM and PSM2 libraries would trap
|
||
|
certain signals in order to generate tracebacks. The mechanism was
|
||
|
found to cause issues with Open MPI's own error reporting mechanism.
|
||
|
If not already set, Open MPI now sets the IPATH_NO_BACKTRACE and
|
||
|
HFI_NO_BACKTRACE environment variables to disable PSM/PSM2's
|
||
|
handling these signals.
|
||
|
|
||
|
Removed legacy support:
|
||
|
|
||
|
- Removed support for OS X Leopard.
|
||
|
- Removed support for Cray XT systems.
|
||
|
- Removed VampirTrace.
|
||
|
- Removed support for Myrinet/MX.
|
||
|
- Removed legacy collective module:ML.
|
||
|
- Removed support for Alpha processors.
|
||
|
- Removed --enable-mpi-profiling configure option.
|
||
|
|
||
|
Known issues (to be addressed in v2.0.1):
|
||
|
|
||
|
- See the list of fixes slated for v2.0.1 here:
|
||
|
https://github.com/open-mpi/ompi/milestone/16, and
|
||
|
https://github.com/open-mpi/ompi-release/milestone/16
|
||
|
(note that the "ompi-release" Github repo will be folded/absorbed
|
||
|
into the "ompi" Github repo at some point in the future)
|
||
|
|
||
|
- ompi-release#986: Fix data size counter for large ops with fcoll/static
|
||
|
- ompi-release#987: Fix OMPIO performance on Lustre
|
||
|
- ompi-release#1013: Fix potential inconsistency in btl/openib default settings
|
||
|
- ompi-release#1014: Do not return MPI_ERR_PENDING from collectives
|
||
|
- ompi-release#1056: Remove dead profile code from oshmem
|
||
|
- ompi-release#1081: Fix MPI_IN_PLACE checking for IALLTOALL{V|W}
|
||
|
- ompi-release#1081: Fix memchecker in MPI_IALLTOALLW
|
||
|
- ompi-release#1081: Support MPI_IN_PLACE in MPI_(I)ALLTOALLW and MPI_(I)EXSCAN
|
||
|
- ompi-release#1107: Allow future PMIx support for RM spawn limits
|
||
|
- ompi-release#1108: Fix sparse group process reference counting
|
||
|
- ompi-release#1109: If specified to be oversubcribed, disable binding
|
||
|
- ompi-release#1122: Allow NULL arrays for empty datatypes
|
||
|
- ompi-release#1123: Fix signed vs. unsigned compiler warnings
|
||
|
- ompi-release#1123: Make max hostname length uniform across code base
|
||
|
- ompi-release#1127: Fix MPI_Compare_and_swap
|
||
|
- ompi-release#1127: Fix MPI_Win_lock when used with MPI_Win_fence
|
||
|
- ompi-release#1132: Fix typo in help message for --enable-mca-no-build
|
||
|
- ompi-release#1154: Ensure pairwise coll algorithms disqualify themselves properly
|
||
|
- ompi-release#1165: Fix typos in debugging/verbose message output
|
||
|
- ompi-release#1178: Fix ROMIO filesystem check on OpenBSD 5.7
|
||
|
- ompi-release#1197: Fix Fortran pthread configure check
|
||
|
- ompi-release#1205: Allow using external PMIx 1.1.4 and 2.0
|
||
|
- ompi-release#1215: Fix configure to support the NAG Fortran compiler
|
||
|
- ompi-release#1220: Fix combiner args for MPI_HINDEXED_BLOCK
|
||
|
- ompi-release#1225: Fix combiner args for MPI_DARRAY
|
||
|
- ompi-release#1226: Disable old memory hooks with recent gcc versions
|
||
|
- ompi-release#1231: Fix new "patcher" support for some XLC platforms
|
||
|
- ompi-release#1244: Fix Java error handling
|
||
|
- ompi-release#1250: Ensure TCP is not selected for RDMA operations
|
||
|
- ompi-release#1252: Fix verbose output in coll selection
|
||
|
- ompi-release#1253: Set a default name for user-defined MPI_Op
|
||
|
- ompi-release#1254: Add count==0 checks in some non-blocking colls
|
||
|
- ompi-release#1258: Fix "make distclean" when using external pmix/hwloc/libevent
|
||
|
- ompi-release#1260: Clean up/uniform mca/coll/base memory management
|
||
|
- ompi-release#1261: Remove "patcher" warning message for static builds
|
||
|
- ompi-release#1263: Fix IO MPI_Request for 0-size read/write
|
||
|
- ompi-release#1264: Add blocking fence for SLURM operations
|
||
|
|
||
|
Bug fixes / minor enhancements:
|
||
|
|
||
|
- Updated internal/embedded copies of third-party software:
|
||
|
- Update the internal copy of ROMIO to that which shipped in MPICH
|
||
|
3.1.4.
|
||
|
- Update internal copy of libevent to v2.0.22.
|
||
|
- Update internal copy of hwloc to v1.11.2.
|
||
|
- Notable new MCA parameters:
|
||
|
- opal_progress_lp_call_ration: Control how often low-priority
|
||
|
callbacks are made during Open MPI's main progress loop.
|
||
|
- opal_common_verbs_want_fork_support: This replaces the
|
||
|
btl_openib_want_fork_support parameter.
|
||
|
- Add --with-platform-patches-dir configure option.
|
||
|
- Add --with-pmi-libdir configure option for environments that install
|
||
|
PMI libs in a non-default location.
|
||
|
- Various configure-related compatibility updates for newer versions
|
||
|
of libibverbs and OFED.
|
||
|
- Numerous fixes/improvements to orte-dvm. Special thanks to Mark
|
||
|
Santcroos for his help.
|
||
|
- Fix a problem with timer code on ia32 platforms. Thanks to
|
||
|
Paul Hargrove for reporting this and providing a patch.
|
||
|
- Fix a problem with use of a 64 bit atomic counter. Thanks to
|
||
|
Paul Hargrove for reporting.
|
||
|
- Fix a problem with singleton job launching. Thanks to Lisandro
|
||
|
Dalcín for reporting.
|
||
|
- Fix a problem with use of MPI_UNDEFINED with MPI_COMM_SPLIT_TYPE.
|
||
|
Thanks to Lisandro Dalcín for reporting.
|
||
|
- Silence a compiler warning in PSM MTL. Thanks to Adrian Reber for
|
||
|
reporting this.
|
||
|
- Properly detect Intel TrueScale and OmniPath devices in the ACTIVE
|
||
|
state. Thanks to Durga Choudhury for reporting the issue.
|
||
|
- Fix detection and use of Solaris Studio 12.5 (beta) compilers.
|
||
|
Thanks to Paul Hargrove for reporting and debugging.
|
||
|
- Fix various small memory leaks.
|
||
|
- Allow NULL arrays when creating empty MPI datatypes.
|
||
|
- Replace use of alloca with malloc for certain datatype creation
|
||
|
functions. Thanks to Bogdan Sataric for reporting this.
|
||
|
- Fix use of MPI_LB and MPI_UB in creation of of certain MPI datatypes.
|
||
|
Thanks to Gus Correa for helping to fix this.
|
||
|
- Implement a workaround for a GNU Libtool problem. Thanks to Eric
|
||
|
Schnetter for reporting and fixing.
|
||
|
- Improve hcoll library detection in configure. Thanks to David
|
||
|
Shrader and Åke Sandgren for reporting this.
|
||
|
- Miscellaneous minor bug fixes in the hcoll component.
|
||
|
- Miscellaneous minor bug fixes in the ugni component.
|
||
|
- Fix problems with XRC detection in OFED 3.12 and older releases.
|
||
|
Thanks to Paul Hargrove for his analysis of this problem.
|
||
|
- Update (non-standard/experimental) Java MPI interfaces to support
|
||
|
MPI-3.1 functionality.
|
||
|
- Fix an issue with MCA parameters for Java bindings. Thanks to
|
||
|
Takahiro Kawashima and Siegmar Gross for reporting this issue.
|
||
|
- Fix a problem when using persistent requests in the Java bindings.
|
||
|
Thanks to Nate Chambers for reporting.
|
||
|
- Fix problem with Java bindings on OX X 10.11. Thanks to Alexander
|
||
|
Daryin for reporting this issue.
|
||
|
- Fix a performance problem for large messages for Cray XC systems.
|
||
|
Thanks to Jerome Vienne for reporting this.
|
||
|
- Fix an issue with MPI_WIN_LOCK_ALL. Thanks to Thomas Jahns for
|
||
|
reporting.
|
||
|
- Fix an issue with passing a parameter to configure multiple times.
|
||
|
Thanks to QuesarVII for reporting and supplying a fix.
|
||
|
- Add support for ALPS resource allocation system on Cray CLE 5.2 and
|
||
|
later. Thanks to Mark Santcroos.
|
||
|
- Corrections to the HACKING file. Thanks to Maximilien Levesque.
|
||
|
- Fix an issue with user supplied reduction operator functions.
|
||
|
Thanks to Rupert Nash for reporting this.
|
||
|
- Fix an issue with an internal list management function. Thanks to
|
||
|
Adrian Reber for reporting this.
|
||
|
- Fix a problem with MPI-RMA PSCW epochs. Thanks to Berk Hess for
|
||
|
reporting this.
|
||
|
- Fix a problem in neighborhood collectives. Thanks to Lisandro
|
||
|
Dalcín for reporting.
|
||
|
- Fix MPI_IREDUCE_SCATTER_BLOCK for a one-process communicator. Thanks
|
||
|
to Lisandro Dalcín for reporting.
|
||
|
- Add (Open MPI-specific) additional flavors to MPI_COMM_SPLIT_TYPE.
|
||
|
See MPI_Comm_split_type(3) for details. Thanks to Nick Andersen for
|
||
|
supplying this enhancement.
|
||
|
- Improve closing of file descriptors during the job launch phase.
|
||
|
Thanks to Piotr Lesnicki for reporting and providing this
|
||
|
enhancement.
|
||
|
- Fix a problem in MPI_GET_ACCUMULATE and MPI_RGET_ACCUMULATE when
|
||
|
using Portals4. Thanks to Nicolas Chevalier for reporting.
|
||
|
- Use correct include file for lstat prototype in ROMIO. Thanks to
|
||
|
William Throwe for finding and providing a fix.
|
||
|
- Add missing Fortran bindings for MPI_WIN_ALLOCATE. Thanks to Christoph
|
||
|
Niethammer for reporting and fixing.
|
||
|
- Fortran related fixes to handle Intel 2016 compiler. Thanks to
|
||
|
Fabrice Roy for reporting this.
|
||
|
- Fix a Fortran linkage issue. Thanks to Macro Atzeri for finding and
|
||
|
suggesting a fix.
|
||
|
- Fix problem with using BIND(C) for Fortran bindings with logical
|
||
|
parameters. Thanks to Paul Romano for reporting.
|
||
|
- Fix an issue with use of DL-related macros in opal library. Thanks to
|
||
|
Scott Atchley for finding this.
|
||
|
- Fix an issue with parsing mpirun command line options which contain
|
||
|
colons. Thanks to Lev Given for reporting.
|
||
|
- Fix a problem with Open MPI's package configury files. Thanks to
|
||
|
Christoph Junghans for reporting.
|
||
|
- Fix a typo in the MPI_INTERCOMM_MERGE man page. Thanks To Harald
|
||
|
Servat for reporting and correcting.
|
||
|
- Update man pages for non-blocking sends per MPI 3.1 standard.
|
||
|
Thanks to Alexander Pozdneev for reporting.
|
||
|
- Fix problem when compiling against PVFS2. Thanks to Dave Love for
|
||
|
reporting.
|
||
|
- Fix problems with MPI_NEIGHBOR_ALLTOALL{V,W}. Thanks to Willem
|
||
|
Vermin for reporting this issue.
|
||
|
- Fix various compilation problems on Cygwin. Thanks to Marco Atzeri
|
||
|
for supplying these fixes.
|
||
|
- Fix problem with resizing of subarray and darray data types. Thanks
|
||
|
to Keith Bennett and Dan Garmann for reporting.
|
||
|
- Fix a problem with MPI_COMBINER_RESIZED. Thanks to James Ramsey for
|
||
|
the report.
|
||
|
- Fix an hwloc binding issue. Thanks to Ben Menadue for reporting.
|
||
|
- Fix a problem with the shared memory (sm) BTL. Thanks to Peter Wind
|
||
|
for the report.
|
||
|
- Fixes for heterogeneous support. Thanks to Siegmar Gross for reporting.
|
||
|
- Fix a problem with memchecker. Thanks to Clinton Simpson for reporting.
|
||
|
- Fix a problem with MPI_UNWEIGHTED in topology functions. Thanks to
|
||
|
Jun Kudo for reporting.
|
||
|
- Fix problem with a MCA parameter base filesystem types. Thanks to
|
||
|
Siegmar Gross for reporting.
|
||
|
- Fix a problem with some windows info argument types. Thanks to
|
||
|
Alastair McKinstry for reporting.
|
||
|
|
||
|
|
||
|
1.10.7 - 16 May 2017
|
||
|
------
|
||
|
- Fix bug in TCP BTL that impacted performance on 10GbE (and faster)
|
||
|
networks by not adjusting the TCP send/recv buffer sizes and using
|
||
|
system default values
|
||
|
- Add missing MPI_AINT_ADD and MPI_AINT_DIFF function delcarations in
|
||
|
mpif.h
|
||
|
- Fixed time reported by MPI_WTIME; it was previously reported as
|
||
|
dependent upon the CPU frequency.
|
||
|
- Fix platform detection on FreeBSD
|
||
|
- Fix a bug in the handling of MPI_TYPE_CREATE_DARRAY in
|
||
|
MPI_(R)(GET_)ACCUMULATE
|
||
|
- Fix openib memory registration limit calculation
|
||
|
- Add missing MPI_T_PVAR_SESSION_NULL in mpi.h
|
||
|
- Fix "make distcheck" when using external hwloc and/or libevent packages
|
||
|
- Add latest ConnectX-5 vendor part id to OpenIB device params
|
||
|
- Fix race condition in the UCX PML
|
||
|
- Fix signal handling for rsh launcher
|
||
|
- Fix Fortran compilation errors by removing MPI_SIZEOF in the Fortran
|
||
|
interfaces when the compiler does not support it
|
||
|
- Fixes for the pre-ignore-TKR "mpi" Fortran module implementation
|
||
|
(i.e., for older Fortran compilers -- these problems did not exist
|
||
|
in the "mpi" module implementation for modern Fortran compilers):
|
||
|
- Add PMPI_* interfaces
|
||
|
- Fix typo in MPI_FILE_WRITE_AT_ALL_BEGIN interface name
|
||
|
- Fix typo in MPI_FILE_READ_ORDERED_BEGIN interface name
|
||
|
- Fixed the type of MPI_DISPLACEMENT_CURRENT in all Fortran interfaces
|
||
|
to be an INTEGER(KIND=MPI_OFFSET_KIND).
|
||
|
- Fixed typos in MPI_INFO_GET_* man pages. Thanks to Nicolas Joly for
|
||
|
the patch
|
||
|
- Fix typo bugs in wrapper compiler script
|
||
|
|
||
|
|
||
|
1.10.6 - 17 Feb 2017
|
||
|
------
|
||
|
- Fix bug in timer code that caused problems at optimization settings
|
||
|
greater than 2
|
||
|
- OSHMEM: make mmap allocator the default instead of sysv or verbs
|
||
|
- Support MPI_Dims_create with dimension zero
|
||
|
- Update USNIC support
|
||
|
- Prevent 64-bit overflow on timer counter
|
||
|
- Add support for forwarding signals
|
||
|
- Fix bug that caused truncated messages on large sends over TCP BTL
|
||
|
- Fix potential infinite loop when printing a stacktrace
|
||
|
|
||
|
|
||
|
1.10.5 - 19 Dec 2016
|
||
|
------
|
||
|
- Update UCX APIs
|
||
|
- Fix bug in darray that caused MPI/IO failures
|
||
|
- Use a MPI_Get_library_version() like string to tag the debugger DLL.
|
||
|
Thanks to Alastair McKinstry for the report
|
||
|
- Fix multi-threaded race condition in coll/libnbc
|
||
|
- Several fixes to OSHMEM
|
||
|
- Fix bug in UCX support due to uninitialized field
|
||
|
- Fix MPI_Ialltoallv with MPI_IN_PLACE and without MPI param check
|
||
|
- Correctly reset receive request type before init. Thanks Chris Pattison
|
||
|
for the report and test case.
|
||
|
- Fix bug in iallgather[v]
|
||
|
- Fix concurrency issue with MPI_Comm_accept. Thanks to Pieter Noordhuis
|
||
|
for the patch
|
||
|
- Fix ompi_coll_base_{gather,scatter}_intra_binomial
|
||
|
- Fixed an issue with MPI_Type_get_extent returning the wrong extent
|
||
|
for distributed array datatypes.
|
||
|
- Re-enable use of rtdtsc instruction as a monotonic clock source if
|
||
|
the processor has a core-invariant tsc. This is a partial fix for a
|
||
|
performance regression introduced in Open MPI v1.10.3.
|
||
|
|
||
|
|
||
|
1.10.4 - 01 Sept 2016
|
||
|
------
|
||
|
|
||
|
- Fix assembler support for MIPS
|
||
|
- Improve memory handling for temp buffers in collectives
|
||
|
- Fix [all]reduce with non-zero lower bound datatypes
|
||
|
Thanks Hristo Iliev for the report
|
||
|
- Fix non-standard ddt handling. Thanks Yuki Matsumoto for the report
|
||
|
- Various libnbc fixes. Thanks Yuki Matsumoto for the report
|
||
|
- Fix typos in request RMA bindings for Fortran. Thanks to @alazzaro
|
||
|
and @vondele for the assist
|
||
|
- Various bug fixes and enhancements to collective support
|
||
|
- Fix predefined types mapping in hcoll
|
||
|
- Revive the coll/sync component to resolve unexpected message issues
|
||
|
during tight loops across collectives
|
||
|
- Fix typo in wrapper compiler for Fortran static builds
|
||
|
|
||
|
|
||
|
1.10.3 - 15 June 2016
|
||
|
------
|
||
|
|
||
|
- Fix zero-length datatypes. Thanks to Wei-keng Liao for reporting
|
||
|
the issue.
|
||
|
- Minor manpage cleanups
|
||
|
- Implement atomic support in OSHMEM/UCX
|
||
|
- Fix support of MPI_COMBINER_RESIZED. Thanks to James Ramsey
|
||
|
for the report
|
||
|
- Fix computation of #cpus when --use-hwthread-cpus is used
|
||
|
- Add entry points for Allgatherv, iAllgatherv, Reduce, and iReduce
|
||
|
for the HCOLL library
|
||
|
- Fix an HCOLL integration bug that could signal completion of request
|
||
|
while still being worked
|
||
|
- Fix computation of cores when SMT is enabled. Thanks to Ben Menadue
|
||
|
for the report
|
||
|
- Various USNIC fixes
|
||
|
- Create a datafile in the per-proc directory in order to make it
|
||
|
unique per communicator. Thanks to Peter Wind for the report
|
||
|
- Fix zero-size malloc in one-sided pt-to-pt code. Thanks to Lisandro
|
||
|
Dalcín for the report
|
||
|
- Fix MPI_Get_address when passed MPI_BOTTOM to not return an error.
|
||
|
Thanks to Lisandro Dalcín for the report
|
||
|
- Fix MPI_TYPE_SET_ATTR with NULL value. Thanks to Lisandro Dalcín for
|
||
|
the report
|
||
|
- Fix various Fortran08 binding issues
|
||
|
- Fix memchecker no-data case. Thanks to Clinton Stimpson for the report
|
||
|
- Fix CUDA support under OS-X
|
||
|
- Fix various OFI/MTL integration issues
|
||
|
- Add MPI_T man pages
|
||
|
- Fix one-sided pt-to-pt issue by preventing communication from happening
|
||
|
before a target enters a fence, even in the no-precede case
|
||
|
- Fix a bug that disabled Totalview for MPMD use-case
|
||
|
- Correctly support MPI_UNWEIGHTED in topo-graph-neighbors. Thanks to
|
||
|
Jun Kudo for the report
|
||
|
- Fix singleton operations under SLURM when PMI2 is enabled
|
||
|
- Do not use MPI_IN_PLACE in neighborhood collectives for non-blocking
|
||
|
collectives (libnbc). Thanks to Jun Kudo for the report
|
||
|
- Silence autogen deprecation warnings for newer versions of Perl
|
||
|
- Do not return MPI_ERR_PENDING from collectives
|
||
|
- Use type int* for MPI_WIN_DISP_UNIT, MPI_WIN_CREATE_FLAVOR, and MPI_WIN_MODEL.
|
||
|
Thanks to Alastair McKinstry for the report
|
||
|
- Fix register_datarep stub function in IO/OMPIO. Thanks to Eric
|
||
|
Chamberland for the report
|
||
|
- Fix a bus error on MPI_WIN_[POST,START] in the shared memory one-sided component
|
||
|
- Add several missing MPI_WIN_FLAVOR constants to the Fortran support
|
||
|
- Enable connecting processes from different subnets using the openib BTL
|
||
|
- Fix bug in basic/barrier algorithm in OSHMEM
|
||
|
- Correct process binding for the --map-by node case
|
||
|
- Include support for subnet-to-subnet routing over InfiniBand networks
|
||
|
- Fix usnic resource check
|
||
|
- AUTHORS: Fix an errant reference to Subversion IDs
|
||
|
- Fix affinity for MPMD jobs running under LSF
|
||
|
- Fix many Fortran binding bugs
|
||
|
- Fix `MPI_IN_PLACE`-related bugs
|
||
|
- Fix PSM/PSM2 support for singleton operations
|
||
|
- Ensure MPI transports continue to progress during RTE barriers
|
||
|
- Update HWLOC to 1.9.1 end-of-series
|
||
|
- Fix a bug in the Java command line parser when the
|
||
|
-Djava.library.path options was given by the user
|
||
|
- Update the MTL/OFI provider selection behavior
|
||
|
- Add support for clock_gettime on Linux.
|
||
|
- Correctly detect and configure for Solaris Studio 12.5
|
||
|
beta compilers
|
||
|
- Correctly compute #slots when -host is used for MPMD case
|
||
|
- Fix a bug in the hcoll collectives due to an uninitialized field
|
||
|
- Do not set a binding policy when oversubscribing a node
|
||
|
- Fix hang in intercommunicator operations when oversubscribed
|
||
|
- Speed up process termination during MPI_Abort
|
||
|
- Disable backtrace support by default in the PSM/PSM2 libraries to
|
||
|
prevent unintentional conflicting behavior.
|
||
|
|
||
|
|
||
|
|
||
|
1.10.2: 26 Jan 2016
|
||
|
-------------------
|
||
|
|
||
|
**********************************************************************
|
||
|
* OSHMEM is now 1.2 compliant
|
||
|
**********************************************************************
|
||
|
|
||
|
- Fix NBC_Copy for legitimate zero-size messages
|
||
|
- Fix multiple bugs in OSHMEM
|
||
|
- Correctly handle mpirun --host <user>@<ip-address>
|
||
|
- Centralize two MCA params to avoid duplication between OMPI and
|
||
|
OSHMEM layers: opal_abort_delay and opal_abort_print_stack
|
||
|
- Add support for Fujitsu compilers
|
||
|
- Add UCX support for OMPI and OSHMEM
|
||
|
- Correctly handle oversubscription when not given directives
|
||
|
to permit it. Thanks to @ammore1 for reporting it
|
||
|
- Fix rpm spec file to not include the /usr directory
|
||
|
- Add Intel HFI1 default parameters for the openib BTL
|
||
|
- Resolve symbol conflicts in the PSM2 library
|
||
|
- Add ability to empty the rgpusm cache when full if requested
|
||
|
- Fix another libtool bug when -L requires a space between it
|
||
|
and the path. Thanks to Eric Schnetter for the patch.
|
||
|
- Add support for OSHMEM v1.2 APIs
|
||
|
- Improve efficiency of oshmem_preconnect_all algorithm
|
||
|
- Fix bug in buffered sends support
|
||
|
- Fix double free in edge case of mpirun. Thanks to @jsharpe for
|
||
|
the patch
|
||
|
- Multiple one-sided support fixes
|
||
|
- Fix integer overflow in the tuned "reduce" collective when
|
||
|
using buffers larger than INT_MAX in size
|
||
|
- Fix parse of user environment variables in mpirun. Thanks to
|
||
|
Stefano Garzarella for the patch
|
||
|
- Performance improvements in PSM2 support
|
||
|
- Fix NBS iBarrier for inter-communicators
|
||
|
- Fix bug in vader BTL during finalize
|
||
|
- Improved configure support for Fortran compilers
|
||
|
- Fix rank_file mapper to support default --slot-set. Thanks
|
||
|
to Matt Thompson for reporting it
|
||
|
- Update MPI_Testsome man page. Thanks to Eric Schnetter for
|
||
|
the suggestion
|
||
|
- Fix missing resize of the returned type for subarray and
|
||
|
darray types. Thanks to Keith Bennett and Dan Garmann for
|
||
|
reporting it
|
||
|
- Fix Java support on OSX 10.11. Thanks to Alexander Daryin
|
||
|
for reporting the problem
|
||
|
- Fix some compilation issues on Solaris 11.2. Thanks to
|
||
|
Paul Hargrove for his continued help in such areas
|
||
|
|
||
|
|
||
|
1.10.1: 4 Nov 2015
|
||
|
------------------
|
||
|
|
||
|
- Workaround an optimization problem with gcc compilers >= 4.9.2 that
|
||
|
causes problems with memory registration, and forced
|
||
|
mpi_leave_pinned to default to 0 (i.e., off). Thanks to @oere for
|
||
|
the fix.
|
||
|
- Fix use of MPI_LB and MPI_UB in subarray and darray datatypes.
|
||
|
Thanks to Gus Correa and Dimitar Pashov for pointing out the issue.
|
||
|
- Minor updates to mpi_show_mpi_alloc_mem_leaks and
|
||
|
ompi_debug_show_handle_leaks functionality.
|
||
|
- Fix segv when invoking non-blocking reductions with a user-defined
|
||
|
operation. Thanks to Rupert Nash and Georg Geiser for identifying
|
||
|
the issue.
|
||
|
- No longer probe for PCI topology on Solaris (unless running as root).
|
||
|
- Fix for Intel Parallel Studio 2016 ifort partial support of the
|
||
|
!GCC$ pragma. Thanks to Fabrice Roy for reporting the problem.
|
||
|
- Bunches of Coverity / static analysis fixes.
|
||
|
- Fixed ROMIO to look for lstat in <sys/stat.h>. Thanks to William
|
||
|
Throwe for submitting the patch both upstream and to Open MPI.
|
||
|
- Fixed minor memory leak when attempting to open plugins.
|
||
|
- Fixed type in MPI_IBARRIER C prototype. Thanks to Harald Servat for
|
||
|
reporting the issue.
|
||
|
- Add missing man pages for MPI_WIN_CREATE_DYNAMIC, MPI_WIN_ATTACH,
|
||
|
MPI_WIN_DETACH, MPI_WIN_ALLOCATE, MPI_WIN_ALLOCATE_SHARED.
|
||
|
- When mpirun-launching new applications, only close file descriptors
|
||
|
that are actually open (resulting in a faster launch in some
|
||
|
environments).
|
||
|
- Fix "test ==" issues in Open MPI's configure script. Thank to Kevin
|
||
|
Buckley for pointing out the issue.
|
||
|
- Fix performance issue in usnic BTL: ensure progress thread is
|
||
|
throttled back to not aggressively steal CPU cycles.
|
||
|
- Fix cache line size detection on POWER architectures.
|
||
|
- Add missing #include in a few places. Thanks to Orion Poplawski for
|
||
|
supplying the patch.
|
||
|
- When OpenSHMEM building is disabled, no longer install its header
|
||
|
files, help files, or man pages. Add man pages for oshrun, oshcc,
|
||
|
and oshfort.
|
||
|
- Fix mpi_f08 implementations of MPI_COMM_SET_INFO, and profiling
|
||
|
versions of MPI_BUFFER_DETACH, MPI_WIN_ALLOCATE,
|
||
|
MPI_WIN_ALLOCATE_SHARED, MPI_WTICK, and MPI_WTIME.
|
||
|
- Add orte_rmaps_dist_device MCA param, allowing users to map near a
|
||
|
specific device.
|
||
|
- Various updates/fixes to the openib BTL.
|
||
|
- Add missing defaults for the Mellanox ConnectX 3 card to the openib BTL.
|
||
|
- Minor bug fixes in the OFI MTL.
|
||
|
- Various updates to Mellanox's MXM, hcoll, and FCA components.
|
||
|
- Add OpenSHMEM man pages. Thanks to Tony Curtis for sharing the man
|
||
|
pages files from openshmem.org.
|
||
|
- Add missing "const" attributes to MPI_COMPARE_AND_SWAP,
|
||
|
MPI_FETCH_AND_OP, MPI_RACCUMULATE, and MPI_WIN_DETACH prototypes.
|
||
|
Thanks to Michael Knobloch and Takahiro Kawashima for bringing this
|
||
|
to our attention.
|
||
|
- Fix linking issues on some platforms (e.g., SLES 12).
|
||
|
- Fix hang on some corner cases when MPI applications abort.
|
||
|
- Add missing options to mpirun man page. Thanks to Daniel Letai
|
||
|
for bringing this to our attention.
|
||
|
- Add new --with-platform-patches-dir configure option
|
||
|
- Adjust relative selection priorities to ensure that MTL
|
||
|
support is favored over BTL support when both are available
|
||
|
- Use CUDA IPC for all sized messages for performance
|
||
|
|
||
|
|
||
|
1.10.0: 25 Aug 2015
|
||
|
-------------------
|
||
|
|
||
|
** NOTE: The v1.10.0 release marks the transition to Open MPI's new
|
||
|
** version numbering scheme. The v1.10.x release series is based on
|
||
|
** the v1.8.x series, but with a few new features. v2.x will be the
|
||
|
** next series after the v1.10.x series, and complete the transition
|
||
|
** to the new version numbering scheme. See README for more details
|
||
|
** on the new versioning scheme.
|
||
|
**
|
||
|
** NOTE: In accordance with OMPI version numbering, the v1.10 is *not*
|
||
|
** API compatible with the v1.8 release series.
|
||
|
|
||
|
- Added libfabric support (see README for more details):
|
||
|
- usNIC BTL updated to use libfabric.
|
||
|
- Added OFI MTL (usable with PSM in libfabric v1.1.0).
|
||
|
- Added Intel Omni-Path support via new PSM2 MTL.
|
||
|
- Added "yalla" PML for faster MXM support.
|
||
|
- Removed support for MX
|
||
|
- Added persistent distributed virtual machine (pDVM) support for fast
|
||
|
workflow executions.
|
||
|
- Fixed typo in GCC inline assembly introduced in Open MPI v1.8.8.
|
||
|
Thanks to Paul Hargrove for pointing out the issue.
|
||
|
- Add missing man pages for MPI_Win_get|set_info(3).
|
||
|
- Ensure that session directories are cleaned up at the end of a run.
|
||
|
- Fixed linking issues on some OSs where symbols of dependent
|
||
|
libraries are not automatically publicly available.
|
||
|
- Improve hcoll and fca configury library detection. Thanks to David
|
||
|
Shrader for helping track down the issue.
|
||
|
- Removed the LAMA mapper (for use in setting affinity). Its
|
||
|
functionality has been largely superseded by other mpirun CLI
|
||
|
options.
|
||
|
- CUDA: Made the asynchronous copy mode be the default.
|
||
|
- Fix a malloc(0) warning in MPI_IREDUCE_SCATTER_BLOCK. Thanks to
|
||
|
Lisandro Dalcín for reporting the issue.
|
||
|
- Fix typo in MPI_Scatter(3) man page. Thanks to Akshay Venkatesh for
|
||
|
noticing the mistake.
|
||
|
- Add rudimentary protection from TCP port scanners.
|
||
|
- Fix typo in Open MPI error handling. Thanks to Åke Sandgren for
|
||
|
pointing out the error.
|
||
|
- Increased the performance of the CM PML (i.e., the Portals, PSM,
|
||
|
PSM2, MXM, and OFI transports).
|
||
|
- Restored visibility of blocking send requests in message queue
|
||
|
debuggers (e.g., TotalView, DDT).
|
||
|
- Fixed obscure IPv6-related bug in the TCP BTL.
|
||
|
- Add support for the "no_locks" MPI_Info key for one-sided
|
||
|
functionality.
|
||
|
- Fixed ibv_fork support for verbs-based networks.
|
||
|
- Fixed a variety of small bugs in OpenSHMEM.
|
||
|
- Fixed MXM configure with additional CPPFLAGS and LDFLAGS. Thanks to
|
||
|
David Shrader for the patch.
|
||
|
- Fixed incorrect memalign threshhold in the openib BTL. Thanks to
|
||
|
Xavier Besseron for pointing out the issue.
|
||
|
|
||
|
|
||
|
1.8.8: 5 Aug 2015
|
||
|
-----------------
|
||
|
|
||
|
- Fix a segfault in MPI_FINALIZE with the PSM MTL.
|
||
|
- Fix mpi_f08 sentinels (e.g., MPI_STATUS_IGNORE) handling.
|
||
|
- Set some additional MXM default values for OSHMEM.
|
||
|
- Fix an invalid memory access in MPI_MRECV and MPI_IMRECV.
|
||
|
- Include two fixes that were mistakenly left out of the official
|
||
|
v1.8.7 tarball:
|
||
|
- Fixed MPI_WIN_POST and MPI_WIN_START for zero-size messages
|
||
|
- Protect the OOB TCP ports from segfaulting when accessed by port
|
||
|
scanners
|
||
|
|
||
|
|
||
|
1.8.7: 15 Jul 2015
|
||
|
------------------
|
||
|
|
||
|
** NOTE: v1.8.7 technically breaks ABI with prior versions
|
||
|
** in the 1.8 series because it repairs two incorrect API
|
||
|
** signatures. However, users will only need to recompile
|
||
|
** if they were using those functions - which they couldn't
|
||
|
** have been, because the signatures were wrong :-)
|
||
|
|
||
|
- Plugged a memory leak that impacted blocking sends
|
||
|
- Fixed incorrect declaration for MPI_T_pvar_get_index and added
|
||
|
missing return code MPI_T_INVALID_NAME.
|
||
|
- Fixed an uninitialized variable in PMI2 support
|
||
|
- Added new vendor part id for Mellanox ConnectX4-LX
|
||
|
- Fixed NBC_Copy for legitimate zero-size messages
|
||
|
- Fixed MPI_Win_post and MPI_Win_start for zero-size messages
|
||
|
- Protect the OOB ports from segfaulting when accessed by port scanners
|
||
|
- Fixed several Fortran typos
|
||
|
- Fixed configure detection of XRC support
|
||
|
- Fixed support for highly heterogeneous systems to avoid
|
||
|
memory corruption when printing out the bindings
|
||
|
|
||
|
1.8.6: 17 Jun 2015
|
||
|
------------------
|
||
|
|
||
|
- Fixed memory leak on Mac OS-X exposed by TCP keepalive
|
||
|
- Fixed keepalive support to ensure that daemon/node failure
|
||
|
results in complete job cleanup
|
||
|
- Update Java binding support
|
||
|
- Fixed MPI_THREAD_MULTIPLE bug in vader shared memory BTL
|
||
|
- Fixed issue during shutdown when CUDA initialization wasn't complete
|
||
|
- Fixed orted environment when no prefix given
|
||
|
- Fixed trivial typo in MPI_Neighbor_allgather manpage
|
||
|
- Fixed tree-spawn support for sh and ksh shells
|
||
|
- Several data type fixes
|
||
|
- Fixed IPv6 support bug
|
||
|
- Cleaned up an unlikely build issue
|
||
|
- Fixed PMI2 process map parsing for cyclic mappings
|
||
|
- Fixed memalign threshold in openib BTL
|
||
|
- Fixed debugger access to message queues for blocking send/recv
|
||
|
|
||
|
|
||
|
1.8.5: 5 May 2015
|
||
|
-----------------
|
||
|
|
||
|
- Fixed configure problems in some cases when using an external hwloc
|
||
|
installation. Thanks to Erick Schnetter for reporting the error and
|
||
|
helping track down the source of the problem.
|
||
|
- Fixed linker error on OS X when using the clang compiler. Thanks to
|
||
|
Erick Schnetter for reporting the error and helping track down the
|
||
|
source of the problem.
|
||
|
- Fixed MPI_THREAD_MULTIPLE deadlock error in the vader BTL. Thanks
|
||
|
to Thomas Klimpel for reporting the issue.
|
||
|
- Fixed several Valgrind warnings. Thanks for Lisandro Dalcín for
|
||
|
contributing a patch fixing some one-sided code paths.
|
||
|
- Fixed version compatibility test in OOB that broke ABI within the
|
||
|
1.8 series. NOTE: this will not resolve the problem between pre-1.8.5
|
||
|
versions, but will fix it going forward.
|
||
|
- Fix some issues related to running on Intel Xeon Phi coprocessors.
|
||
|
- Opportunistically switch away from using GNU Libtool's libltdl
|
||
|
library when possible (by default).
|
||
|
- Fix some VampirTrace errors. Thanks to Paul Hargrove for reporting
|
||
|
the issues.
|
||
|
- Correct default binding patterns when --use-hwthread-cpus was
|
||
|
specified and nprocs <= 2.
|
||
|
- Fix warnings about -finline-functions when compiling with clang.
|
||
|
- Updated the embedded hwloc with several bug fixes, including the
|
||
|
"duplicate Lhwloc1 symbol" that multiple users reported on some
|
||
|
platforms.
|
||
|
- Do not error when mpirun is invoked with with default bindings
|
||
|
(i.e., no binding was specified), and one or more nodes do not
|
||
|
support bindings. Thanks to Annu Desari for pointing out the
|
||
|
problem.
|
||
|
- Let root invoke "mpirun --version" to check the version without
|
||
|
printing the "Don't run as root!" warnings. Thanks to Robert McLay
|
||
|
for the suggestion.
|
||
|
- Fixed several bugs in OpenSHMEM support.
|
||
|
- Extended vader shared memory support to 32-bit architectures.
|
||
|
- Fix handling of very large datatypes. Thanks to Bogdan Sataric for
|
||
|
the bug report.
|
||
|
- Fixed a bug in handling subarray MPI datatypes, and a bug when using
|
||
|
MPI_LB and MPI_UB. Thanks to Gus Correa for pointing out the issue.
|
||
|
- Restore user-settable bandwidth and latency PML MCA variables.
|
||
|
- Multiple bug fixes for cleanup during MPI_FINALIZE in unusual
|
||
|
situations.
|
||
|
- Added support for TCP keepalive signals to ensure timely termination
|
||
|
when sockets between daemons cannot be created (e.g., due to a
|
||
|
firewall).
|
||
|
- Added MCA parameter to allow full use of a SLURM allocation when
|
||
|
started from a tool (supports LLNL debugger).
|
||
|
- Fixed several bugs in the configure logic for PMI and hwloc.
|
||
|
- Fixed incorrect interface index in TCP communications setup. Thanks
|
||
|
to Mark Kettenis for spotting the problem and providing a patch.
|
||
|
- Fixed MPI_IREDUCE_SCATTER with single-process communicators when
|
||
|
MPI_IN_PLACE was not used.
|
||
|
- Added XRC support for OFED v3.12 and higher.
|
||
|
- Various updates and bug fixes to the Mellanox hcoll collective
|
||
|
support.
|
||
|
- Fix problems with Fortran compilers that did not support
|
||
|
REAL*16/COMPLEX*32 types. Thanks to Orion Poplawski for identifying
|
||
|
the issue.
|
||
|
- Fixed problem with rpath/runpath support in pkg-config files.
|
||
|
Thanks to Christoph Junghans for notifying us of the issue.
|
||
|
- Man page fixes:
|
||
|
- Removed erroneous "color" discussion from MPI_COMM_SPLIT_TYPE.
|
||
|
Thanks to Erick Schnetter for spotting the outdated text.
|
||
|
- Fixed prototypes for MPI_IBARRIER. Thanks to Maximilian for
|
||
|
finding the issue.
|
||
|
- Updated docs about buffer usage in non-blocking communications.
|
||
|
Thanks to Alexander Pozdneev for citing the outdated text.
|
||
|
- Added documentation about the 'ompi_unique' MPI_Info key with
|
||
|
MPI_PUBLISH_NAME.
|
||
|
- Fixed typo in MPI_INTERCOMM_MERGE. Thanks to Harald Servat for
|
||
|
noticing and sending a patch.
|
||
|
- Updated configure paths in HACKING. Thanks to Maximilien Levesque
|
||
|
for the fix.
|
||
|
- Fixed Fortran typo in MPI_WIN_LOCK_ALL. Thanks to Thomas Jahns
|
||
|
for pointing out the issue.
|
||
|
- Fixed a number of MPI one-sided bugs.
|
||
|
- Fixed MPI_COMM_SPAWN when invoked from a singleton job.
|
||
|
- Fixed a number of minor issues with CUDA support, including
|
||
|
registering of shared memory and supporting reduction support for
|
||
|
GPU buffers.
|
||
|
- Improved support for building OMPI on Cray platforms.
|
||
|
- Fixed performance regression introduced by the inadvertent default
|
||
|
enabling of MPI_THREAD_MULTIPLE support.
|
||
|
|
||
|
|
||
|
1.8.4: 19 Dec 2014
|
||
|
------------------
|
||
|
|
||
|
- Fix MPI_SIZEOF; now available in mpif.h for modern Fortran compilers
|
||
|
(see README for more details). Also fixed various compiler/linker
|
||
|
errors.
|
||
|
- Fixed inadvertant Fortran ABI break between v1.8.1 and v1.8.2 in the
|
||
|
mpi interface module when compiled with gfortran >= v4.9.
|
||
|
- Fix various MPI_THREAD_MULTIPLE issues in the TCP BTL.
|
||
|
- mpirun no longer requires the --hetero-nodes switch; it will
|
||
|
automatically detect when running in heterogeneous scenarios.
|
||
|
- Update LSF support, to include revamped affinity functionality.
|
||
|
- Update embedded hwloc to v1.9.1.
|
||
|
- Fixed max registerable memory computation in the openib BTL.
|
||
|
- Updated error message when debuggers are unable to find various
|
||
|
symbols/types to be more clear. Thanks to Dave Love for raising the
|
||
|
issue.
|
||
|
- Added proper support for LSF and PBS/Torque libraries in static builds.
|
||
|
- Rankfiles now support physical processor IDs.
|
||
|
- Fixed potential hang in MPI_ABORT.
|
||
|
- Fixed problems with the PSM MTL and "re-connect" scenarios, such as
|
||
|
MPI_INTERCOMM_CREATE.
|
||
|
- Fix MPI_IREDUCE_SCATTER with a single process.
|
||
|
- Fix (rare) race condition in stdout/stderr funneling to mpirun where
|
||
|
some trailing output could get lost when a process terminated.
|
||
|
- Removed inadvertent change that set --enable-mpi-thread-multiple "on"
|
||
|
by default, thus impacting performance for non-threaded apps.
|
||
|
- Significantly reduced startup time by optimizing internal hash table
|
||
|
implementation.
|
||
|
- Fixed OS X linking with the Fortran mpi module when used with
|
||
|
gfortran >= 4.9. Thanks to Github user yafshar for raising the
|
||
|
issue.
|
||
|
- Fixed memory leak on Cygwin platforms. Thanks for Marco Atzeri for
|
||
|
reporting the issue.
|
||
|
- Fixed seg fault in neighborhood collectives when the degree of the
|
||
|
topology is higher than the communicator size. Thanks to Lisandro
|
||
|
Dalcín for reporting the issue.
|
||
|
- Fixed segfault in neighborhood collectives under certain use-cases.
|
||
|
- Fixed various issues regarding Solaris support. Thanks to Siegmar
|
||
|
Gross for patiently identifying all the issues.
|
||
|
- Fixed PMI configure tests for certain Slurm installation patterns.
|
||
|
- Fixed param registration issue in Java bindings. Thanks to Takahiro
|
||
|
Kawashima and Siegmar Gross for identifying the issue.
|
||
|
- Several man page fixes.
|
||
|
- Silence several warnings and close some memory leaks (more remain,
|
||
|
but it's better than it was).
|
||
|
- Re-enabled the use of CMA and knem in the shared memory BTL.
|
||
|
- Updated mpirun manpage to correctly explain new map/rank/binding options.
|
||
|
- Fixed MPI_IALLGATHER problem with intercommunicators. Thanks for
|
||
|
Takahiro Kawashima for the patch.
|
||
|
- Numerous updates and performance improvements to OpenSHMEM.
|
||
|
- Turned off message coalescing in the openib BTL until a proper fix
|
||
|
for that capability can be provided (tentatively expected for 1.8.5)
|
||
|
- Fix a bug in iof output that dates back to the dinosaurs which would
|
||
|
output extra bytes if the system was very heavily loaded
|
||
|
- Fix a bug where specifying mca_component_show_load_errors=0 could
|
||
|
cause ompi_info to segfault
|
||
|
- Updated valgrind suppression file
|
||
|
|
||
|
|
||
|
1.8.3: 26 Sep 2014
|
||
|
------------------
|
||
|
|
||
|
- Fixed application abort bug to ensure that MPI_Abort exits appropriately
|
||
|
and returns the provided exit status
|
||
|
- Fixed some alignment (not all) issues identified by Clang
|
||
|
- Allow CUDA-aware to work with nonblocking collectives. Forces packing to
|
||
|
happen when using GPU buffers.
|
||
|
- Fixed configure test issue with Intel 2015 Fortran compiler
|
||
|
- Fixed some PGI-related errors
|
||
|
- Provide better help message when encountering a firewall
|
||
|
- Fixed MCA parameter quoting to protect multi-word params and params
|
||
|
that contain special characters
|
||
|
- Improved the bind-to help message to clarify the defaults
|
||
|
- Add new MPI-3.1 tools interface
|
||
|
- Several performance optimizations and memory leak cleanups
|
||
|
- Turn off the coll/ml plugin unless specifically requested as it
|
||
|
remains in an experimental state
|
||
|
- Fix LSF support by adding required libraries for the latest LSF
|
||
|
releases. Thanks to Joshua Randal for supplying the initial
|
||
|
patches.
|
||
|
|
||
|
|
||
|
1.8.2: 25 Aug 2014
|
||
|
------------------
|
||
|
|
||
|
- Fix auto-wireup of OOB, allowing ORTE to automatically
|
||
|
test all available NICs
|
||
|
- "Un-deprecate" pernode, npernode, and npersocket options
|
||
|
by popular demand
|
||
|
- Add missing Fortran bindings for MPI_WIN_LOCK_ALL,
|
||
|
MPI_WIN_UNLOCK_ALL, and MPI_WIN_SYNC.
|
||
|
- Fix cascading/over-quoting in some cases with the rsh/ssh-based
|
||
|
launcher. Thanks to multiple users for raising the issue.
|
||
|
- Properly add support for gfortran 4.9 ignore TKR pragma (it was
|
||
|
erroneously only partially added in v1.7.5). Thanks to Marcus
|
||
|
Daniels for raising the issue.
|
||
|
- Update/improve help messages in the usnic BTL.
|
||
|
- Resolve a race condition in MPI_Abort.
|
||
|
- Fix obscure cases where static linking from wrapper compilers would
|
||
|
fail.
|
||
|
- Clarify the configure --help message about when OpenSHMEM is
|
||
|
enabled/disabled by default. Thanks to Paul Hargrove for the
|
||
|
suggestion.
|
||
|
- Align pages properly where relevant. Thanks to Paul Hargrove for
|
||
|
identifying the issue.
|
||
|
- Various compiler warning and minor fixes for OpenBSD, FreeBSD, and
|
||
|
Solaris/SPARC. Thanks to Paul Hargrove for the patches.
|
||
|
- Properly pass function pointers from Fortran to C in the mpi_f08
|
||
|
module, thereby now supporting gfortran 4.9. Thanks to Tobias
|
||
|
Burnus for assistance and testing with this issue.
|
||
|
- Improve support for Cray CLE 5.
|
||
|
- Fix mpirun regression: ensure exit status is non-zero if mpirun is
|
||
|
terminated due to signal.
|
||
|
- Improved CUDA efficiency of asynchronous copies.
|
||
|
- Fix to parameter type in MPI_Type_indexed.3. Thanks to Bastian
|
||
|
Beischer for reporting the mistake.
|
||
|
- Fix NUMA distance calculations in the openib BTL.
|
||
|
- Decrease time required to shut down mpirun at the end of a job.
|
||
|
- More RMA fixes.
|
||
|
- More hostfile fixes from Tetsuya Mishima.
|
||
|
- Fix darray issue where UB was not computed correctly.
|
||
|
- Fix mpi_f08 parameter name for MPI_GET_LIBRARY_VERSION. Thanks to
|
||
|
Junchao Zhang for pointing out the issue.
|
||
|
- Ensure mpirun aborts properly when unable to map processes in
|
||
|
scheduled environments.
|
||
|
- Ensure that MPI RMA error codes show up properly. Thanks to
|
||
|
Lisandro Dalcín for reporting the issue.
|
||
|
- Minor bug fixes and improvements to the bash and zsh mpirun
|
||
|
autocompletion scripts.
|
||
|
- Fix sequential mpirun process mapper. Thanks to Bill Chen for
|
||
|
reporting the issue.
|
||
|
- Correct SLURM stdout/stderr redirection.
|
||
|
- Added missing portals 4 files.
|
||
|
- Performance improvements for blocking sends and receives.
|
||
|
- Lots of cleanup to the ml collective component
|
||
|
- Added new Java methods to provide full MPI coverage
|
||
|
- Many OSHMEM cleanups
|
||
|
- Prevent comm_spawn from automatically launching a VM across
|
||
|
all available nodes
|
||
|
- Close many memory leaks to achieve valgrind-clean operation
|
||
|
- Better handling of TCP connection discovery for mismatched networks
|
||
|
where we don't have a direct 1:1 subnet match between nodes
|
||
|
- Prevent segfault when OMPI info tools are used in pipes and user
|
||
|
exits one step of that pipe before completing output
|
||
|
|
||
|
|
||
|
1.8.1: 23 Apr 2014
|
||
|
------------------
|
||
|
|
||
|
- Fix for critical bug: mpirun removed files (but not directories)
|
||
|
from / when run as root. Thanks to Jay Fenlason and Orion Poplawski
|
||
|
for bringing the issue to our attention and helping identify the
|
||
|
fix.
|
||
|
|
||
|
|
||
|
1.8: 31 Mar 2014
|
||
|
----------------
|
||
|
|
||
|
- Commit upstream ROMIO fix for mixed NFS+local filesystem environments.
|
||
|
- Several fixes for MPI-3 one-sided support. For example,
|
||
|
arbitrary-length datatypes are now supported.
|
||
|
- Add config support for the Mellanox ConnectX 4 card.
|
||
|
- Add missing MPI_COMM_GET|SET_INFO functions, and missing
|
||
|
MPI_WEIGHTS_EMPTY and MPI_ERR_RMA_SHARED constants. Thanks to
|
||
|
Lisandro Dalcín for pointing out the issue.
|
||
|
- Update some help messages in OSHMEM, the usnic BTL, the TCP BTL, and
|
||
|
ORTE, and update documentation about ompi_info's --level option.
|
||
|
- Fix some compiler warnings.
|
||
|
- Ensure that ORTE daemons are not bound to a single processor
|
||
|
if TaskAffinity is set on by default in Slurm. Thanks to Artem Polyakov
|
||
|
for identifying the problem and providing a patch
|
||
|
|
||
|
|
||
|
1.7.5 20 Mar 2014
|
||
|
-----------------
|
||
|
|
||
|
**********************************************************************
|
||
|
* Open MPI is now fully MPI-3.0 compliant
|
||
|
**********************************************************************
|
||
|
|
||
|
- Add Linux OpenSHMEM support built on top of Open MPI's MPI
|
||
|
layer. Thanks to Mellanox for contributing this new feature.
|
||
|
- Allow restricting ORTE daemons to specific cores using the
|
||
|
orte_daemon_cores MCA param.
|
||
|
- Ensure to properly set "locality" flags for processes launched via
|
||
|
MPI dynamic functions such as MPI_COMM_SPAWN.
|
||
|
- Fix MPI_GRAPH_CREATE when nnodes is smaller than the size of the old
|
||
|
communicator.
|
||
|
- usnic BTL now supports underlying UDP transport.
|
||
|
- usnic BTL now checks for common connectivty errors at first send to
|
||
|
a remote server.
|
||
|
- Minor scalability improvements in the usnic BTL.
|
||
|
- ompi_info now lists whether the Java MPI bindings are available or not.
|
||
|
- MPI-3: mpi.h and the Fortran interfaces now report MPI_VERSION==3
|
||
|
and MPI_SUBVERSION==0.
|
||
|
- MPI-3: Added support for new RMA functions and functionality.
|
||
|
- Fix MPI_Info "const buglet. Thanks to Orion Poplawski for
|
||
|
identifying the issue.
|
||
|
- Multiple fixes to mapping/binding options. Thanks to Tetsuya Mishima
|
||
|
for his assistance.
|
||
|
- Multiple fixes for normal and abnormal process termination,
|
||
|
including singleton MPI_Abort and ensuring to kill entire process
|
||
|
groups when abnormally terminating a job.
|
||
|
- Fix DESTDIR install for javadocs. Thanks to Orion Poplawski for
|
||
|
pointing out the issue.
|
||
|
- Various performance improvements for the MPI Java bindings.
|
||
|
- OMPI now uses its own internal random number generator and will not
|
||
|
perturb srand() and friends.
|
||
|
- Some cleanups for Cygwin builds. Thanks to Marco Atzeri for the
|
||
|
patches.
|
||
|
- Add a new collective component (coll/ml) that provides substantially
|
||
|
improved performance. It is still experimental, and requires
|
||
|
setting coll_ml_priority > 0 to become active.
|
||
|
- Add version check during startup to ensure you are using the same
|
||
|
version of Open MPI on all nodes in a job.
|
||
|
- Significantly improved the performance of MPI_DIMS_CREATE for large
|
||
|
values. Thanks to Andreas Schäfer for the contribution.
|
||
|
- Removed ASYNCHRONOUS keyword from the "ignore TKR" mpi_f08 module.
|
||
|
- Deprecated the following mpirun options:
|
||
|
--bynode, --bycore, --byslot: replaced with --map-by node|core|slot.
|
||
|
--npernode, --npersocket: replaced with --map-by ppr:N:node and
|
||
|
--map-by ppr:N:socket, respectively
|
||
|
- Pick NFS "infinitely stale" fix from ROMIO upstream.
|
||
|
- Various PMI2 fixes and extension to support broader range of mappings.
|
||
|
- Improve launch performance at large scale.
|
||
|
- Add support for PBS/Torque environments that set environment
|
||
|
variables to indicate the number of slots available on each nodes.
|
||
|
Set the ras_tm_smp MCA parameter to "1" to enable this mode.
|
||
|
- Add new, more scalable endpoint exchange (commonly called "modex")
|
||
|
method that only exchanges endpoint data on a per-peer basis
|
||
|
on first message. Not all transports have been updated to use
|
||
|
this feature. Set the rte_orte_direct_modex parameter to "1"
|
||
|
to enable this mode.
|
||
|
|
||
|
|
||
|
1.7.4: 5 Feb 2014
|
||
|
-----------------
|
||
|
|
||
|
**********************************************************************
|
||
|
* CRITICAL CHANGE
|
||
|
*
|
||
|
* As of release 1.7.4, OpenMPI's default mapping, ranking, and binding
|
||
|
* settings have changed:
|
||
|
*
|
||
|
* Mapping:
|
||
|
* if #procs <= 2, default to map-by core
|
||
|
* if #procs > 2, default to map-by socket
|
||
|
* Ranking:
|
||
|
* if default mapping is used, then default to rank-by slot
|
||
|
* if map-by <obj> is given, then default to rank-by <obj>,
|
||
|
* where <obj> is whatever object we mapped against
|
||
|
* Binding:
|
||
|
* default to bind-to core
|
||
|
*
|
||
|
* Users can override any of these settings individually using the
|
||
|
* corresponding MCA parameter. Note that multi-threaded applications
|
||
|
* in particular may want to override at least the binding default
|
||
|
* to allow threads to use multiple cores.
|
||
|
**********************************************************************
|
||
|
|
||
|
- Restore version number output in "ompi_info --all".
|
||
|
- Various bug fixes for the mpi_f08 Fortran bindings.
|
||
|
- Fix ROMIO compile error with Lustre 2.4. Thanks to Adam Moody for
|
||
|
reporting the issue.
|
||
|
- Various fixes for 32 bit platforms.
|
||
|
- Add ability to selectively disable building the mpi or mpi_f08
|
||
|
module. See the README file for details.
|
||
|
- Fix MX MTL finalization issue.
|
||
|
- Fix ROMIO issue when opening a file with MPI_MODE_EXCL.
|
||
|
- Fix PowerPC and MIPS assembly issues.
|
||
|
- Various fixes to the hcoll and FCA collective offload modules.
|
||
|
- Prevent integer overflow when creating datatypes. Thanks to
|
||
|
original patch from Gilles Gouaillardet.
|
||
|
- Port some upstream hwloc fixes to Open MPI's embedded copy for
|
||
|
working around buggy NUMA node cpusets and including mising header
|
||
|
files. Thanks to Jeff Becker and Paul Hargrove for reporting the
|
||
|
issues.
|
||
|
- Fix recursive invocation issues in the MXM MTL.
|
||
|
- Various bug fixes to the new MCA parameter back-end system.
|
||
|
- Have the posix fbtl module link against -laio on NetBSD platforms.
|
||
|
Thanks to Paul Hargrove for noticing the issue.
|
||
|
- Various updates and fixes to network filesystem detection to support
|
||
|
more operating systems.
|
||
|
- Add gfortran v4.9 "ignore TKR" syntax to the mpi Fortran module.
|
||
|
- Various compiler fixes for several BSD-based platforms. Thanks to
|
||
|
Paul Hargrove for reporting the issues.
|
||
|
- Fix when MPI_COMM_SPAWN[_MULTIPLE] is used on oversubscribed
|
||
|
systems.
|
||
|
- Change the output from --report bindings to simply state that a
|
||
|
process is not bound, instead of reporting that it is bound to all
|
||
|
processors.
|
||
|
- Per MPI-3.0 guidance, remove support for all MPI subroutines with
|
||
|
choice buffers from the TKR-based mpi Fortran module. Thanks to Jed
|
||
|
Brown for raising the issue.
|
||
|
- Only allow the usnic BTL to build on 64 bit platforms.
|
||
|
- Various bug fixes to SLURM support, to include ensuring proper
|
||
|
exiting on abnormal termination.
|
||
|
- Ensure that MPI_COMM_SPAWN[_MULTIPLE] jobs get the same mapping
|
||
|
directives that were used with mpirun.
|
||
|
- Fixed the application of TCP_NODELAY.
|
||
|
- Change the TCP BTL to not warn if a non-existent interface is
|
||
|
ignored.
|
||
|
- Restored the "--bycore" mpirun option for backwards compatibility.
|
||
|
- Fixed debugger attach functionality. Thanks to Ashley Pittman for
|
||
|
reporting the issue and suggesting the fix.
|
||
|
- Fixed faulty MPI_IBCAST when invoked on a communicator with only
|
||
|
one process.
|
||
|
- Add new Mellanox device IDs to the openib BTL.
|
||
|
- Progress towards cleaning up various internal memory leaks as
|
||
|
reported by Valgrind.
|
||
|
- Fixed some annoying flex-generated warnings that have been there for
|
||
|
years. Thanks to Tom Fogal for the initial patch.
|
||
|
- Support user-provided environment variables via the "env" info key
|
||
|
to MPI_COMM_SPAWN[_MULTIPLE]. Thanks to Tom Fogal for the feature
|
||
|
request.
|
||
|
- Fix uninitialized variable in MPI_DIST_GRAPH_CREATE.
|
||
|
- Fix a variety of memory errors on SPARC platforms. Thanks to
|
||
|
Siegmar Gross for reporting and testing all the issues.
|
||
|
- Remove Solaris threads support. When building on Solaris, pthreads
|
||
|
will be used.
|
||
|
- Correctly handle the convertor internal stack for persistent
|
||
|
receives. Thanks to Guillaume Gouaillardet for identifying the
|
||
|
problem.
|
||
|
- Add support for using an external libevent via --with-libevent. See
|
||
|
the README for more details.
|
||
|
- Various OMPIO updates and fixes.
|
||
|
- Add support for the MPIEXEC_TIMEOUT environment variable. If set,
|
||
|
mpirun will terminate the job after this many seconds.
|
||
|
- Update the internal copy of ROMIO to that which shipped in MPICH
|
||
|
3.0.4.
|
||
|
- Various performance tweaks and improvements in the usnic BTL,
|
||
|
including now reporting MPI_T performance variables for each usnic
|
||
|
device.
|
||
|
- Fix to not access send datatypes for non-root processes with
|
||
|
MPI_ISCATTER[V] and MPI_IGATHER[V]. Thanks to Pierre Jolivet for
|
||
|
supplying the initial patch.
|
||
|
- Update VampirTrace to 5.14.4.9.
|
||
|
- Fix ptmalloc2 hook disable when used with ummunotify.
|
||
|
- Change the default connection manager for the openib BTL to be based
|
||
|
on UD verbs data exchanges instead of ORTE OOB data exchanges.
|
||
|
- Fix Fortran compile error when compiling with 8-byte INTEGERs and
|
||
|
4-byte ints.
|
||
|
- Fix C++11 issue identified by Jeremiah Willcock.
|
||
|
- Many changes, updates, and bug fixes to the ORTE run-time layer.
|
||
|
- Correctly handle MPI_REDUCE_SCATTER with recvcounts of 0.
|
||
|
- Update man pages for MPI-3, and add some missing man pages for
|
||
|
MPI-2.x functions.
|
||
|
- Updated mpi_f08 module in accordance with post-MPI-3.0 errata which
|
||
|
basically removed BIND(C) from all interfaces.
|
||
|
- Fixed MPI_IN_PLACE detection for MPI_SCATTER[V] in Fortran
|
||
|
routines. Thanks to Charles Gerlach for identifying the issue.
|
||
|
- Added support for routable RoCE to the openib BTL.
|
||
|
- Update embedded hwloc to v1.7.2.
|
||
|
- ErrMgr framework redesigned to better support fault tolerance development
|
||
|
activities. See the following RFC for details:
|
||
|
http://www.open-mpi.org/community/lists/devel/2010/03/7589.php
|
||
|
- Added database framework to OPAL and changed all modex operations
|
||
|
to flow thru it, also included additional system info in the
|
||
|
available data
|
||
|
- Added staged state machine to support sequential work flows
|
||
|
- Added distributed file system support for accessing files across
|
||
|
nodes that do not have networked file systems
|
||
|
- Extended filem framework to support scalable pre-positioning of
|
||
|
files for use by applications, adding new "raw" component that
|
||
|
transmits files across the daemon network
|
||
|
- Native Windows support has been removed. A cygwin package is
|
||
|
available from that group for Windows-based use.
|
||
|
- Added new MPI Java bindings. See the Javadocs for more details on
|
||
|
the API.
|
||
|
- Wrapper compilers now add rpath support by default to generated
|
||
|
executables on systems that support it. This behavior can be
|
||
|
disabled via --disable-wrapper-rpath. See note in README about ABI
|
||
|
issues when using rpath in MPI applications.
|
||
|
- Added a new parallel I/O component and multiple new frameworks to
|
||
|
support parallel I/O operations.
|
||
|
- Fixed MPI_STATUS_SIZE Fortran issue when used with 8-byte Fortran
|
||
|
INTEGERs and 4-byte C ints. Since this issue affects ABI, it is
|
||
|
only enabled if Open MPI is configured with
|
||
|
--enable-abi-breaking-fortran-status-i8-fix. Thanks to Jim Parker
|
||
|
for supplying the initial patch.
|
||
|
- Add support for Intel Phi SCIF transport.
|
||
|
- For CUDA-aware MPI configured with CUDA 6.0, use new pointer
|
||
|
attribute to avoid extra synchronization in stream 0 when using
|
||
|
CUDA IPC between GPUs on the same node.
|
||
|
- For CUDA-aware MPI configured with CUDA 6.0, compile in support
|
||
|
of GPU Direct RDMA in openib BTL to improve small message latency.
|
||
|
- Updated ROMIO from MPICH v3.0.4.
|
||
|
- MPI-3: Added support for remaining non-blocking collectives.
|
||
|
- MPI-3: Added support for neighborhood collectives.
|
||
|
- MPI-3: Updated C bindings with consistent use of [].
|
||
|
- MPI-3: Added the const keyword to read-only buffers.
|
||
|
- MPI-3: Added support for non-blocking communicator duplication.
|
||
|
- MPI-3: Added support for non-collective communicator creation.
|
||
|
|
||
|
|
||
|
1.7.3: 17 Oct 2013
|
||
|
------------------
|
||
|
|
||
|
- Make CUDA-aware support dynamically load libcuda.so so CUDA-aware
|
||
|
MPI library can run on systems without CUDA software.
|
||
|
- Fix various issues with dynamic processes and intercommunicator
|
||
|
operations under Torque. Thanks to Suraj Prabhakaran for reporting
|
||
|
the problem.
|
||
|
- Enable support for the Mellanox MXM2 library by default.
|
||
|
- Improve support for Portals 4.
|
||
|
- Various Solaris fixes. Many thanks to Siegmar Gross for his
|
||
|
incredible patience in reporting all the issues.
|
||
|
- MPI-2.2: Add reduction support for MPI_C_*COMPLEX and MPI::*COMPLEX.
|
||
|
- Fixed internal accounting when openpty() fails. Thanks to Michal
|
||
|
Peclo for reporting the issue and providing a patch.
|
||
|
- Fixed too-large memory consumption in XRC mode of the openib BTL.
|
||
|
Thanks to Alexey Ryzhikh for the patch.
|
||
|
- Add bozo check for negative np values to mpirun to prevent a
|
||
|
deadlock. Thanks to Upinder Malhi for identifying the issue.
|
||
|
- Fixed MPI_IS_THREAD_MAIN behavior. Thanks to Lisandro Dalcín for
|
||
|
pointing out the problem.
|
||
|
- Various rankfile fixes.
|
||
|
- Fix functionality over iWARP devices.
|
||
|
- Various memory and performance optimizations and tweaks.
|
||
|
- Fix MPI_Cancel issue identified by Fujitsu.
|
||
|
- Add missing support for MPI_Get_address in the "use mpi" TKR
|
||
|
implementation. Thanks to Hugo Gagnon for identifying the issue.
|
||
|
- MPI-3: Add support for MPI_Count.
|
||
|
- MPI-2.2: Add missing MPI_IN_PLACE support for MPI_ALLTOALL.
|
||
|
- Added new usnic BTL to support the Cisco usNIC device.
|
||
|
- Minor VampirTrace update to 5.14.4.4.
|
||
|
- Removed support for ancient OS X systems (i.e., prior to 10.5).
|
||
|
- Fixed obscure packing/unpacking datatype bug. Thanks to Takahiro
|
||
|
Kawashima for identifying the issue.
|
||
|
- Add run-time support for PMI2 environments.
|
||
|
- Update openib BTL default parameters to include support for Mellanox
|
||
|
ConnectX3-Pro devices.
|
||
|
- Update libevent to v2.0.21.
|
||
|
- "ompi_info --param TYPE PLUGIN" now only shows a small number of MCA
|
||
|
parameters by default. Add "--level 9" or "--all" to see *all* MCA
|
||
|
parameters. See README for more details.
|
||
|
- Add support for asynchronous CUDA-aware copies.
|
||
|
- Add support for Mellanox MPI collective operation offload via the
|
||
|
"hcoll" library.
|
||
|
- MPI-3: Add support for the MPI_T interface. Open MPI's MCA
|
||
|
parameters are now accessible via the MPI_T control variable
|
||
|
interface. Support has been added for a small number of MPI_T
|
||
|
performance variables.
|
||
|
- Add Gentoo memory hooks override. Thanks to Justin Bronder for the
|
||
|
patch.
|
||
|
- Added new "mindist" process mapper, allowing placement of processes
|
||
|
via PCI locality information reported by the BIOS.
|
||
|
- MPI-2.2: Add support for MPI_Dist_graph functionality.
|
||
|
- Enable generic, client-side support for PMI2 implementations. Can
|
||
|
be leveraged by any resource manager that implements PMI2; e.g. SLURM,
|
||
|
versions 2.6 and higher.
|
||
|
|
||
|
1.7.2: 26 Jun 2013
|
||
|
------------------
|
||
|
|
||
|
- Major VampirTrace update to 5.14.4.2.
|
||
|
(** also appeared: 1.6.5)
|
||
|
- Fix to set flag==1 when MPI_IPROBE is called with MPI_PROC_NULL.
|
||
|
(** also appeared: 1.6.5)
|
||
|
- Set the Intel Phi device to be ignored by default by the openib BTL.
|
||
|
(** also appeared: 1.6.5)
|
||
|
- Decrease the internal memory storage used by intrinsic MPI datatypes
|
||
|
for Fortran types. Thanks to Takahiro Kawashima for the initial
|
||
|
patch.
|
||
|
(** also appeared: 1.6.5)
|
||
|
- Fix total registered memory calculation for Mellanox ConnectIB and
|
||
|
OFED 2.0.
|
||
|
(** also appeared: 1.6.5)
|
||
|
- Fix possible data corruption in the MXM MTL component.
|
||
|
(** also appeared: 1.6.5)
|
||
|
- Remove extraneous -L from hwloc's embedding. Thanks to Stefan
|
||
|
Friedel for reporting the issue.
|
||
|
(** also appeared: 1.6.5)
|
||
|
- Fix contiguous datatype memory check. Thanks to Eric Chamberland
|
||
|
for reporting the issue.
|
||
|
(** also appeared: 1.6.5)
|
||
|
- Make the openib BTL more friendly to ignoring verbs devices that are
|
||
|
not RC-capable.
|
||
|
(** also appeared: 1.6.5)
|
||
|
- Fix some MPI datatype engine issues. Thanks to Thomas Jahns for
|
||
|
reporting the issue.
|
||
|
(** also appeared: 1.6.5)
|
||
|
- Add INI information for Chelsio T5 device.
|
||
|
(** also appeared: 1.6.5)
|
||
|
- Integrate MXM STREAM support for MPI_ISEND and MPI_IRECV, and other
|
||
|
minor MXM fixes.
|
||
|
(** also appeared: 1.6.5)
|
||
|
- Fix to not show amorphous "MPI was already finalized" error when
|
||
|
failing to MPI_File_close an open file. Thanks to Brian Smith for
|
||
|
reporting the issue.
|
||
|
(** also appeared: 1.6.5)
|
||
|
- Add a distance-based mapping component to find the socket "closest"
|
||
|
to the PCI bus.
|
||
|
- Fix an error that caused epoll to automatically be disabled
|
||
|
in libevent.
|
||
|
- Upgrade hwloc to 1.5.2.
|
||
|
- *Really* fixed XRC compile issue in Open Fabrics support.
|
||
|
- Fix MXM connection establishment flow.
|
||
|
- Fixed parallel debugger ability to attach to MPI jobs.
|
||
|
- Fixed some minor memory leaks.
|
||
|
- Fixed datatype corruption issue when combining datatypes of specific
|
||
|
formats.
|
||
|
- Added Location Aware Mapping Algorithm (LAMA) mapping component.
|
||
|
- Fixes for MPI_STATUS handling in corner cases.
|
||
|
- Add a distance-based mapping component to find the socket "closest"
|
||
|
to the PCI bus.
|
||
|
|
||
|
|
||
|
1.7.1: 16 Apr 2013
|
||
|
------------------
|
||
|
|
||
|
- Fixed compile error when --without-memory-manager was specified
|
||
|
on Linux
|
||
|
- Fixed XRC compile issue in Open Fabrics support.
|
||
|
|
||
|
|
||
|
1.7: 1 Apr 2013
|
||
|
---------------
|
||
|
|
||
|
- Added MPI-3 functionality:
|
||
|
- MPI_GET_LIBRARY_VERSION
|
||
|
- Matched probe
|
||
|
- MPI_TYPE_CREATE_HINDEXED_BLOCK
|
||
|
- Non-blocking collectives
|
||
|
- MPI_INFO_ENV support
|
||
|
- Fortran '08 bindings (see below)
|
||
|
- Dropped support for checkpoint/restart due to loss of maintainer :-(
|
||
|
- Enabled compile-time warning of deprecated MPI functions by default
|
||
|
(in supported compilers).
|
||
|
- Revamped Fortran MPI bindings (see the README for details):
|
||
|
- "mpifort" is now the preferred wrapper compiler for Fortran
|
||
|
- Added "use mpi_f08" bindings (for compilers that support it)
|
||
|
- Added better "use mpi" support (for compilers that support it)
|
||
|
- Removed incorrect MPI_SCATTERV interface from "mpi" module that
|
||
|
was added in the 1.5.x series for ABI reasons.
|
||
|
- Lots of VampirTrace upgrades and fixes; upgrade to v5.14.3.
|
||
|
- Modified process affinity system to provide warning when bindings
|
||
|
result in being "bound to all", which is equivalent to not being
|
||
|
bound.
|
||
|
- Removed maffinity, paffinity, and carto frameworks (and associated
|
||
|
MCA params).
|
||
|
- Upgraded to hwloc v1.5.1.
|
||
|
- Added performance improvements to the OpenIB (OpenFabrics) BTL.
|
||
|
- Made malloc hooks more friendly to IO interprosers. Thanks to the
|
||
|
bug report and suggested fix from Darshan maintainer Phil Carns.
|
||
|
- Added support for the DMTCP checkpoint/restart system.
|
||
|
- Added support for the Cray uGNI interconnect.
|
||
|
- Fixed header file problems on OpenBSD.
|
||
|
- Fixed issue with MPI_TYPE_CREATE_F90_REAL.
|
||
|
- Wrapper compilers now explicitly list/link all Open MPI libraries if
|
||
|
they detect static linking CLI arguments.
|
||
|
- Open MPI now requires a C99 compiler to build. Please upgrade your
|
||
|
C compiler if you do not have a C99-compliant compiler.
|
||
|
- Fix MPI_GET_PROCESSOR_NAME Fortran binding to set ierr properly.
|
||
|
Thanks to LANL for spotting the error.
|
||
|
- Many MXM and FCA updates.
|
||
|
- Fixed erroneous free of putenv'ed string that showed up in Valgrind
|
||
|
reports.
|
||
|
- Fixed MPI_IN_PLACE case for MPI_ALLGATHER.
|
||
|
- Fixed a bug that prevented MCA params from being forwarded to
|
||
|
daemons upon launch.
|
||
|
- Fixed issues with VT and CUDA --with-cuda[-libdir] configuration CLI
|
||
|
parameters.
|
||
|
- Entirely new implementation of many MPI collective routines focused
|
||
|
on better performance.
|
||
|
- Revamped autogen / build system.
|
||
|
- Add new sensor framework to ORTE that includes modules for detecting
|
||
|
stalled applications and processes that consume too much memory.
|
||
|
- Added new state machine framework to ORTE that converts ORTE into an
|
||
|
event-driven state machine using the event library.
|
||
|
- Added a new MCA parameter (ess_base_stream_buffering) that allows the user
|
||
|
to override the system default for buffering of stdout/stderr streams
|
||
|
(via setvbuf). Parameter is not visible via ompi_info.
|
||
|
- Revamped the launch system to allow consideration of node hardware
|
||
|
in assigning process locations and bindings.
|
||
|
- Added the -novm option to preserve the prior launch behavior.
|
||
|
- Revamped the process mapping system to utilize node hardware by adding
|
||
|
new map-by, rank-by, and bind-to cmd line options.
|
||
|
- Added new MCA parameter to provide protection against IO forwarding
|
||
|
backlog.
|
||
|
- Dropped support for native Windows due to loss of maintainers. :-(
|
||
|
- Added a new parallel I/O component and multiple new frameworks to
|
||
|
support parallel I/O operations.
|
||
|
- Fix typo in orte_setup_hadoop.m4. Thanks to Aleksej Saushev for
|
||
|
reporting it
|
||
|
- Fix a very old error in opal_path_access(). Thanks to Marco Atzeri
|
||
|
for chasing it down.
|
||
|
|
||
|
|
||
|
1.6.6: Not released
|
||
|
-------------------
|
||
|
|
||
|
- Prevent integer overflow in datatype creation. Thanks to Gilles
|
||
|
Gouaillardet for identifying the problem and providing a preliminary
|
||
|
version of the patch.
|
||
|
- Ensure help-opal-hwloc-base.txt is included in distribution
|
||
|
tarballs. Thanks to Gilles Gouaillardet for supplying the patch.
|
||
|
- Correctly handle the invalid status for NULL and inactive requests.
|
||
|
Thanks to KAWASHIMA Takahiro for submitting the initial patch.
|
||
|
- Fixed MPI_STATUS_SIZE Fortran issue when used with 8-byte Fortran
|
||
|
INTEGERs and 4-byte C ints. Since this issue affects ABI, it is
|
||
|
only enabled if Open MPI is configured with
|
||
|
--enable-abi-breaking-fortran-status-i8-fix. Thanks to Jim Parker
|
||
|
for supplying the initial patch.
|
||
|
- Fix datatype issue for sending from the middle of non-contiguous
|
||
|
data.
|
||
|
- Fixed failure error with pty support. Thanks to Michal Pecio for
|
||
|
the patch.
|
||
|
- Fixed debugger support for direct-launched jobs.
|
||
|
- Fix MPI_IS_THREAD_MAIN to return the correct value. Thanks to
|
||
|
Lisandro Dalcín for pointing out the issue.
|
||
|
- Update VT to 5.14.4.4:
|
||
|
- Fix C++-11 issue.
|
||
|
- Fix support for building RPMs on Fedora with CUDA libraries.
|
||
|
- Add openib part number for ConnectX3-Pro HCA.
|
||
|
- Ensure to check that all resolved IP addresses are local.
|
||
|
- Fix MPI_COMM_SPAWN via rsh when mpirun is on a different server.
|
||
|
- Add Gentoo "sandbox" memory hooks override.
|
||
|
|
||
|
|
||
|
1.6.5: 26 Jun 2013
|
||
|
------------------
|
||
|
|
||
|
- Updated default SRQ parameters for the openib BTL.
|
||
|
(** also to appear: 1.7.2)
|
||
|
- Major VampirTrace update to 5.14.4.2.
|
||
|
(** also to appear: 1.7.2)
|
||
|
- Fix to set flag==1 when MPI_IPROBE is called with MPI_PROC_NULL.
|
||
|
(** also to appear: 1.7.2)
|
||
|
- Set the Intel Phi device to be ignored by default by the openib BTL.
|
||
|
(** also to appear: 1.7.2)
|
||
|
- Decrease the internal memory storage used by intrinsic MPI datatypes
|
||
|
for Fortran types. Thanks to Takahiro Kawashima for the initial
|
||
|
patch.
|
||
|
(** also to appear: 1.7.2)
|
||
|
- Fix total registered memory calculation for Mellanox ConnectIB and
|
||
|
OFED 2.0.
|
||
|
(** also to appear: 1.7.2)
|
||
|
- Fix possible data corruption in the MXM MTL component.
|
||
|
(** also to appear: 1.7.2)
|
||
|
- Remove extraneous -L from hwloc's embedding. Thanks to Stefan
|
||
|
Friedel for reporting the issue.
|
||
|
(** also to appear: 1.7.2)
|
||
|
- Fix contiguous datatype memory check. Thanks to Eric Chamberland
|
||
|
for reporting the issue.
|
||
|
(** also to appear: 1.7.2)
|
||
|
- Make the openib BTL more friendly to ignoring verbs devices that are
|
||
|
not RC-capable.
|
||
|
(** also to appear: 1.7.2)
|
||
|
- Fix some MPI datatype engine issues. Thanks to Thomas Jahns for
|
||
|
reporting the issue.
|
||
|
(** also to appear: 1.7.2)
|
||
|
- Add INI information for Chelsio T5 device.
|
||
|
(** also to appear: 1.7.2)
|
||
|
- Integrate MXM STREAM support for MPI_ISEND and MPI_IRECV, and other
|
||
|
minor MXM fixes.
|
||
|
(** also to appear: 1.7.2)
|
||
|
- Improved alignment for OpenFabrics buffers.
|
||
|
- Fix to not show amorphous "MPI was already finalized" error when
|
||
|
failing to MPI_File_close an open file. Thanks to Brian Smith for
|
||
|
reporting the issue.
|
||
|
(** also to appear: 1.7.2)
|
||
|
|
||
|
|
||
|
1.6.4: 21 Feb 2013
|
||
|
------------------
|
||
|
|
||
|
- Fix Cygwin shared memory and debugger plugin support. Thanks to
|
||
|
Marco Atzeri for reporting the issue and providing initial patches.
|
||
|
- Fix to obtaining the correct available nodes when a rankfile is
|
||
|
providing the allocation. Thanks to Siegmar Gross for reporting the
|
||
|
problem.
|
||
|
- Fix process binding issue on Solaris. Thanks to Siegmar Gross for
|
||
|
reporting the problem.
|
||
|
- Updates for MXM 2.0.
|
||
|
- Major VT update to 5.14.2.3.
|
||
|
- Fixed F77 constants for Cygwin/Cmake build.
|
||
|
- Fix a linker error when configuring --without-hwloc.
|
||
|
- Automatically provide compiler flags that compile properly on some
|
||
|
types of ARM systems.
|
||
|
- Fix slot_list behavior when multiple sockets are specified. Thanks
|
||
|
to Siegmar Gross for reporting the problem.
|
||
|
- Fixed memory leak in one-sided operations. Thanks to Victor
|
||
|
Vysotskiy for letting us know about this one.
|
||
|
- Added performance improvements to the OpenIB (OpenFabrics) BTL.
|
||
|
- Improved error message when process affinity fails.
|
||
|
- Fixed MPI_MINLOC on man pages for MPI_REDUCE(_LOCAL). Thanks to Jed
|
||
|
Brown for noticing the problem and supplying a fix.
|
||
|
- Made malloc hooks more friendly to IO interprosers. Thanks to the
|
||
|
bug report and suggested fix from Darshan maintainer Phil Carns.
|
||
|
- Restored ability to direct launch under SLURM without PMI support.
|
||
|
- Fixed MPI datatype issues on OpenBSD.
|
||
|
- Major VT update to 5.14.2.3.
|
||
|
- Support FCA v3.0+.
|
||
|
- Fixed header file problems on OpenBSD.
|
||
|
- Fixed issue with MPI_TYPE_CREATE_F90_REAL.
|
||
|
- Fix an issue with using external libltdl installations. Thanks to
|
||
|
opolawski for identifying the problem.
|
||
|
- Fixed MPI_IN_PLACE case for MPI_ALLGATHER for FCA.
|
||
|
- Allow SLURM PMI support to look in lib64 directories. Thanks to
|
||
|
Guillaume Papaure for the patch.
|
||
|
- Restore "use mpi" ABI compatibility with the rest of the 1.5/1.6
|
||
|
series (except for v1.6.3, where it was accidentally broken).
|
||
|
- Fix a very old error in opal_path_access(). Thanks to Marco Atzeri
|
||
|
for chasing it down.
|
||
|
|
||
|
|
||
|
1.6.3: 30 Oct 2012
|
||
|
------------------
|
||
|
|
||
|
- Fix mpirun --launch-agent behavior when a prefix is specified.
|
||
|
Thanks to Reuti for identifying the issue.
|
||
|
- Fixed memchecker configury.
|
||
|
- Brought over some compiler warning squashes from the development trunk.
|
||
|
- Fix spawning from a singleton to multiple hosts when the "add-host"
|
||
|
MPI_Info key is used. Thanks to Brian Budge for pointing out the
|
||
|
problem.
|
||
|
- Add Mellanox ConnextIB IDs and max inline value.
|
||
|
- Fix rankfile when no -np is given.
|
||
|
- FreeBSD detection improvement. Thanks to Brooks Davis for the
|
||
|
patch.
|
||
|
- Removed TCP warnings on Windows.
|
||
|
- Improved collective algorithm selection for very large messages.
|
||
|
- Fix PSM MTL affinity settings.
|
||
|
- Fix issue with MPI_OP_COMMUTATIVE in the mpif.h bindings. Thanks to
|
||
|
Åke Sandgren for providing a patch to fix the issue.
|
||
|
- Fix issue with MPI_SIZEOF when using CHARACTER and LOGICAL types in
|
||
|
the mpi module. Thanks to Åke Sandgren for providing a patch to fix
|
||
|
the issue.
|
||
|
|
||
|
|
||
|
1.6.2: 25 Sep 2012
|
||
|
------------------
|
||
|
|
||
|
- Fix issue with MX MTL. Thanks to Doug Eadline for raising the issue.
|
||
|
- Fix singleton MPI_COMM_SPAWN when the result job spans multiple nodes.
|
||
|
- Fix MXM hang, and update for latest version of MXM.
|
||
|
- Update to support Mellanox FCA 2.5.
|
||
|
- Fix startup hang for large jobs.
|
||
|
- Ensure MPI_TESTANY / MPI_WAITANY properly set the empty status when
|
||
|
count==0.
|
||
|
- Fix MPI_CART_SUB behavior of not copying periods to the new
|
||
|
communicator properly. Thanks to John Craske for the bug report.
|
||
|
- Add btl_openib_abort_not_enough_reg_mem MCA parameter to cause Open
|
||
|
MPI to abort MPI jobs if there is not enough registered memory
|
||
|
available on the system (vs. just printing a warning). Thanks to
|
||
|
Brock Palen for raising the issue.
|
||
|
- Minor fix to Fortran MPI_INFO_GET: only copy a value back to the
|
||
|
user's buffer if the flag is .TRUE.
|
||
|
- Fix VampirTrace compilation issue with the PGI compiler suite.
|
||
|
|
||
|
|
||
|
1.6.1: 22 Aug 2012
|
||
|
------------------
|
||
|
|
||
|
- A bunch of changes to eliminate hangs on OpenFabrics-based networks.
|
||
|
Users with Mellanox hardware are ***STRONGLY ENCOURAGED*** to check
|
||
|
their registered memory kernel module settings to ensure that the OS
|
||
|
will allow registering more than 8GB of memory. See this FAQ item
|
||
|
for details:
|
||
|
|
||
|
http://www.open-mpi.org/faq/?category=openfabrics#ib-low-reg-mem
|
||
|
|
||
|
- Fall back to send/receive semantics if registered memory is
|
||
|
unavilable for RDMA.
|
||
|
- Fix two fragment leaks when registered memory is exhausted.
|
||
|
- Hueristically determine how much registered memory is available
|
||
|
and warn if it's significantly less than all of RAM.
|
||
|
- Artifically limit the amount of registered memory each MPI process
|
||
|
can use to about 1/Nth to total registered memory available.
|
||
|
- Improve error messages when events occur that are likely due to
|
||
|
unexpected registered memory exhaustion.
|
||
|
|
||
|
- Fix double semicolon error in the C++ in <mpi.h>. Thanks to John
|
||
|
Foster for pointing out the issue.
|
||
|
- Allow -Xclang to be specified multiple times in CFLAGS. Thanks to
|
||
|
P. Martin for raising the issue.
|
||
|
- Break up a giant "print *" statement in the ABI-preserving incorrect
|
||
|
MPI_SCATTER interface in the "large" Fortran "mpi" module. Thanks
|
||
|
to Juan Escobar for the initial patch.
|
||
|
- Switch the MPI_ALLTOALLV default algorithm to a pairwise exchange.
|
||
|
- Increase the openib BTL default CQ length to handle more types of
|
||
|
OpenFabrics devices.
|
||
|
- Lots of VampirTrace fixes; upgrade to v5.13.0.4.
|
||
|
- Map MPI_2INTEGER to underlying MPI_INTEGERs, not MPI_INTs.
|
||
|
- Ensure that the OMPI version number is toleant of handling spaces.
|
||
|
Thanks to dragonboy for identifying the issue.
|
||
|
- Fixed IN parameter marking on Fortran "mpi" module
|
||
|
MPI_COMM_TEST_INTER interface.
|
||
|
- Various MXM improvements.
|
||
|
- Make the output of "mpirun --report-bindings" much more friendly /
|
||
|
human-readable.
|
||
|
- Properly handle MPI_COMPLEX8|16|32.
|
||
|
- More fixes for mpirun's processor affinity options (--bind-to-core
|
||
|
and friends).
|
||
|
- Use aligned memory for OpenFabrics registered memory.
|
||
|
- Multiple fixes for parameter checking in MPI_ALLGATHERV,
|
||
|
MPI_REDUCE_SCATTER, MPI_SCATTERV, and MPI_GATHERV. Thanks to the
|
||
|
mpi4py community (Bennet Fauber, Lisandro Dalcín, Jonathan Dursi).
|
||
|
- Fixed file positioning overflows in MPI_FILE_GET_POSITION,
|
||
|
MPI_FILE_GET_POSITION_SHARED, FILE_GET_SIZE, FILE_GET_VIEW.
|
||
|
- Removed the broken --cpu-set mpirun option.
|
||
|
- Fix cleanup of MPI errorcodes. Thanks to Alexey Bayduraev for the
|
||
|
patch.
|
||
|
- Fix default hostfile location. Thanks to Götz Waschk for noticing
|
||
|
the issue.
|
||
|
- Improve several error messages.
|
||
|
|
||
|
|
||
|
1.6: 14 May 2012
|
||
|
----------------
|
||
|
|
||
|
- Fix some process affinity issues. When binding a process, Open MPI
|
||
|
will now bind to all available hyperthreads in a core (or socket,
|
||
|
depending on the binding options specified).
|
||
|
--> Note that "mpirun --bind-to-socket ..." does not work on POWER6-
|
||
|
and POWER7-based systems with some Linux kernel versions. See
|
||
|
the FAQ on the Open MPI web site for more information.
|
||
|
- Add support for ARM5 and ARM6 (in addition to the existing ARM7
|
||
|
support). Thanks to Evan Clinton for the patch.
|
||
|
- Minor Mellanox MXM fixes.
|
||
|
- Properly detect FDR10, FDR, and EDR OpenFabrics devices.
|
||
|
- Minor fixes to the mpirun(1) and MPI_Comm_create(3) man pages.
|
||
|
- Prevent segv if COMM_SPAWN_MULTIPLE fails. Thanks to Fujitsu for
|
||
|
the patch.
|
||
|
- Disable interposed memory management in fakeroot environments. This
|
||
|
fixes a problem in some build environments.
|
||
|
- Minor hwloc updates.
|
||
|
- Array versions of MPI_TEST and MPI_WAIT with a count==0 will now
|
||
|
return immediately with MPI_SUCCESS. Thanks to Jeremiah Willcock
|
||
|
for the suggestion.
|
||
|
- Update VampirTrace to v5.12.2.
|
||
|
- Properly handle forwarding stdin to all processes when "mpirun
|
||
|
--stdin all" is used.
|
||
|
- Workaround XLC assembly bug.
|
||
|
- OS X Tiger (10.4) has not been supported for a while, so forcibly
|
||
|
abort configure if we detect it.
|
||
|
- Fix segv in the openib BTL when running on SPARC 64 systems.
|
||
|
- Fix some include file ordering issues on some BSD-based platforms.
|
||
|
Thanks to Paul Hargove for this (and many, many other) fixes.
|
||
|
- Properly handle .FALSE. return parameter value to attribute copy
|
||
|
callback functions.
|
||
|
- Fix a bunch of minor C++ API issues; thanks to Fujitsu for the patch.
|
||
|
- Fixed the default hostfile MCA parameter behavior.
|
||
|
- Per the MPI spec, ensure not to touch the port_name parameter to
|
||
|
MPI_CLOSE_PORT (it's an IN parameter).
|
||
|
|
||
|
|
||
|
1.5.5: 27 Mar 2012
|
||
|
------------------
|
||
|
|
||
|
- Many, many portability configure/build fixes courtesy of Paul
|
||
|
Hargrove. Thanks, Paul!
|
||
|
- Fixed shared memory fault tolerance support compiler errors.
|
||
|
- Removed not-production-quality rshd and tmd PLM launchers.
|
||
|
- Minor updates to the Open MPI SRPM spec file.
|
||
|
- Fixed mpirun's --bind-to-socket option.
|
||
|
- A few MPI_THREAD_MULTIPLE fixes in the shared memory BTL.
|
||
|
- Upgrade the GNU Autotools used to bootstrap the 1.5/1.6 series to
|
||
|
all the latest versions at the time of this release.
|
||
|
- Categorically state in the README that if you're having a problem
|
||
|
with Open MPI with the Linux Intel 12.1 compilers, *upgrade your
|
||
|
Intel Compiler Suite to the latest patch version*, and the problems
|
||
|
will go away. :-)
|
||
|
- Fix the --without-memory-manager configure option.
|
||
|
- Fixes for Totalview/DDT MPI-capable debuggers.
|
||
|
- Update rsh/ssh support to properly handle the Mac OS X library path
|
||
|
(i.e., DYLD_LIBRARY_PATH).
|
||
|
- Make warning about shared memory backing files on a networked file
|
||
|
system be optional (i.e., can be disabled via MCA parameter).
|
||
|
- Several fixes to processor and memory affinity.
|
||
|
- Various shared memory infrastructure improvements.
|
||
|
- Various checkpoint/restart fixes.
|
||
|
- Fix MPI_IN_PLACE (and other MPI sentinel values) on OS X. Thanks to
|
||
|
Dave Goodell for providing the magic OS X gcc linker flags necessary.
|
||
|
- Various man page corrections and typo fixes. Thanks to Fujitsu for
|
||
|
the patch.
|
||
|
- Updated wrapper compiler man pages to list the various --showme
|
||
|
options that are available.
|
||
|
- Add PMI direct-launch support (e.g., "srun mpi_application" under
|
||
|
SLURM).
|
||
|
- Correctly compute the aligned address when packing the
|
||
|
datatype description. Thanks to Fujitsu for the patch.
|
||
|
- Fix MPI obscure corner case handling in packing MPI datatypes.
|
||
|
Thanks to Fujitsu for providing the patch.
|
||
|
- Workaround an Intel compiler v12.1.0 2011.6.233 vector optimization
|
||
|
bug.
|
||
|
- Output the MPI API in ompi_info output.
|
||
|
- Major VT update to 5.12.1.4.
|
||
|
- Upgrade embedded Hardware Locality (hwloc) v1.3.2, plus some
|
||
|
post-1.3.2-release bug fixes. All processor and memory binding is
|
||
|
now done through hwloc. Woo hoo! Note that this fixes core binding
|
||
|
on AMD Opteron 6200 and 4200 series-based systems (sometimes known
|
||
|
as Interlagos, Valencia, or other Bulldozer-based chips).
|
||
|
- New MCA parameters to control process-wide memory binding policy:
|
||
|
hwloc_base_mem_alloc_policy, hwloc_base_mem_bind_failure_action (see
|
||
|
ompi_info --param hwloc base).
|
||
|
- Removed direct support for libnuma. Libnuma support may now be
|
||
|
picked up through hwloc.
|
||
|
- Added MPI_IN_PLACE support to MPI_EXSCAN.
|
||
|
- Various fixes for building on Windows, including MinGW support.
|
||
|
- Removed support for the OpenFabrics IBCM connection manager.
|
||
|
- Updated Chelsio T4 and Intel NE OpenFabrics default buffer settings.
|
||
|
- Increased the default RDMA CM timeout to 30 seconds.
|
||
|
- Issue a warning if both btl_tcp_if_include and btl_tcp_if_exclude
|
||
|
are specified.
|
||
|
- Many fixes to the Mellanox MXM transport.
|
||
|
|
||
|
|
||
|
1.5.4: 18 Aug 2011
|
||
|
------------------
|
||
|
|
||
|
- Add support for the (as yet unreleased) Mellanox MXM transport.
|
||
|
- Add support for dynamic service levels (SLs) in the openib BTL.
|
||
|
- Fixed C++ bindings cosmetic/warnings issue with
|
||
|
MPI::Comm::NULL_COPY_FN and MPI::Comm::NULL_DELETE_FN. Thanks to
|
||
|
Júlio Hoffimann for identifying the issues.
|
||
|
- Also allow the word "slots" in rankfiles (i.e., not just "slot").
|
||
|
(** also to appear in 1.4.4)
|
||
|
- Add Mellanox ConnectX 3 device IDs to the openib BTL defaults.
|
||
|
(** also to appear in 1.4.4)
|
||
|
- Various FCA updates.
|
||
|
- Fix 32 bit SIGBUS errors on Solaris SPARC platforms.
|
||
|
- Add missing ARM assembly code files.
|
||
|
- Update to allow more than 128 entries in an appfile.
|
||
|
(** also to appear in 1.4.4)
|
||
|
- Various VT updates and bug fixes.
|
||
|
- Update description of btl_openib_cq_size to be more accurate.
|
||
|
(** also to appear in 1.4.4)
|
||
|
- Various assembly "clobber" fixes.
|
||
|
- Fix a hang in carto selection in obscure situations.
|
||
|
- Guard the inclusion of execinfo.h since not all platforms have it. Thanks
|
||
|
to Aleksej Saushev for identifying this issue.
|
||
|
(** also to appear in 1.4.4)
|
||
|
- Support Solaris legacy munmap prototype changes.
|
||
|
(** also to appear in 1.4.4)
|
||
|
- Updated to Automake 1.11.1 per
|
||
|
http://www.open-mpi.org/community/lists/devel/2011/07/9492.php.
|
||
|
- Fix compilation of LSF support.
|
||
|
- Update MPI_Comm_spawn_multiple.3 man page to reflect what it
|
||
|
actually does.
|
||
|
- Fix for possible corruption of the environment. Thanks to Peter
|
||
|
Thompson for the suggestion. (** also to appear in 1.4.4)
|
||
|
- Enable use of PSM on direct-launch SLURM jobs.
|
||
|
- Update paffinity hwloc to v1.2, and to fix minor bugs affinity
|
||
|
assignment bugs on PPC64/Linux platforms.
|
||
|
- Let the openib BTL auto-detect its bandwidth.
|
||
|
- Support new MPI-2.2 datatypes.
|
||
|
- Updates to support more datatypes in MPI one-sided communication.
|
||
|
- Fix recursive locking bug when MPI-IO was used with
|
||
|
MPI_THREAD_MULTIPLE. (** also to appear in 1.4.4)
|
||
|
- Fix mpirun handling of prefix conflicts.
|
||
|
- Ensure mpirun's --xterm options leaves sessions attached.
|
||
|
(** also to appear in 1.4.4)
|
||
|
- Fixed type of sendcounts and displs in the "use mpi" F90 module.
|
||
|
ABI is preserved, but applications may well be broken. See the
|
||
|
README for more details. Thanks to Stanislav Sazykin for
|
||
|
identifying the issue. (** also to appear in 1.4.4)
|
||
|
- Fix indexed datatype leaks. Thanks to Pascal Deveze for supplying
|
||
|
the initial patch. (** also to appear in 1.4.4)
|
||
|
- Fix debugger mapping when mpirun's -npernode option is used.
|
||
|
- Fixed support for configure's --disable-dlopen option when used with
|
||
|
"make distclean".
|
||
|
- Fix segv associated with MPI_Comm_create with MPI_GROUP_EMPTY.
|
||
|
Thanks to Dominik Goeddeke for finding this.
|
||
|
(** also to appear in 1.4.4)
|
||
|
- Improved LoadLeveler ORTE support.
|
||
|
- Add new WinVerbs BTL plugin, supporting native OpenFabrics verbs on
|
||
|
Windows (the "wv" BTL).
|
||
|
- Add new btl_openib_gid_index MCA parameter to allow selecting which
|
||
|
GID to use on an OpenFabrics device's GID table.
|
||
|
- Add support for PCI relaxed ordering in the OpenFabrics BTL (when
|
||
|
available).
|
||
|
- Update rsh logic to allow correct SGE operation.
|
||
|
- Ensure that the mca_paffinity_alone MCA parameter only appears once
|
||
|
in the ompi_info output. Thanks to Gus Correa for identifying the
|
||
|
issue.
|
||
|
- Fixed return codes from MPI_PROBE and MPI_IPROBE.
|
||
|
(** also to appear in 1.4.4)
|
||
|
- Remove --enable-progress-thread configure option; it doesn't work on
|
||
|
the v1.5 branch. Rename --enable-mpi-threads to
|
||
|
--enable-mpi-thread-multiple. Add new --enable-opal-multi-threads
|
||
|
option.
|
||
|
- Updates for Intel Fortran compiler version 12.
|
||
|
- Remove bproc support. Farewell bproc!
|
||
|
- If something goes wrong during MPI_INIT, fix the error
|
||
|
message to say that it's illegal to invoke MPI_INIT before
|
||
|
MPI_INIT.
|
||
|
|
||
|
|
||
|
1.5.3: 16 Mar 2011
|
||
|
------------------
|
||
|
|
||
|
- Add missing "affinity" MPI extension (i.e., the OMPI_Affinity_str()
|
||
|
API) that was accidentally left out of the 1.5.2 release.
|
||
|
|
||
|
|
||
|
1.5.2: 9 Mar 2011
|
||
|
-----------------
|
||
|
|
||
|
- Replaced all custom topology / affinity code with initial support
|
||
|
for hwloc v1.1.1 (PLPA has been removed -- long live hwloc!). Note
|
||
|
that hwloc is bundled with Open MPI, but an external hwloc can be
|
||
|
used, if desired. See README for more details.
|
||
|
- Many CMake updates for Windows builds.
|
||
|
- Updated opal_cr_thread_sleep_wait MCA param default value to make it
|
||
|
less aggressive.
|
||
|
- Updated debugger support to allow Totalview attaching from jobs
|
||
|
launched directly via srun (not mpirun). Thanks to Nikolay Piskun
|
||
|
for the patch.
|
||
|
- Added more FTB/CIFTS support.
|
||
|
- Fixed compile error with the PGI compiler.
|
||
|
- Portability fixes to allow the openib BTL to run on the Solaris
|
||
|
verbs stack.
|
||
|
- Fixed multi-token command-line issues when using the mpirun
|
||
|
--debug switch. For example:
|
||
|
mpirun --debug -np 2 a.out "foo bar"
|
||
|
Thanks to Gabriele Fatigati for reporting the issue.
|
||
|
- Added ARM support.
|
||
|
- Added the MPI_ROOT environment variable in the Open MPI Linux SRPM
|
||
|
for customers who use the BPS and LSF batch managers.
|
||
|
- Updated ROMIO from MPICH v1.3.1 (plus one additional patch).
|
||
|
- Fixed some deprecated MPI API function notification messages.
|
||
|
- Added new "bfo" PML that provides failover on OpenFabrics networks.
|
||
|
- Fixed some buffer memcheck issues in MPI_*_init.
|
||
|
- Added Solaris-specific chip detection and performance improvements.
|
||
|
- Fix some compile errors on Solaris.
|
||
|
- Updated the "rmcast" framework with bug fixes, new functionality.
|
||
|
- Updated the Voltaire FCA component with bug fixes, new
|
||
|
functionality. Support for FCA version 2.1.
|
||
|
- Fix gcc 4.4.x and 4.5.x over-aggressive warning notifications on
|
||
|
possibly freeing stack variables. Thanks to the Gentoo packagers
|
||
|
for reporting the issue.
|
||
|
- Make the openib component be verbose when it disqualifies itself due
|
||
|
to MPI_THREAD_MULTIPLE.
|
||
|
- Minor man page fixes.
|
||
|
- Various checkpoint / restart fixes.
|
||
|
- Fix race condition in the one-sided unlock code. Thanks to
|
||
|
Guillaume Thouvenin for finding the issue.
|
||
|
- Improve help message aggregation.
|
||
|
- Add OMPI_Affinity_str() optional user-level API function (i.e., the
|
||
|
"affinity" MPI extension). See README for more details.
|
||
|
- Added btl_tcp_if_seq MCA parameter to select a different ethernet
|
||
|
interface for each MPI process on a node. This parameter is only
|
||
|
useful when used with virtual ethernet interfaces on a single
|
||
|
network card (e.g., when using virtual interfaces give dedicated
|
||
|
hardware resources on the NIC to each process).
|
||
|
- Changed behavior of mpirun to terminate if it receives 10 (or more)
|
||
|
SIGPIPEs.
|
||
|
- Fixed oversubscription detection.
|
||
|
- Added new mtl_mx_board and mtl_mx_endpoint MCA parameters.
|
||
|
- Added ummunotify support for OpenFabrics-based transports. See the
|
||
|
README for more details.
|
||
|
|
||
|
|
||
|
1.5.1: 15 Dec 2010
|
||
|
------------------
|
||
|
|
||
|
- Fixes for the Oracle Studio 12.2 Fortran compiler.
|
||
|
- Fix SPARC and SPARCv9 atomics. Thanks to Nicola Stange for the
|
||
|
initial patch.
|
||
|
- Fix Libtool issues with the IBM XL compiler in 64-bit mode.
|
||
|
- Restore the reset of the libevent progress counter to avoid
|
||
|
over-sampling the event library.
|
||
|
- Update memory barrier support.
|
||
|
- Use memmove (instead of memcpy) when necessary (e.g., source and
|
||
|
destination overlap).
|
||
|
- Fixed ompi-top crash.
|
||
|
- Fix to handle Autoconf --program-transforms properly and other
|
||
|
m4/configury updates. Thanks to the GASNet project for the
|
||
|
--program transforms fix.
|
||
|
- Allow hostfiles to specify usernames on a per-host basis.
|
||
|
- Update wrapper compiler scripts to search for perl during configure,
|
||
|
per request from the BSD maintainers.
|
||
|
- Minor man page fixes.
|
||
|
- Added --with-libltdl option to allow building Open MPI with an
|
||
|
external installation of libltdl.
|
||
|
- Fixed various issues with -D_FORTIFY_SOURCE=2.
|
||
|
- Various VT fixes and updates.
|
||
|
|
||
|
|
||
|
1.5: 10 Oct 2010
|
||
|
----------------
|
||
|
|
||
|
- Added "knem" support: direct process-to-process copying for shared
|
||
|
memory message passing. See http://runtime.bordeaux.inria.fr/knem/
|
||
|
and the README file for more details.
|
||
|
- Updated shared library versioning scheme and linking style of MPI
|
||
|
applications. The MPI application ABI has been broken from the
|
||
|
v1.3/v1.4 series. MPI applications compiled against any prior
|
||
|
version of Open MPI will need to, at a minimum, re-link. See the
|
||
|
README file for more details.
|
||
|
- Added "fca" collective component, enabling MPI collective offload
|
||
|
support for Voltaire switches.
|
||
|
- Fixed MPI one-sided operations with large target displacements.
|
||
|
Thanks to Brian Price and Jed Brown for reporting the issue.
|
||
|
- Fixed MPI_GET_COUNT when used with large counts. Thanks to Jed
|
||
|
Brown for reporting the issue.
|
||
|
- Made the openib BTL safer if extremely low SRQ settings are used.
|
||
|
- Fixed handling of the array_of_argv parameter in the Fortran
|
||
|
binding of MPI_COMM_SPAWN_MULTIPLE (** also to appear: 1.4.3).
|
||
|
- Fixed malloc(0) warnings in some collectives.
|
||
|
- Fixed a problem with the Fortran binding for
|
||
|
MPI_FILE_CREATE_ERRHANDLER. Thanks to Secretan Yves for identifying
|
||
|
the issue (** also to appear: 1.4.3).
|
||
|
- Updates to the LSF PLM to ensure that the path is correctly passed.
|
||
|
Thanks to Teng Lin for the patch (** also to appear: 1.4.3).
|
||
|
- Fixes for the F90 MPI_COMM_SET_ERRHANDLER and MPI_WIN_SET_ERRHANDLER
|
||
|
bindings. Thanks to Paul Kapinos for pointing out the issue
|
||
|
(** also to appear: 1.4.3).
|
||
|
- Fixed extra_state parameter types in F90 prototypes for
|
||
|
MPI_COMM_CREATE_KEYVAL, MPI_GREQUEST_START, MPI_REGISTER_DATAREP,
|
||
|
MPI_TYPE_CREATE_KEYVAL, and MPI_WIN_CREATE_KEYVAL.
|
||
|
- Fixes for Solaris oversubscription detection.
|
||
|
- If the PML determines it can't reach a peer process, print a
|
||
|
slightly more helpful message. Thanks to Nick Edmonds for the
|
||
|
suggestion.
|
||
|
- Make btl_openib_if_include/exclude function the same way
|
||
|
btl_tcp_if_include/exclude works (i.e., supplying an _include list
|
||
|
overrides supplying an _exclude list).
|
||
|
- Apply more scalable reachability algorithm on platforms with more
|
||
|
than 8 TCP interfaces.
|
||
|
- Various assembly code updates for more modern platforms / compilers.
|
||
|
- Relax restrictions on using certain kinds of MPI datatypes with
|
||
|
one-sided operations. Users beware; not all MPI datatypes are valid
|
||
|
for use with one-sided operations!
|
||
|
- Improve behavior of MPI_COMM_SPAWN with regards to --bynode.
|
||
|
- Various threading fixes in the openib BTL and other core pieces of
|
||
|
Open MPI.
|
||
|
- Various help file and man pages updates.
|
||
|
- Various FreeBSD and NetBSD updates and fixes. Thanks to Kevin
|
||
|
Buckley and Aleksej Saushev for their work.
|
||
|
- Fix case where freeing communicators in MPI_FINALIZE could cause
|
||
|
process failures.
|
||
|
- Print warnings if shared memory state files are opened on what look
|
||
|
like networked filesystems.
|
||
|
- Update libevent to v1.4.13.
|
||
|
- Allow propagating signals to processes that call fork().
|
||
|
- Fix bug where MPI_GATHER was sometimes incorrectly examining the
|
||
|
datatype on non-root processes. Thanks to Michael Hofmann for
|
||
|
investigating the issue.
|
||
|
- Various Microsoft Windows fixes.
|
||
|
- Various Catamount fixes.
|
||
|
- Various checkpoint / restart fixes.
|
||
|
- Xgrid support has been removed until it can be fixed (patches
|
||
|
would be welcome).
|
||
|
- Added simplistic "libompitrace" contrib package. Using the MPI
|
||
|
profiling interface, it essentially prints out to stderr when select
|
||
|
MPI functions are invoked.
|
||
|
- Update bundled VampirTrace to v5.8.2.
|
||
|
- Add pkg-config(1) configuration files for ompi, ompi-c, ompi-cxx,
|
||
|
ompi-f77, ompi-f90. See the README for more details.
|
||
|
- Removed the libopenmpi_malloc library (added in the v1.3 series)
|
||
|
since it is no longer necessary
|
||
|
- Add several notifier plugins (generally used when Open MPI detects
|
||
|
system/network administrator-worthy problems); each have their own
|
||
|
MCA parameters to govern their usage. See "ompi_info --param
|
||
|
notifier <name>" for more details.
|
||
|
- command to execute arbitrary commands (e.g., run a script).
|
||
|
- file to send output to a file.
|
||
|
- ftb to send output to the Fault Tolerant Backplane (see
|
||
|
http://wiki.mcs.anl.gov/cifts/index.php/CIFTS)
|
||
|
- hnp to send the output to mpirun.
|
||
|
- smtp (requires libesmtp) to send an email.
|
||
|
|
||
|
1.4.5: 12 Feb 2012
|
||
|
------------------
|
||
|
|
||
|
- Fixed the --disable-memory-manager configure switch.
|
||
|
(** also to appear in 1.5.5)
|
||
|
- Fix typos in code and man pages. Thanks to Fujitsu for these fixes.
|
||
|
(** also to appear in 1.5.5)
|
||
|
- Improve management of the registration cache; when full, try freeing
|
||
|
old entries and attempt to re-register.
|
||
|
- Fixed a data packing pointer alignment issue. Thanks to Fujitsu
|
||
|
for the patch.
|
||
|
(** also to appear in 1.5.5)
|
||
|
- Add ability to turn off warning about having the shared memory backing
|
||
|
store over a networked filesystem. Thanks to Chris Samuel for this
|
||
|
suggestion.
|
||
|
(** also to appear in 1.5.5)
|
||
|
- Removed an unnecessary memmove() and plugged a couple of small memory leaks
|
||
|
in the openib OOB connection setup code.
|
||
|
- Fixed some QLogic bugs. Thanks to Mark Debbage from QLogic for the patches.
|
||
|
- Fixed problem with MPI_IN_PLACE and other sentinel Fortran constants
|
||
|
on OS X.
|
||
|
(** also to appear in 1.5.5)
|
||
|
- Fix SLURM cpus-per-task allocation.
|
||
|
(** also to appear in 1.5.5)
|
||
|
- Fix the datatype engine for when data left over from the previous
|
||
|
pack was larger than the allowed space in the pack buffer. Thanks to
|
||
|
Yuki Matsumoto and Takahiro Kawashima for the bug report and the
|
||
|
patch.
|
||
|
- Fix Fortran value for MPI_MAX_PORT_NAME. Thanks to Enzo Dari for
|
||
|
raising the issue.
|
||
|
- Workaround an Intel compiler v12.1.0 2011.6.233 vector optimization
|
||
|
bug.
|
||
|
- Fix issues on Solaris with the openib BTL.
|
||
|
- Fixes for the Oracle Studio 12.2 Fortran compiler.
|
||
|
- Update iWARP parameters for the Intel NICs.
|
||
|
(** also to appear in 1.5.5)
|
||
|
- Fix obscure cases where MPI_ALLGATHER could crash. Thanks to Andrew
|
||
|
Senin for reporting the problem.
|
||
|
(** also to appear in 1.5.5)
|
||
|
|
||
|
|
||
|
1.4.4: 11 Oct 2011
|
||
|
------------------
|
||
|
|
||
|
- Modified a memcpy() call in the openib btl connection setup to use
|
||
|
memmove() instead because of the possibility of an overlapping
|
||
|
copy (as identified by valgrind).
|
||
|
- Changed use of sys_timer_get_cycles() to the more appropriate
|
||
|
wrapper: opal_timer_base_get_cycles(). Thanks to Jani Monoses
|
||
|
for this fix.
|
||
|
- Corrected the reported default value of btl_openib_ib_timeout
|
||
|
in the "IB retries exceeded" error message. Thanks to Kevin Buckley
|
||
|
for this correction.
|
||
|
- Increased rdmacm address resolution timeout from 1s to 30s &
|
||
|
updated Chelsio T4 openib BTL defaults. Thanks to Steve Wise
|
||
|
for these updates.
|
||
|
(** also to appear in 1.5.5)
|
||
|
- Ensure that MPI_Accumulate error return in 1.4 is consistent with
|
||
|
1.5.x and trunk.
|
||
|
- Allow the word "slots" in rankfiles (i.e., not just "slot").
|
||
|
(** also appeared in 1.5.4)
|
||
|
- Add Mellanox ConnectX 3 device IDs to the openib BTL defaults.
|
||
|
(** also appeared in 1.5.4)
|
||
|
- Update description of btl_openib_cq_size to be more accurate.
|
||
|
- Ensure mpirun's --xterm options leaves sessions attached.
|
||
|
(** also appeared in 1.5.4)
|
||
|
- Update to allow more than 128 entries in an appfile.
|
||
|
(** also appeared in 1.5.4)
|
||
|
- Update description of btl_openib_cq_size to be more accurate.
|
||
|
(** also appeared in 1.5.4)
|
||
|
- Fix for deadlock when handling recursive attribute keyval deletions
|
||
|
(e.g., when using ROMIO with MPI_THREAD_MULTIPLE).
|
||
|
- Fix indexed datatype leaks. Thanks to Pascal Deveze for supplying
|
||
|
the initial patch. (** also appeared in 1.5.4)
|
||
|
- Fixed the F90 types of the sendcounts and displs parameters to
|
||
|
MPI_SCATTERV. Thanks to Stanislav Sazykin for identifying the issue.
|
||
|
(** also appeared in 1.5.4)
|
||
|
- Exclude opal/libltdl from "make distclean" when --disable-dlopen is
|
||
|
used. Thanks to David Gunter for reporting the issue.
|
||
|
- Fixed a segv in MPI_Comm_create when called with GROUP_EMPTY.
|
||
|
Thanks to Dominik Goeddeke for finding this.
|
||
|
(** also appeared in 1.5.4)
|
||
|
- Fixed return codes from MPI_PROBE and MPI_IPROBE.
|
||
|
(** also appeared in 1.5.4)
|
||
|
- Fixed undefined symbol error when using the vtf90 profiling tool.
|
||
|
- Fix for referencing an uninitialized variable in DPM ORTE. Thanks
|
||
|
to Avinash Malik for reporting the issue.
|
||
|
- Fix for correctly handling multi-token args when using debuggers.
|
||
|
- Eliminated the unneeded u_int*_t datatype definitions.
|
||
|
- Change in ORTE DPM to get around gcc 4.[45].x compiler wanrings
|
||
|
about possibly calling free() on a non-heap variable, even though it
|
||
|
will never happen because the refcount will never go to zero.
|
||
|
- Fixed incorrect text in MPI_File_set_view man page.
|
||
|
- Fix in MPI_Init_thread for checkpoint/restart.
|
||
|
- Fix for libtool issue when using pgcc to compile ompi in conjunction
|
||
|
with the -tp option.
|
||
|
- Fixed a race condition in osc_rdma_sync. Thanks to Guillaume
|
||
|
Thouvenin for finding this issue.
|
||
|
- Clarification of MPI_Init_thread man page.
|
||
|
- Fixed an indexing problem in precondition_transports.
|
||
|
- Fixed a problem in which duplicated libs were being specified for
|
||
|
linking. Thanks to Hicham Mouline for noticing it.
|
||
|
- Various autogen.sh fixes.
|
||
|
- Fix for memchecking buffers during MPI_*INIT.
|
||
|
- Man page cleanups. Thanks to Jeremiah Willcock and Jed Brown.
|
||
|
- Fix for VT rpmbuild on RHEL5.
|
||
|
- Support Solaris legacy munmap prototype changes.
|
||
|
(** also appeared in 1.5.4)
|
||
|
- Expands app_idx to int32_t to allow more than 127 app_contexts.
|
||
|
- Guard the inclusion of execinfo.h since not all platforms have it. Thanks
|
||
|
to Aleksej Saushev for identifying this issue.
|
||
|
(** also appeared in 1.5.4)
|
||
|
- Fix to avoid possible environment corruption. Thanks to Peter Thompson
|
||
|
for identifying the issue and supplying a patch.
|
||
|
(** also appeared in 1.5.4)
|
||
|
- Fixed paffinity base MCA duplicate registrations. Thanks to Gus
|
||
|
Correa for bringing this to our attention.
|
||
|
- Fix recursive locking bug when MPI-IO was used with
|
||
|
MPI_THREAD_MULTIPLE. (** also appeared in 1.5.4)
|
||
|
- F90 MPI API fixes.
|
||
|
- Fixed a misleading MPI_Bcast error message. Thanks to Jeremiah
|
||
|
Willcock for reporting this.
|
||
|
- Added <sys/stat.h> to ptmalloc's hooks.c (it's not always included
|
||
|
by default on some systems).
|
||
|
- Libtool patch to get around a build problem when using the IBM XL
|
||
|
compilers.
|
||
|
- Fix to detect and avoid overlapping memcpy(). Thanks to
|
||
|
Francis Pellegrini for identifying the issue.
|
||
|
- Fix to allow ompi to work on top of RoCE vLANs.
|
||
|
- Restored a missing debugger flag to support TotalView. Thanks to
|
||
|
David Turner and the TV folks for supplying the fix.
|
||
|
- Updated SLURM support to 1.5.1.
|
||
|
- Removed an extraneous #include from the TCP BTL.
|
||
|
- When specifying OOB ports, fix to convert the ports into network
|
||
|
byte order before binding.
|
||
|
- Fixed use of memory barriers in the SM BTL. This fixed segv's when
|
||
|
compiling with Intel 10.0.025 or PGI 9.0-3.
|
||
|
- Fix to prevent the SM BTL from creating its mmap'd file in
|
||
|
directories that are remotely mounted.
|
||
|
|
||
|
|
||
|
1.4.3: 6 Sep 2010
|
||
|
-----------------
|
||
|
|
||
|
- Fixed handling of the array_of_argv parameter in the Fortran
|
||
|
binding of MPI_COMM_SPAWN_MULTIPLE (** also to appear: 1.5).
|
||
|
- Fixed a problem with the Fortran binding for
|
||
|
MPI_FILE_CREATE_ERRHANDLER. Thanks to Secretan Yves for identifying
|
||
|
the issue (** also to appear: 1.5).
|
||
|
- Updates to the LSF PLM to ensure that the path is correctly passed.
|
||
|
Thanks to Teng Lin for the patch (** also to appear: 1.5).
|
||
|
- Fixes for the F90 MPI_COMM_SET_ERRHANDLER and MPI_WIN_SET_ERRHANDLER
|
||
|
bindings. Thanks to Paul Kapinos for pointing out the issue.
|
||
|
(** also to appear: 1.5).
|
||
|
- Fixed various MPI_THREAD_MULTIPLE race conditions.
|
||
|
- Fixed an issue with an undeclared variable from ptmalloc2 munmap on
|
||
|
BSD systems.
|
||
|
- Fixes for BSD interface detection.
|
||
|
- Various other BSD fixes. Thanks to Kevin Buckley helping to track.
|
||
|
all of this down.
|
||
|
- Fixed issues with the use of the -nper* mpirun command line arguments.
|
||
|
- Fixed an issue with coll tuned dynamic rules.
|
||
|
- Fixed an issue with the use of OPAL_DESTDIR being applied too aggressively.
|
||
|
- Fixed an issue with one-sided xfers when the displacement exceeds 2GBytes.
|
||
|
- Change to ensure TotalView works properly on Darwin.
|
||
|
- Added support for Visual Studio 2010.
|
||
|
- Fix to ensure proper placement of VampirTrace header files.
|
||
|
- Needed to add volatile keyword to a varialbe used in debugging
|
||
|
(MPIR_being_debugged).
|
||
|
- Fixed a bug in inter-allgather.
|
||
|
- Fixed malloc(0) warnings.
|
||
|
- Corrected a typo the MPI_Comm_size man page (intra -> inter). Thanks
|
||
|
to Simon number.cruncher for pointing this out.
|
||
|
- Fixed a SegV in orted when given more than 127 app_contexts.
|
||
|
- Removed xgrid source code from the 1.4 branch since it is no longer
|
||
|
supported in the 1.4 series.
|
||
|
- Removed the --enable-opal-progress-threads config option since
|
||
|
opal progress thread support does not work in 1.4.x.
|
||
|
- Fixed a defect in VampirTrace's vtfilter.
|
||
|
- Fixed wrong Windows path in hnp_contact.
|
||
|
- Removed the requirement for a paffinity component.
|
||
|
- Removed a hardcoded limit of 64 interconnected jobs.
|
||
|
- Fix to allow singletons to use ompi-server for rendezvous.
|
||
|
- Fixed bug in output-filename option.
|
||
|
- Fix to correctly handle failures in mx_init().
|
||
|
- Fixed a potential Fortran memory leak.
|
||
|
- Fixed an incorrect branch in some ppc32 assembly code. Thanks
|
||
|
to Matthew Clark for this fix.
|
||
|
- Remove use of undocumented AS_VAR_GET macro during configuration.
|
||
|
- Fixed an issue with VampirTrace's wrapper for MPI_init_thread.
|
||
|
- Updated mca-btl-openib-device-params.ini file with various new vendor id's.
|
||
|
- Configuration fixes to ensure CPPFLAGS in handled properly if a non-standard
|
||
|
valgrind location was specified.
|
||
|
- Various man page updates
|
||
|
|
||
|
|
||
|
1.4.2: 4 May 2010
|
||
|
-----------------
|
||
|
|
||
|
- Fixed problem when running in heterogeneous environments. Thanks to
|
||
|
Timur Magomedov for helping to track down this issue.
|
||
|
- Update LSF support to ensure that the path is passed correctly.
|
||
|
Thanks to Teng Lin for submitting a patch.
|
||
|
- Fixed some miscellaneous oversubscription detection bugs.
|
||
|
- IBM re-licensed its LoadLeveler code to be BSD-compliant.
|
||
|
- Various OpenBSD and NetBSD build and run-time fixes. Many thanks to
|
||
|
the OpenBSD community for their time, expertise, and patience
|
||
|
getting these fixes incorporated into Open MPI's main line.
|
||
|
- Various fixes for multithreading deadlocks, race conditions, and
|
||
|
other nefarious things.
|
||
|
- Fixed ROMIO's handling of "nearly" contiguous issues (e.g., with
|
||
|
non-zero true_lb). Thanks for Pascal Deveze for the patch.
|
||
|
- Bunches of Windows build fixes. Many thanks to several Windows
|
||
|
users for their help in improving our support on Windows.
|
||
|
- Now allow the graceful failover from MTLs to BTLs if no MTLs can
|
||
|
initialize successfully.
|
||
|
- Added "clobber" information to various atomic operations, fixing
|
||
|
erroneous behavior in some newer versions of the GNU compiler suite.
|
||
|
- Update various iWARP and InfiniBand device specifications in the
|
||
|
OpenFabrics .ini support file.
|
||
|
- Fix the use of hostfiles when a username is supplied.
|
||
|
- Various fixes for rankfile support.
|
||
|
- Updated the internal version of VampirTrace to 5.4.12.
|
||
|
- Fixed OS X TCP wireup issues having to do with IPv4/IPv6 confusion
|
||
|
(see https://svn.open-mpi.org/trac/ompi/changeset/22788 for more
|
||
|
details).
|
||
|
- Fixed some problems in processor affinity support, including when
|
||
|
there are "holes" in the processor namespace (e.g., offline
|
||
|
processors).
|
||
|
- Ensure that Open MPI's "session directory" (usually located in /tmp)
|
||
|
is cleaned up after process termination.
|
||
|
- Fixed some problems with the collective "hierarch" implementation
|
||
|
that could occur in some obscure conditions.
|
||
|
- Various MPI_REQUEST_NULL, API parameter checking, and attribute
|
||
|
error handling fixes. Thanks to Lisandro Dalcín for reporting the
|
||
|
issues.
|
||
|
- Fix case where MPI_GATHER erroneously used datatypes on non-root
|
||
|
nodes. Thanks to Michael Hofmann for investigating the issue.
|
||
|
- Patched ROMIO support for PVFS2 > v2.7 (patch taken from MPICH2
|
||
|
version of ROMIO).
|
||
|
- Fixed "mpirun --report-bindings" behavior when used with
|
||
|
mpi_paffinity_alone=1. Also fixed mpi_paffinity_alone=1 behavior
|
||
|
with non-MPI applications. Thanks to Brice Goglin for noticing the
|
||
|
problem.
|
||
|
- Ensure that all OpenFabrics devices have compatible receive_queues
|
||
|
specifications before allowing them to communicate. See the lengthy
|
||
|
comment in https://svn.open-mpi.org/trac/ompi/changeset/22592 for
|
||
|
more details.
|
||
|
- Fix some issues with checkpoint/restart.
|
||
|
- Improve the pre-MPI_INIT/post-MPI_FINALIZE error messages.
|
||
|
- Ensure that loopback addresses are never advertised to peer
|
||
|
processes for RDMA/OpenFabrics support.
|
||
|
- Fixed a CSUM PML false positive.
|
||
|
- Various fixes for Catamount support.
|
||
|
- Minor update to wrapper compilers in how user-specific argv is
|
||
|
ordered on the final command line. Thanks to Jed Brown for the
|
||
|
suggestions.
|
||
|
- Removed flex.exe binary from Open MPI tarballs; now generate flex
|
||
|
code from a newer (Windows-friendly) flex when we make official
|
||
|
tarballs.
|
||
|
|
||
|
|
||
|
1.4.1: 15 Jan 2010
|
||
|
------------------
|
||
|
|
||
|
- Update to PLPA v1.3.2, addressing a licensing issue identified by
|
||
|
the Fedora project. See
|
||
|
https://svn.open-mpi.org/trac/plpa/changeset/262 for details.
|
||
|
- Add check for malformed checkpoint metadata files (Ticket #2141).
|
||
|
- Fix error path in ompi-checkpoint when not able to checkpoint
|
||
|
(Ticket #2138).
|
||
|
- Cleanup component release logic when selecting checkpoint/restart
|
||
|
enabled components (Ticket #2135).
|
||
|
- Fixed VT node name detection for Cray XT platforms, and fixed some
|
||
|
broken VT documentation files.
|
||
|
- Fix a possible race condition in tearing down RDMA CM-based
|
||
|
connections.
|
||
|
- Relax error checking on MPI_GRAPH_CREATE. Thanks to David Singleton
|
||
|
for pointing out the issue.
|
||
|
- Fix a shared memory "hang" problem that occurred on x86/x86_64
|
||
|
platforms when used with the GNU >=4.4.x compiler series.
|
||
|
- Add fix for Libtool 2.2.6b's problems with the PGI 10.x compiler
|
||
|
suite. Inspired directly from the upstream Libtool patches that fix
|
||
|
the issue (but we need something working before the next Libtool
|
||
|
release).
|
||
|
|
||
|
|
||
|
1.4: 8 Dec 2009
|
||
|
---------------
|
||
|
|
||
|
The *only* change in the Open MPI v1.4 release (as compared to v1.3.4)
|
||
|
was to update the embedded version of Libtool's libltdl to address a
|
||
|
potential security vulnerability. Specifically: Open MPI v1.3.4 was
|
||
|
created with GNU Libtool 2.2.6a; Open MPI v1.4 was created with GNU
|
||
|
Libtool 2.2.6b. There are no other changes between Open MPI v1.3.4
|
||
|
and v1.4.
|
||
|
|
||
|
|
||
|
1.3.4: 13 Feb 2010
|
||
|
------------------
|
||
|
|
||
|
- Fix some issues in OMPI's SRPM with regard to shell_scripts_basename
|
||
|
and its use with mpi-selector. Thanks to Bill Johnstone for
|
||
|
pointing out the problem.
|
||
|
- Added many new MPI job process affinity options to mpirun. See the
|
||
|
newly-updated mpirun(1) man page for details.
|
||
|
- Several updates to mpirun's XML output.
|
||
|
- Update to fix a few Valgrind warnings with regards to the ptmalloc2
|
||
|
allocator and Open MPI's use of PLPA.
|
||
|
- Many updates and fixes to the (non-default) "sm" collective
|
||
|
component (i.e., native shared memory MPI collective operations).
|
||
|
- Updates and fixes to some MPI_COMM_SPAWN_MULTIPLE corner cases.
|
||
|
- Fix some internal copying functions in Open MPI's use of PLPA.
|
||
|
- Correct some SLURM nodelist parsing logic that may have interfered
|
||
|
with large jobs. Additionally, per advice from the SLURM team,
|
||
|
change the environment variable that we use for obtaining the job's
|
||
|
allocation.
|
||
|
- Revert to an older, safer (but slower) communicator ID allocation
|
||
|
algorithm.
|
||
|
- Fixed minimum distance finding for OpenFabrics devices in the openib
|
||
|
BTL.
|
||
|
- Relax the parameter checking MPI_CART_CREATE a bit.
|
||
|
- Fix MPI_COMM_SPAWN[_MULTIPLE] to only error-check the info arguments
|
||
|
on the root process. Thanks to Federico Golfre Andreasi for
|
||
|
reporting the problem.
|
||
|
- Fixed some BLCR configure issues.
|
||
|
- Fixed a potential deadlock when the openib BTL was used with
|
||
|
MPI_THREAD_MULTIPLE.
|
||
|
- Fixed dynamic rules selection for the "tuned" coll component.
|
||
|
- Added a launch progress meter to mpirun (useful for large jobs; set
|
||
|
the orte_report_launch_progress MCA parameter to 1 to see it).
|
||
|
- Reduced the number of file descriptors consumed by each MPI process.
|
||
|
- Add new device IDs for Chelsio T3 RNICs to the openib BTL config file.
|
||
|
- Fix some CRS self component issues.
|
||
|
- Added some MCA parameters to the PSM MTL to tune its run-time
|
||
|
behavior.
|
||
|
- Fix some VT issues with MPI_BOTTOM/MPI_IN_PLACE.
|
||
|
- Man page updates from the Debain Open MPI package maintainers.
|
||
|
- Add cycle counter support for the Alpha and Sparc platforms.
|
||
|
- Pass visibility flags to libltdl's configure script, resulting in
|
||
|
those symbols being hidden. This appears to mainly solve the
|
||
|
problem of applications attempting to use different versions of
|
||
|
libltdl from that used to build Open MPI.
|
||
|
|
||
|
|
||
|
1.3.3: 14 Jul 2009
|
||
|
------------------
|
||
|
|
||
|
- Fix a number of issues with the openib BTL (OpenFabrics) RDMA CM,
|
||
|
including a memory corruption bug, a shutdown deadlock, and a route
|
||
|
timeout. Thanks to David McMillen and Hal Rosenstock for help in
|
||
|
tracking down the issues.
|
||
|
- Change the behavior of the EXTRA_STATE parameter that is passed to
|
||
|
Fortran attribute callback functions: this value is now stored
|
||
|
internally in MPI -- it no longer references the original value
|
||
|
passed by MPI_*_CREATE_KEYVAL.
|
||
|
- Allow the overriding RFC1918 and RFC3330 for the specification of
|
||
|
"private" networks, thereby influencing Open MPI's TCP
|
||
|
"reachability" computations.
|
||
|
- Improve flow control issues in the sm btl, by both tweaking the
|
||
|
shared memory progression rules and by enabling the "sync" collective
|
||
|
to barrier every 1,000th collective.
|
||
|
- Various fixes for the IBM XL C/C++ v10.1 compiler.
|
||
|
- Allow explicit disabling of ptmalloc2 hooks at runtime (e.g., enable
|
||
|
support for Debian's builtroot system). Thanks to Manuel Prinz and
|
||
|
the rest of the Debian crew for helping identify and fix this issue.
|
||
|
- Various minor fixes for the I/O forwarding subsystem.
|
||
|
- Big endian iWARP fixes in the Open Fabrics RDMA CM support.
|
||
|
- Update support for various OpenFabrics devices in the openib BTL's
|
||
|
.ini file.
|
||
|
- Fixed undefined symbol issue with Open MPI's parallel debugger
|
||
|
message queue support so it can be compiled by Sun Studio compilers.
|
||
|
- Update MPI_SUBVERSION to 1 in the Fortran bindings.
|
||
|
- Fix MPI_GRAPH_CREATE Fortran 90 binding.
|
||
|
- Fix MPI_GROUP_COMPARE behavior with regards to MPI_IDENT. Thanks to
|
||
|
Geoffrey Irving for identifying the problem and supplying the fix.
|
||
|
- Silence gcc 4.1 compiler warnings about type punning. Thanks to
|
||
|
Number Cruncher for the fix.
|
||
|
- Added more Valgrind and other memory-cleanup fixes. Thanks to
|
||
|
various Open MPI users for help with these issues.
|
||
|
- Miscellaneous VampirTrace fixes.
|
||
|
- More fixes for openib credits in heavy-congestion scenarios.
|
||
|
- Slightly decrease the latency in the openib BTL in some conditions
|
||
|
(add "send immediate" support to the openib BTL).
|
||
|
- Ensure to allow MPI_REQUEST_GET_STATUS to accept an
|
||
|
MPI_STATUS_IGNORE parameter. Thanks to Shaun Jackman for the bug
|
||
|
report.
|
||
|
- Added Microsoft Windows support. See README.WINDOWS file for
|
||
|
details.
|
||
|
|
||
|
|
||
|
1.3.2: 22 Apr 2009
|
||
|
------------------
|
||
|
|
||
|
- Fixed a potential infinite loop in the openib BTL that could occur
|
||
|
in senders in some frequent-communication scenarios. Thanks to Don
|
||
|
Wood for reporting the problem.
|
||
|
- Add a new checksum PML variation on ob1 (main MPI point-to-point
|
||
|
communication engine) to detect memory corruption in node-to-node
|
||
|
messages
|
||
|
- Add a new configuration option to add padding to the openib
|
||
|
header so the data is aligned
|
||
|
- Add a new configuration option to use an alternative checksum algo
|
||
|
when using the checksum PML
|
||
|
- Fixed a problem reported by multiple users on the mailing list that
|
||
|
the LSF support would fail to find the appropriate libraries at
|
||
|
run-time.
|
||
|
- Allow empty shell designations from getpwuid(). Thanks to Sergey
|
||
|
Koposov for the bug report.
|
||
|
- Ensure that mpirun exits with non-zero status when applications die
|
||
|
due to user signal. Thanks to Geoffroy Pignot for suggesting the
|
||
|
fix.
|
||
|
- Ensure that MPI_VERSION / MPI_SUBVERSION match what is returned by
|
||
|
MPI_GET_VERSION. Thanks to Rob Egan for reporting the error.
|
||
|
- Updated MPI_*KEYVAL_CREATE functions to properly handle Fortran
|
||
|
extra state.
|
||
|
- A variety of ob1 (main MPI point-to-point communication engine) bug
|
||
|
fixes that could have caused hangs or seg faults.
|
||
|
- Do not install Open MPI's signal handlers in MPI_INIT if there are
|
||
|
already signal handlers installed. Thanks to Kees Verstoep for
|
||
|
bringing the issue to our attention.
|
||
|
- Fix GM support to not seg fault in MPI_INIT.
|
||
|
- Various VampirTrace fixes.
|
||
|
- Various PLPA fixes.
|
||
|
- No longer create BTLs for invalid (TCP) devices.
|
||
|
- Various man page style and lint cleanups.
|
||
|
- Fix critical OpenFabrics-related bug noted here:
|
||
|
http://www.open-mpi.org/community/lists/announce/2009/03/0029.php.
|
||
|
Open MPI now uses a much more robust memory intercept scheme that is
|
||
|
quite similar to what is used by MX. The use of "-lopenmpi-malloc"
|
||
|
is no longer necessary, is deprecated, and is expected to disappear
|
||
|
in a future release. -lopenmpi-malloc will continue to work for the
|
||
|
duration of the Open MPI v1.3 and v1.4 series.
|
||
|
- Fix some OpenFabrics shutdown errors, both regarding iWARP and SRQ.
|
||
|
- Allow the udapl BTL to work on Solaris platforms that support
|
||
|
relaxed PCI ordering.
|
||
|
- Fix problem where the mpirun would sometimes use rsh/ssh to launch on
|
||
|
the localhost (instead of simply forking).
|
||
|
- Minor SLURM stdin fixes.
|
||
|
- Fix to run properly under SGE jobs.
|
||
|
- Scalability and latency improvements for shared memory jobs: convert
|
||
|
to using one message queue instead of N queues.
|
||
|
- Automatically size the shared-memory area (mmap file) to match
|
||
|
better what is needed; specifically, so that large-np jobs will start.
|
||
|
- Use fixed-length MPI predefined handles in order to provide ABI
|
||
|
compatibility between Open MPI releases.
|
||
|
- Fix building of the posix paffinity component to properly get the
|
||
|
number of processors in loosely tested environments (e.g.,
|
||
|
FreeBSD). Thanks to Steve Kargl for reporting the issue.
|
||
|
- Fix --with-libnuma handling in configure. Thanks to Gus Correa for
|
||
|
reporting the problem.
|
||
|
|
||
|
|
||
|
1.3.1: 19 Mar 2009
|
||
|
------------------
|
||
|
|
||
|
- Added "sync" coll component to allow users to synchronize every N
|
||
|
collective operations on a given communicator.
|
||
|
- Increased the default values of the IB and RNR timeout MCA parameters.
|
||
|
- Fix a compiler error noted by Mostyn Lewis with the PGI 8.0 compiler.
|
||
|
- Fix an error that prevented stdin from being forwarded if the
|
||
|
rsh launcher was in use. Thanks to Branden Moore for pointing out
|
||
|
the problem.
|
||
|
- Correct a case where the added datatype is considered as contiguous but
|
||
|
has gaps in the beginning.
|
||
|
- Fix an error that limited the number of comm_spawns that could
|
||
|
simultaneously be running in some environments
|
||
|
- Correct a corner case in OB1's GET protocol for long messages; the
|
||
|
error could sometimes cause MPI jobs using the openib BTL to hang.
|
||
|
- Fix a bunch of bugs in the IO forwarding (IOF) subsystem and add some
|
||
|
new options to output to files and redirect output to xterm. Thanks to
|
||
|
Jody Weissmann for helping test out many of the new fixes and
|
||
|
features.
|
||
|
- Fix SLURM race condition.
|
||
|
- Fix MPI_File_c2f(MPI_FILE_NULL) to return 0, not -1. Thanks to
|
||
|
Lisandro Dalcín for the bug report.
|
||
|
- Fix the DSO build of tm PLM.
|
||
|
- Various fixes for size disparity between C int's and Fortran
|
||
|
INTEGER's. Thanks to Christoph van Wullen for the bug report.
|
||
|
- Ensure that mpirun exits with a non-zero exit status when daemons or
|
||
|
processes abort or fail to launch.
|
||
|
- Various fixes to work around Intel (NetEffect) RNIC behavior.
|
||
|
- Various fixes for mpirun's --preload-files and --preload-binary
|
||
|
options.
|
||
|
- Fix the string name in MPI::ERRORS_THROW_EXCEPTIONS.
|
||
|
- Add ability to forward SIFTSTP and SIGCONT to MPI processes if you
|
||
|
set the MCA parameter orte_forward_job_control to 1.
|
||
|
- Allow the sm BTL to allocate larger amounts of shared memory if
|
||
|
desired (helpful for very large multi-core boxen).
|
||
|
- Fix a few places where we used PATH_MAX instead of OPAL_PATH_MAX,
|
||
|
leading to compile problems on some platforms. Thanks to Andrea Iob
|
||
|
for the bug report.
|
||
|
- Fix mca_btl_openib_warn_no_device_params_found MCA parameter; it
|
||
|
was accidentally being ignored.
|
||
|
- Fix some run-time issues with the sctp BTL.
|
||
|
- Ensure that RTLD_NEXT exists before trying to use it (e.g., it
|
||
|
doesn't exist on Cygwin). Thanks to Gustavo Seabra for reporting
|
||
|
the issue.
|
||
|
- Various fixes to VampirTrace, including fixing compile errors on
|
||
|
some platforms.
|
||
|
- Fixed missing MPI_Comm_accept.3 man page; fixed minor issue in
|
||
|
orterun.1 man page. Thanks to Dirk Eddelbuettel for identifying the
|
||
|
problem and submitting a patch.
|
||
|
- Implement the XML formatted output of stdout/stderr/stddiag.
|
||
|
- Fixed mpirun's -wdir switch to ensure that working directories for
|
||
|
multiple app contexts are properly handled. Thanks to Geoffroy
|
||
|
Pignot for reporting the problem.
|
||
|
- Improvements to the MPI C++ integer constants:
|
||
|
- Allow MPI::SEEK_* constants to be used as constants
|
||
|
- Allow other MPI C++ constants to be used as array sizes
|
||
|
- Fix minor problem with orte-restart's command line options. See
|
||
|
ticket #1761 for details. Thanks to Gregor Dschung for reporting
|
||
|
the problem.
|
||
|
|
||
|
|
||
|
1.3: 19 Jan 2009
|
||
|
----------------
|
||
|
|
||
|
- Extended the OS X 10.5.x (Leopard) workaround for a problem when
|
||
|
assembly code is compiled with -g[0-9]. Thanks to Barry Smith for
|
||
|
reporting the problem. See ticket #1701.
|
||
|
- Disabled MPI_REAL16 and MPI_COMPLEX32 support on platforms where the
|
||
|
bit representation of REAL*16 is different than that of the C type
|
||
|
of the same size (usually long double). Thanks to Julien Devriendt
|
||
|
for reporting the issue. See ticket #1603.
|
||
|
- Increased the size of MPI_MAX_PORT_NAME to 1024 from 36. See ticket #1533.
|
||
|
- Added "notify debugger on abort" feature. See tickets #1509 and #1510.
|
||
|
Thanks to Seppo Sahrakropi for the bug report.
|
||
|
- Upgraded Open MPI tarballs to use Autoconf 2.63, Automake 1.10.1,
|
||
|
Libtool 2.2.6a.
|
||
|
- Added missing MPI::Comm::Call_errhandler() function. Thanks to Dave
|
||
|
Goodell for bringing this to our attention.
|
||
|
- Increased MPI_SUBVERSION value in mpi.h to 1 (i.e., MPI 2.1).
|
||
|
- Changed behavior of MPI_GRAPH_CREATE, MPI_TOPO_CREATE, and several
|
||
|
other topology functions per MPI-2.1.
|
||
|
- Fix the type of the C++ constant MPI::IN_PLACE.
|
||
|
- Various enhancements to the openib BTL:
|
||
|
- Added btl_openib_if_[in|ex]clude MCA parameters for
|
||
|
including/excluding comma-delimited lists of HCAs and ports.
|
||
|
- Added RDMA CM support, includng btl_openib_cpc_[in|ex]clude MCA
|
||
|
parameters
|
||
|
- Added NUMA support to only use "near" network adapters
|
||
|
- Added "Bucket SRQ" (BSRQ) support to better utilize registered
|
||
|
memory, including btl_openib_receive_queues MCA parameter
|
||
|
- Added ConnectX XRC support (and integrated with BSRQ)
|
||
|
- Added btl_openib_ib_max_inline_data MCA parameter
|
||
|
- Added iWARP support
|
||
|
- Revamped flow control mechansisms to be more efficient
|
||
|
- "mpi_leave_pinned=1" is now the default when possible,
|
||
|
automatically improving performance for large messages when
|
||
|
application buffers are re-used
|
||
|
- Elimiated duplicated error messages when multiple MPI processes fail
|
||
|
with the same error.
|
||
|
- Added NUMA support to the shared memory BTL.
|
||
|
- Add Valgrind-based memory checking for MPI-semantic checks.
|
||
|
- Add support for some optional Fortran datatypes (MPI_LOGICAL1,
|
||
|
MPI_LOGICAL2, MPI_LOGICAL4 and MPI_LOGICAL8).
|
||
|
- Remove the use of the STL from the C++ bindings.
|
||
|
- Added support for Platform/LSF job launchers. Must be Platform LSF
|
||
|
v7.0.2 or later.
|
||
|
- Updated ROMIO with the version from MPICH2 1.0.7.
|
||
|
- Added RDMA capable one-sided component (called rdma), which
|
||
|
can be used with BTL components that expose a full one-sided
|
||
|
interface.
|
||
|
- Added the optional datatype MPI_REAL2. As this is added to the "end of"
|
||
|
predefined datatypes in the fortran header files, there will not be
|
||
|
any compatibility issues.
|
||
|
- Added Portable Linux Processor Affinity (PLPA) for Linux.
|
||
|
- Addition of a finer symbols export control via the visibiliy feature
|
||
|
offered by some compilers.
|
||
|
- Added checkpoint/restart process fault tolerance support. Initially
|
||
|
support a LAM/MPI-like protocol.
|
||
|
- Removed "mvapi" BTL; all InfiniBand support now uses the OpenFabrics
|
||
|
driver stacks ("openib" BTL).
|
||
|
- Added more stringent MPI API parameter checking to help user-level
|
||
|
debugging.
|
||
|
- The ptmalloc2 memory manager component is now by default built as
|
||
|
a standalone library named libopenmpi-malloc. Users wanting to
|
||
|
use leave_pinned with ptmalloc2 will now need to link the library
|
||
|
into their application explicitly. All other users will use the
|
||
|
libc-provided allocator instead of Open MPI's ptmalloc2. This change
|
||
|
may be overriden with the configure option enable-ptmalloc2-internal
|
||
|
- The leave_pinned options will now default to using mallopt on
|
||
|
Linux in the cases where ptmalloc2 was not linked in. mallopt
|
||
|
will also only be available if munmap can be intercepted (the
|
||
|
default whenever Open MPI is not compiled with --without-memory-
|
||
|
manager.
|
||
|
- Open MPI will now complain and refuse to use leave_pinned if
|
||
|
no memory intercept / mallopt option is available.
|
||
|
- Add option of using Perl-based wrapper compilers instead of the
|
||
|
C-based wrapper compilers. The Perl-based version does not
|
||
|
have the features of the C-based version, but does work better
|
||
|
in cross-compile environments.
|
||
|
|
||
|
|
||
|
1.2.9: 14 Feb 2009
|
||
|
------------------
|
||
|
|
||
|
- Fix a segfault when using one-sided communications on some forms of derived
|
||
|
datatypes. Thanks to Dorian Krause for reporting the bug. See #1715.
|
||
|
- Fix an alignment problem affecting one-sided communications on
|
||
|
some architectures (e.g., SPARC64). See #1738.
|
||
|
- Fix compilation on Solaris when thread support is enabled in Open MPI
|
||
|
(e.g., when using --with-threads). See #1736.
|
||
|
- Correctly take into account the MTU that an OpenFabrics device port
|
||
|
is using. See #1722 and
|
||
|
https://bugs.openfabrics.org/show_bug.cgi?id=1369.
|
||
|
- Fix two datatype engine bugs. See #1677.
|
||
|
Thanks to Peter Kjellstrom for the bugreport.
|
||
|
- Fix the bml r2 help filename so the help message can be found. See #1623.
|
||
|
- Fix a compilation problem on RHEL4U3 with the PGI 32 bit compiler
|
||
|
caused by <infiniband/driver.h>. See ticket #1613.
|
||
|
- Fix the --enable-cxx-exceptions configure option. See ticket #1607.
|
||
|
- Properly handle when the MX BTL cannot open an endpoint. See ticket #1621.
|
||
|
- Fix a double free of events on the tcp_events list. See ticket #1631.
|
||
|
- Fix a buffer overun in opal_free_list_grow (called by MPI_Init).
|
||
|
Thanks to Patrick Farrell for the bugreport and Stephan Kramer for
|
||
|
the bugfix. See ticket #1583.
|
||
|
- Fix a problem setting OPAL_PREFIX for remote sh-based shells.
|
||
|
See ticket #1580.
|
||
|
|
||
|
|
||
|
1.2.8: 14 Oct 2008
|
||
|
------------------
|
||
|
|
||
|
- Tweaked one memory barrier in the openib component to be more conservative.
|
||
|
May fix a problem observed on PPC machines. See ticket #1532.
|
||
|
- Fix OpenFabrics IB partition support. See ticket #1557.
|
||
|
- Restore v1.1 feature that sourced .profile on remote nodes if the default
|
||
|
shell will not do so (e.g. /bin/sh and /bin/ksh). See ticket #1560.
|
||
|
- Fix segfault in MPI_Init_thread() if ompi_mpi_init() fails. See ticket #1562.
|
||
|
- Adjust SLURM support to first look for $SLURM_JOB_CPUS_PER_NODE instead of
|
||
|
the deprecated $SLURM_TASKS_PER_NODE environment variable. This change
|
||
|
may be *required* when using SLURM v1.2 and above. See ticket #1536.
|
||
|
- Fix the MPIR_Proctable to be in process rank order. See ticket #1529.
|
||
|
- Fix a regression introduced in 1.2.6 for the IBM eHCA. See ticket #1526.
|
||
|
|
||
|
|
||
|
1.2.7: 28 Aug 2008
|
||
|
------------------
|
||
|
|
||
|
- Add some Sun HCA vendor IDs. See ticket #1461.
|
||
|
- Fixed a memory leak in MPI_Alltoallw when called from Fortran.
|
||
|
Thanks to Dave Grote for the bugreport. See ticket #1457.
|
||
|
- Only link in libutil when it is needed/desired. Thanks to
|
||
|
Brian Barret for diagnosing and fixing the problem. See ticket #1455.
|
||
|
- Update some QLogic HCA vendor IDs. See ticket #1453.
|
||
|
- Fix F90 binding for MPI_CART_GET. Thanks to Scott Beardsley for
|
||
|
bringing it to our attention. See ticket #1429.
|
||
|
- Remove a spurious warning message generated in/by ROMIO. See ticket #1421.
|
||
|
- Fix a bug where command-line MCA parameters were not overriding
|
||
|
MCA parameters set from environment variables. See ticket #1380.
|
||
|
- Fix a bug in the AMD64 atomics assembly. Thanks to Gabriele Fatigati
|
||
|
for the bug report and bugfix. See ticket #1351.
|
||
|
- Fix a gather and scatter bug on intercommunicators when the datatype
|
||
|
being moved is 0 bytes. See ticket #1331.
|
||
|
- Some more man page fixes from the Debian maintainers.
|
||
|
See tickets #1324 and #1329.
|
||
|
- Have openib BTL (OpenFabrics support) check for the presence of
|
||
|
/sys/class/infiniband before allowing itself to be used. This check
|
||
|
prevents spurious "OMPI did not find RDMA hardware!" notices on
|
||
|
systems that have the software drivers installed, but no
|
||
|
corresponding hardware. See tickets #1321 and #1305.
|
||
|
- Added vendor IDs for some ConnectX openib HCAs. See ticket #1311.
|
||
|
- Fix some RPM specfile inconsistencies. See ticket #1308.
|
||
|
Thanks to Jim Kusznir for noticing the problem.
|
||
|
- Removed an unused function prototype that caused warnings on
|
||
|
some systems (e.g., OS X). See ticket #1274.
|
||
|
- Fix a deadlock in inter-communicator scatter/gather operations.
|
||
|
Thanks to Martin Audet for the bug report. See ticket #1268.
|
||
|
|
||
|
|
||
|
1.2.6: 7 Apr 2008
|
||
|
-----------------
|
||
|
|
||
|
- Fix a bug in the inter-allgather for asymmetric inter-communicators.
|
||
|
Thanks to Martin Audet for the bug report. See ticket #1247.
|
||
|
- Fix a bug in the openib BTL when setting the CQ depth. Thanks
|
||
|
to Jon Mason for the bug report and fix. See ticket #1245.
|
||
|
- On Mac OS X Leopard, the execinfo component will be used for
|
||
|
backtraces, making for a more durable solution. See ticket #1246.
|
||
|
- Added vendor IDs for some QLogic DDR openib HCAs. See ticket #1227.
|
||
|
- Updated the URL to get the latest config.guess and config.sub files.
|
||
|
Thanks to Ralf Wildenhues for the bug report. See ticket #1226.
|
||
|
- Added shared contexts support to PSM MTL. See ticket #1225.
|
||
|
- Added pml_ob1_use_early_completion MCA parameter to allow users
|
||
|
to turn off the OB1 early completion semantic and avoid "stall"
|
||
|
problems seen on InfiniBand in some cases. See ticket #1224.
|
||
|
- Sanitized some #define macros used in mpi.h to avoid compiler warnings
|
||
|
caused by MPI programs built with different autoconf versions.
|
||
|
Thanks to Ben Allan for reporting the problem, and thanks to
|
||
|
Brian Barrett for the fix. See ticket #1220.
|
||
|
- Some man page fixes from the Debian maintainers. See ticket #1219.
|
||
|
- Made the openib BTL a bit more resilient in the face of driver
|
||
|
errors. See ticket #1217.
|
||
|
- Fixed F90 interface for MPI_CART_CREATE. See ticket #1208.
|
||
|
Thanks to Michal Charemza for reporting the problem.
|
||
|
- Fixed some C++ compiler warnings. See ticket #1203.
|
||
|
- Fixed formatting of the orterun man page. See ticket #1202.
|
||
|
Thanks to Peter Breitenlohner for the patch.
|
||
|
|
||
|
|
||
|
1.2.5: 8 Jan 2008
|
||
|
-----------------
|
||
|
|
||
|
- Fixed compile issue with open() on Fedora 8 (and newer) platforms.
|
||
|
Thanks to Sebastian Schmitzdorff for noticing the problem.
|
||
|
- Added run-time warnings during MPI_INIT when MPI_THREAD_MULTIPLE
|
||
|
and/or progression threads are used (the OMPI v1.2 series does not
|
||
|
support these well at all).
|
||
|
- Better handling of ECONNABORTED from connect on Linux. Thanks to
|
||
|
Bob Soliday for noticing the problem; thanks to Brian Barrett for
|
||
|
submitting a patch.
|
||
|
- Reduce extraneous output from OOB when TCP connections must
|
||
|
be retried. Thanks to Brian Barrett for submitting a patch.
|
||
|
- Fix for ConnectX devices and OFED 1.3. See ticket #1190.
|
||
|
- Fixed a configure problem for Fortran 90 on Cray systems. Ticket #1189.
|
||
|
- Fix an uninitialized variable in the error case in opal_init.c.
|
||
|
Thanks to Åke Sandgren for pointing out the mistake.
|
||
|
- Fixed a hang in configure if $USER was not defined. Thanks to
|
||
|
Darrell Kresge for noticing the problem. See ticket #900.
|
||
|
- Added support for parallel debuggers even when we have an optimized build.
|
||
|
See ticket #1178.
|
||
|
- Worked around a bus error in the Mac OS X 10.5.X (Leopard) linker when
|
||
|
compiling Open MPI with -g. See ticket #1179.
|
||
|
- Removed some warnings about 'rm' from Mac OS X 10.5 (Leopard) builds.
|
||
|
- Fix the handling of mx_finalize(). See ticket #1177.
|
||
|
Thanks to Åke Sandgren for bringing this issue to our attention.
|
||
|
- Fixed minor file descriptor leak in the Altix timer code. Thanks to
|
||
|
Paul Hargrove for noticing the problem and supplying the fix.
|
||
|
- Fix a problem when using a different compiler for C and Objective C.
|
||
|
See ticket #1153.
|
||
|
- Fix segfault in MPI_COMM_SPAWN when the user specified a working
|
||
|
directory. Thanks to Murat Knecht for reporting this and suggesting
|
||
|
a fix.
|
||
|
- A few manpage fixes from the Debian Open MPI maintainers. Thanks to
|
||
|
Tilman Koschnick, Sylvestre Ledru, and Dirk Eddelbuettel.
|
||
|
- Fixed issue with pthread detection when compilers are not all
|
||
|
from the same vendor. Thanks to Åke Sandgren for the bug
|
||
|
report. See ticket #1150.
|
||
|
- Fixed vector collectives in the self module. See ticket #1166.
|
||
|
- Fixed some data-type engine bugs: an indexing bug, and an alignment bug.
|
||
|
See ticket #1165.
|
||
|
- Only set the MPI_APPNUM attribute if it is defined. See ticket
|
||
|
#1164.
|
||
|
|
||
|
|
||
|
1.2.4: 26 Sep 2007
|
||
|
------------------
|
||
|
|
||
|
- Really added support for TotalView/DDT parallel debugger message queue
|
||
|
debugging (it was mistakenly listed as "added" in the 1.2 release).
|
||
|
- Fixed a build issue with GNU/kFreeBSD. Thanks to Petr Salinger for
|
||
|
the patch.
|
||
|
- Added missing MPI_FILE_NULL constant in Fortran. Thanks to
|
||
|
Bernd Schubert for bringing this to our attention.
|
||
|
- Change such that the UDAPL BTL is now only built in Linux when
|
||
|
explicitly specified via the --with-udapl configure command line
|
||
|
switch.
|
||
|
- Fixed an issue with umask not being propagated when using the TM
|
||
|
launcher.
|
||
|
- Fixed behavior if number of slots is not the same on all bproc nodes.
|
||
|
- Fixed a hang on systems without GPR support (ex. Cray XT3/4).
|
||
|
- Prevent users of 32-bit MPI apps from requesting >= 2GB of shared
|
||
|
memory.
|
||
|
- Added a Portals MTL.
|
||
|
- Fix 0 sized MPI_ALLOC_MEM requests. Thanks to Lisandro Dalcín for
|
||
|
pointing out the problem.
|
||
|
- Fixed a segfault crash on large SMPs when doing collectives.
|
||
|
- A variety of fixes for Cray XT3/4 class of machines.
|
||
|
- Fixed which error handler is used when MPI_COMM_SELF is passed
|
||
|
to MPI_COMM_FREE. Thanks to Lisandro Dalcín for the bug report.
|
||
|
- Fixed compilation on platforms that don't have hton/ntoh.
|
||
|
- Fixed a logic problem in the fortran binding for MPI_TYPE_MATCH_SIZE.
|
||
|
Thanks to Jeff Dusenberry for pointing out the problem and supplying
|
||
|
the fix.
|
||
|
- Fixed a problem with MPI_BOTTOM in various places of the f77-interface.
|
||
|
Thanks to Daniel Spangberg for bringing this up.
|
||
|
- Fixed problem where MPI-optional Fortran datatypes were not
|
||
|
correctly initialized.
|
||
|
- Fixed several problems with stdin/stdout forwarding.
|
||
|
- Fixed overflow problems with the sm mpool MCA parameters on large SMPs.
|
||
|
- Added support for the DDT parallel debugger via orterun's --debug
|
||
|
command line option.
|
||
|
- Added some sanity/error checks to the openib MCA parameter parsing
|
||
|
code.
|
||
|
- Updated the udapl BTL to use RDMA capabilities.
|
||
|
- Allow use of the BProc head node if it was allocated to the user.
|
||
|
Thanks to Sean Kelly for reporting the problem and helping debug it.
|
||
|
- Fixed a ROMIO problem where non-blocking I/O errors were not properly
|
||
|
reported to the user.
|
||
|
- Made remote process launch check the $SHELL environment variable if
|
||
|
a valid shell was not otherwise found for the user.
|
||
|
Thanks to Alf Wachsmann for the bugreport and suggested fix.
|
||
|
- Added/updated some vendor IDs for a few openib HCAs.
|
||
|
- Fixed a couple of failures that could occur when specifying devices
|
||
|
for use by the OOB.
|
||
|
- Removed dependency on sysfsutils from the openib BTL for
|
||
|
libibverbs >=v1.1 (i.e., OFED 1.2 and beyond).
|
||
|
|
||
|
|
||
|
1.2.3: 20 Jun 2007
|
||
|
------------------
|
||
|
|
||
|
- Fix a regression in comm_spawn functionality that inadvertently
|
||
|
caused the mapping of child processes to always start at the same
|
||
|
place. Thanks to Prakash Velayutham for helping discover the
|
||
|
problem.
|
||
|
- Fix segfault when a user's home directory is unavailable on a remote
|
||
|
node. Thanks to Guillaume Thomas-Collignon for bringing the issue
|
||
|
to our attention.
|
||
|
- Fix MPI_IPROBE to properly handle MPI_STATUS_IGNORE on mx and psm
|
||
|
MTLs. Thanks to Sophia Corwell for finding this and supplying a
|
||
|
reproducer.
|
||
|
- Fix some error messages in the tcp BTL.
|
||
|
- Use _NSGetEnviron instead of environ on Mac OS X so that there
|
||
|
are no undefined symbols in the shared libraries.
|
||
|
- On OS X, when MACOSX_DEPLOYMENT_TARGET is 10.3 or higher, support
|
||
|
building the Fortran 90 bindings as a shared library. Thanks to
|
||
|
Jack Howarth for his advice on making this work.
|
||
|
- No longer require extra include flag for the C++ bindings.
|
||
|
- Fix detection of weak symbols support with Intel compilers.
|
||
|
- Fix issue found by Josh England: ompi_info would not show framework
|
||
|
MCA parameters set in the environment properly.
|
||
|
- Rename the oob_tcp_include/exclude MCA params to oob_tcp_if_include/exclude
|
||
|
so that they match the naming convention of the btl_tcp_if_include/exclude
|
||
|
params. The old names are depreciated, but will still work.
|
||
|
- Add -wd as a synonym for the -wdir orterun/mpirun option.
|
||
|
- Fix the mvapi BTL to compile properly with compilers that do not support
|
||
|
anonymous unions. Thanks to Luis Kornblueh for reporting the bug.
|
||
|
|
||
|
|
||
|
1.2.2: 16 May 2007
|
||
|
------------------
|
||
|
|
||
|
- Fix regression in 1.2.1 regarding the handling of $CC with both
|
||
|
absolute and relative path names.
|
||
|
- Fix F90 array of status dimensions. Thanks to Randy Bramley for
|
||
|
noticing the problem.
|
||
|
- Add btl_openib_ib_pkey_value MCA parameter for controlling IB port selection.
|
||
|
- Fixed a variety of threading/locking bugs.
|
||
|
- Fixed some compiler warnings associated with ROMIO, OS X, and gridengine.
|
||
|
- If pbs-config can be found, use it to look for TM support. Thanks
|
||
|
to Bas van der Vlies for the inspiration and preliminary work.
|
||
|
- Fixed a deadlock in orterun when the rsh PLS encounters some errors.
|
||
|
|
||
|
|
||
|
1.2.1: 25 Apr 2007
|
||
|
------------------
|
||
|
|
||
|
- Fixed a number of connection establishment errors in the TCP out-
|
||
|
of-band messaging system.
|
||
|
- Fixed a memory leak when using mpi_comm calls.
|
||
|
Thanks to Bas van der Vlies for reporting the problem.
|
||
|
- Fixed various memory leaks in OPAL and ORTE.
|
||
|
- Improved launch times when using TM (PBS Pro, Torque, Open PBS).
|
||
|
- Fixed mpi_leave_pinned to work for all datatypes.
|
||
|
- Fix functionality allowing users to disable sbrk() (the
|
||
|
mpool_base_disable_sbrk MCA parameter) on platforms that support it.
|
||
|
- Fixed a pair of problems with the TCP "listen_thread" mode for the
|
||
|
oob_tcp_listen_mode MCA parameter that would cause failures when
|
||
|
attempting to launch applications.
|
||
|
- Fixed a segfault if there was a failure opening a BTL MX endpoint.
|
||
|
- Fixed a problem with mpirun's --nolocal option introduced in 1.2.
|
||
|
- Re-enabled MPI_COMM_SPAWN_MULTIPLE from singletons.
|
||
|
- LoadLeveler and TM configure fixes, Thanks to Martin Audet for the
|
||
|
bug report.
|
||
|
- Various C++ MPI attributes fixes.
|
||
|
- Fixed issues with backtrace code on 64 bit Intel & PPC OS X builds.
|
||
|
- Fixed issues with multi-word CC variables and libtool.
|
||
|
Thanks to Bert Wesarg for the bug reports.
|
||
|
- Fix issue with non-uniform node naming schemes in SLURM.
|
||
|
- Fix file descriptor leak in the Grid Engine/N1GE support.
|
||
|
- Fix compile error on OS X 10.3.x introduced with Open MPI 1.1.5.
|
||
|
- Implement MPI_TYPE_CREATE_DARRAY function (was in 1.1.5 but not 1.2).
|
||
|
- Recognize zsh shell when using rsh/ssh for launching MPI jobs.
|
||
|
- Ability to set the OPAL_DESTDIR or OPAL_PREFIX environment
|
||
|
variables to "re-root" an existing Open MPI installation.
|
||
|
- Always include -I for Fortran compiles, even if the prefix is
|
||
|
/usr/local.
|
||
|
- Support for "fork()" in MPI applications that use the
|
||
|
OpenFabrics stack (OFED v1.2 or later).
|
||
|
- Support for setting specific limits on registered memory.
|
||
|
|
||
|
|
||
|
1.2: 15 Mar 2007
|
||
|
----------------
|
||
|
|
||
|
- Fixed race condition in the shared memory fifo's, which led to
|
||
|
orphaned messages.
|
||
|
- Corrected the size of the shared memory file - subtracted out the
|
||
|
space the header was occupying.
|
||
|
- Add support for MPI_2COMPLEX and MPI_2DOUBLE_COMPLEX.
|
||
|
- Always ensure to create $(includedir)/openmpi, even if the C++
|
||
|
bindings are disabled so that the wrapper compilers don't point to
|
||
|
a directory that doesn't exist. Thanks to Martin Audet for
|
||
|
identifying the problem.
|
||
|
- Fixes for endian handling in MPI process startup.
|
||
|
- Openib BTL initialization fixes for cases where MPI processes in the
|
||
|
same job has different numbers of active ports on the same physical
|
||
|
fabric.
|
||
|
- Print more descriptive information when displaying backtraces on
|
||
|
OS's that support this functionality, such as the hostname and PID
|
||
|
of the process in question.
|
||
|
- Fixes to properly handle MPI exceptions in C++ on communicators,
|
||
|
windows, and files.
|
||
|
- Much more reliable runtime support, particularly with regards to MPI
|
||
|
job startup scalability, BProc support, and cleanup in failure
|
||
|
scenarios (e.g., MPI_ABORT, MPI processes abnormally terminating,
|
||
|
etc.).
|
||
|
- Significant performance improvements for MPI collectives,
|
||
|
particularly on high-speed networks.
|
||
|
- Various fixes in the MX BTL component.
|
||
|
- Fix C++ typecast problems with MPI_ERRCODES_IGNORE. Thanks to
|
||
|
Satish Balay for bringing this to our attention.
|
||
|
- Allow run-time specification of the maximum amount of registered
|
||
|
memory for OpenFabrics and GM.
|
||
|
- Users who utilize the wrapper compilers (e.g., mpicc and mpif77)
|
||
|
will not notice, but the underlying library names for ORTE and OPAL
|
||
|
have changed to libopen-rte and libopen-pal, respectively (listed
|
||
|
here because there are undoubtedly some users who are not using the
|
||
|
wrapper compilers).
|
||
|
- Many bug fixes to MPI-2 one-sided support.
|
||
|
- Added support for TotalView message queue debugging.
|
||
|
- Fixes for MPI_STATUS_SET_ELEMENTS.
|
||
|
- Print better error messages when mpirun's "-nolocal" is used when
|
||
|
there is only one node available.
|
||
|
- Added man pages for several Open MPI executables and the MPI API
|
||
|
functions.
|
||
|
- A number of fixes for Alpha platforms.
|
||
|
- A variety of Fortran API fixes.
|
||
|
- Build the Fortran MPI API as a separate library to allow these
|
||
|
functions to be profiled properly.
|
||
|
- Add new --enable-mpirun-prefix-by-default configure option to always
|
||
|
imply the --prefix option to mpirun, preventing many rsh/ssh-based
|
||
|
users from needing to modify their shell startup files.
|
||
|
- Add a number of missing constants in the C++ bindings.
|
||
|
- Added tight integration with Sun N1 Grid Engine (N1GE) 6 and the
|
||
|
open source Grid Engine.
|
||
|
- Allow building the F90 MPI bindings as shared libraries for most
|
||
|
compilers / platforms. Explicitly disallow building the F90
|
||
|
bindings as shared libraries on OS X because of complicated
|
||
|
situations with Fortran common blocks and lack of support for
|
||
|
unresolved common symbols in shared libraries.
|
||
|
- Added stacktrace support for Solaris and Mac OS X.
|
||
|
- Update event library to libevent-1.1b.
|
||
|
- Fixed standards conformance issues with MPI_ERR_TRUNCATED and
|
||
|
setting MPI_ERROR during MPI_TEST/MPI_WAIT.
|
||
|
- Addition of "cm" PML to better support library-level matching
|
||
|
interconnects, with support for Myrinet/MX, and QLogic PSM-based
|
||
|
networks.
|
||
|
- Addition of "udapl" BTL for transport across uDAPL interconnects.
|
||
|
- Really check that the $CXX given to configure is a C++ compiler
|
||
|
(not a C compiler that "sorta works" as a C++ compiler).
|
||
|
- Properly check for local host only addresses properly, looking
|
||
|
for 127.0.0.0/8, rather than just 127.0.0.1.
|
||
|
|
||
|
|
||
|
1.1.5: 19 Mar 2007
|
||
|
------------------
|
||
|
|
||
|
- Implement MPI_TYPE_CREATE_DARRAY function.
|
||
|
- Fix race condition in shared memory BTL startup that could cause MPI
|
||
|
applications to hang in MPI_INIT.
|
||
|
- Fix syntax error in a corner case of the event library. Thanks to
|
||
|
Bert Wesarg for pointing this out.
|
||
|
- Add new MCA parameter (mpi_preconnect_oob) for pre-connecting the
|
||
|
"out of band" channels between all MPI processes. Most helpful for
|
||
|
MPI applications over InfiniBand where process A sends an initial
|
||
|
message to process B, but process B does not enter the MPI library
|
||
|
for a long time.
|
||
|
- Fix for a race condition in shared memory locking semantics.
|
||
|
- Add major, minor, and release version number of Open MPI to mpi.h.
|
||
|
Thanks to Martin Audet for the suggestion.
|
||
|
- Fix the "restrict" compiler check in configure.
|
||
|
- Fix a problem with argument checking in MPI_TYPE_CREATE_SUBARRAY.
|
||
|
- Fix a problem with compiling the XGrid components with non-gcc
|
||
|
compilers.
|
||
|
|
||
|
|
||
|
1.1.4: 30 Jan 2007
|
||
|
------------------
|
||
|
|
||
|
- Fixed 64-bit alignment issues with TCP interface detection on
|
||
|
intel-based OS X machines.
|
||
|
- Adjusted TCP interface selection to automatically ignore Linux
|
||
|
channel-bonded slave interfaces.
|
||
|
- Fixed the type of the first parameter to the MPI F90 binding for
|
||
|
MPI_INITIALIZED. Thanks to Tim Campbell for pointing out the
|
||
|
problem.
|
||
|
- Fix a bunch of places in the Fortran MPI bindings where (MPI_Fint*)
|
||
|
was mistakenly being used instead of (MPI_Aint*).
|
||
|
- Fixes for fortran MPI_STARTALL, which could sometimes return
|
||
|
incorrect request values. Thanks to Tim Campbell for pointing out
|
||
|
the problem.
|
||
|
- Include both pre- and post-MPI-2 errata bindings for
|
||
|
MPI::Win::Get_attr.
|
||
|
- Fix math error on Intel OS X platforms that would greatly increase
|
||
|
shared memory latency.
|
||
|
- Fix type casting issue with MPI_ERRCODES_IGNORE that would cause
|
||
|
errors when using a C++ compiler. Thanks to Barry Smith for
|
||
|
bringing this to our attention.
|
||
|
- Fix possible segmentation fault during shutdown when using the
|
||
|
MX BTL.
|
||
|
|
||
|
|
||
|
1.1.3: 26 Jan 2007
|
||
|
------------------
|
||
|
|
||
|
- Remove the "hierarch" coll component; it was not intended to be
|
||
|
included in stable releases yet.
|
||
|
- Fix a race condition with stdout/stderr not appearing properly from
|
||
|
all processes upon termination of an MPI job.
|
||
|
- Fix internal accounting errors with the self BTL.
|
||
|
- Fix typos in the code path for when sizeof(int) != sizeof(INTEGER)
|
||
|
in the MPI F77 bindings functions. Thanks to Pierre-Matthieu
|
||
|
Anglade for bringing this problem to our attention.
|
||
|
- Fix for a memory leak in the derived datatype function
|
||
|
ompi_ddt_duplicate(). Thanks to Andreas Schäfer for reporting,
|
||
|
diagnosing, and patching the leak.
|
||
|
- Used better performing basic algorithm for MPI_ALLGATHERV.
|
||
|
- Added a workaround for a bug in the Intel 9.1 C++ compiler (all
|
||
|
versions up to and including 20060925) in the MPI C++ bindings that
|
||
|
caused run-time failures. Thanks to Scott Weitzenkamp for reporting
|
||
|
this problem.
|
||
|
- Fix MPI_SIZEOF implementation in the F90 bindings for COMPLEX
|
||
|
variable types.
|
||
|
- Fixes for persistent requests involving MPI_PROC_NULL. Thanks to
|
||
|
Lisandro Dalcín for reporting the problem.
|
||
|
- Fixes to MPI_TEST* and MPI_WAIT* for proper MPI exception reporting.
|
||
|
Thanks to Lisandro Dalcín for finding the issue.
|
||
|
- Various fixes for MPI generalized request handling; addition of
|
||
|
missing MPI::Grequest functionality to the C++ bindings.
|
||
|
- Add "mpi_preconnect_all" MCA parameter to force wireup of all MPI
|
||
|
connections during MPI_INIT (vs. making connections lazily whenever
|
||
|
the first MPI communication occurs between a pair of peers).
|
||
|
- Fix a problem for when $FC and/or $F77 were specified as multiple
|
||
|
tokens. Thanks to Orion Poplawski for identifying the problem and
|
||
|
to Ralf Wildenhues for suggesting the fix.
|
||
|
- Fix several MPI_*ERRHANDLER* functions and MPI_GROUP_TRANSLATE_RANKS
|
||
|
with respect to what arguments they allowed and the behavior that
|
||
|
they effected. Thanks to Lisandro Dalcín for reporting the
|
||
|
problems.
|
||
|
|
||
|
|
||
|
1.1.2: 18 Oct 2006
|
||
|
------------------
|
||
|
|
||
|
- Really fix Fortran status handling in MPI_WAITSOME and MPI_TESTSOME.
|
||
|
- Various datatype fixes, reported by several users as causing
|
||
|
failures in the BLACS testing suite. Thanks to Harald Forbert, Åke
|
||
|
Sandgren and, Michael Kluskens for reporting the problem.
|
||
|
- Correctness and performance fixes for heterogeneous environments.
|
||
|
- Fixed a error in command line parsing on some platforms (causing
|
||
|
mpirun to crash without doing anything).
|
||
|
- Fix for initialization hangs on 64 bit Mac OS X PowerPC systems.
|
||
|
- Fixed some memory allocation problems in mpirun that could cause
|
||
|
random problems if "-np" was not specified on the command line.
|
||
|
- Add Kerberos authentication support for XGrid.
|
||
|
- Added LoadLeveler support for jobs larger than 128 tasks.
|
||
|
- Fix for large-sized Fortran LOGICAL datatypes.
|
||
|
- Fix various error checking in MPI_INFO_GET_NTHKEY and
|
||
|
MPI_GROUP_TRANSLATE_RANKS, and some collective operations
|
||
|
(particularly with regards to MPI_IN_PLACE). Thanks to Lisandro
|
||
|
Dalcín for reporting the problems.
|
||
|
- Fix receiving messages to buffers allocated by MPI_ALLOC_MEM.
|
||
|
- Fix a number of race conditions with the MPI-2 Onesided
|
||
|
interface.
|
||
|
- Fix the "tuned" collective componenete where some cases where
|
||
|
MPI_BCAST could hang.
|
||
|
- Update TCP support to support non-uniform TCP environments.
|
||
|
- Allow the "poe" RAS component to be built on AIX or Linux.
|
||
|
- Only install mpif.h if the rest of the Fortran bindings are
|
||
|
installed.
|
||
|
- Fixes for BProc node selection.
|
||
|
- Add some missing Fortran MPI-2 IO constants.
|
||
|
|
||
|
|
||
|
1.1.1: 28 Aug 2006
|
||
|
------------------
|
||
|
|
||
|
- Fix for Fortran string handling in various MPI API functions.
|
||
|
- Fix for Fortran status handling in MPI_WAITSOME and MPI_TESTSOME.
|
||
|
- Various fixes for the XL compilers.
|
||
|
- Automatically disable using mallot() on AIX.
|
||
|
- Memory fixes for 64 bit platforms with registering MCA parameters in
|
||
|
the self and MX BTL components.
|
||
|
- Fixes for BProc to support oversubscription and changes to the
|
||
|
mapping algorithm so that mapping processes "by slot" works as
|
||
|
expected.
|
||
|
- Fixes for various abort cases to not hang and clean up nicely.
|
||
|
- If using the Intel 9.0 v20051201 compiler on an IA64 platform, the
|
||
|
ptmalloc2 memory manager component will automatically disable
|
||
|
itself. Other versions of the Intel compiler on this platform seem
|
||
|
to work fine (e.g., 9.1).
|
||
|
- Added "host" MPI_Info key to MPI_COMM_SPAWN and
|
||
|
MPI_COMM_SPAWN_MULTIPLE.
|
||
|
- Add missing C++ methods: MPI::Datatype::Create_indexed_block,
|
||
|
MPI::Datatype::Create_resized, MPI::Datatype::Get_true_extent.
|
||
|
- Fix OSX linker issue with Fortran bindings.
|
||
|
- Fixed MPI_COMM_SPAWN to start spawning new processes in slots that
|
||
|
(according to Open MPI) are not already in use.
|
||
|
- Added capability to "mpirun a.out" (without specifying -np) that
|
||
|
will run on all currently-allocated resources (e.g., within a batch
|
||
|
job such as SLURM, Torque, etc.).
|
||
|
- Fix a bug with one particular case of MPI_BCAST. Thanks to Doug
|
||
|
Gregor for identifying the problem.
|
||
|
- Ensure that the shared memory mapped file is only created when there
|
||
|
is more than one process on a node.
|
||
|
- Fixed problems with BProc stdin forwarding.
|
||
|
- Fixed problem with MPI_TYPE_INDEXED datatypes. Thanks to Yven
|
||
|
Fournier for identifying this problem.
|
||
|
- Fix some thread safety issues in MPI attributes and the openib BTL.
|
||
|
- Fix the BProc allocator to not potentially use the same resources
|
||
|
across multiple ORTE universes.
|
||
|
- Fix gm resource leak.
|
||
|
- More latency reduction throughout the code base.
|
||
|
- Make the TM PLS (PBS Pro, Torque, Open PBS) more scalable, and fix
|
||
|
some latent bugs that crept in v1.1. Thanks to the Thunderbird crew
|
||
|
at Sandia National Laboratories and Martin Schaffoner for access to
|
||
|
testing facilities to make this happen.
|
||
|
- Added new command line options to mpirun:
|
||
|
--nolocal: Do not run any MPI processes on the same node as mpirun
|
||
|
(compatibility with the OSC mpiexec launcher)
|
||
|
--nooversubscribe: Abort if the number of processes requested would
|
||
|
cause oversubscription
|
||
|
--quiet / -q: do not show spurious status messages
|
||
|
--version / -V: show the version of Open MPI
|
||
|
- Fix bus error in XGrid process starter. Thanks to Frank from the
|
||
|
Open MPI user's list for identifying the problem.
|
||
|
- Fix data size mismatches that caused memory errors on PPC64
|
||
|
platforms during the startup of the openib BTL.
|
||
|
- Allow propagation of SIGUSR1 and SIGUSR2 signals from mpirun to
|
||
|
back-end MPI processes.
|
||
|
- Add missing MPI::Is_finalized() function.
|
||
|
|
||
|
|
||
|
1.1: 23 Jun 2006
|
||
|
----------------
|
||
|
|
||
|
- Various MPI datatype fixes, optimizations.
|
||
|
- Fixed various problems on the SPARC architecture (e.g., not
|
||
|
correctly aligning addresses within structs).
|
||
|
- Improvements in various run-time error messages to be more clear
|
||
|
about what they mean and where the errors are occurring.
|
||
|
- Various fixes to mpirun's handling of --prefix.
|
||
|
- Updates and fixes for Cray/Red Storm support.
|
||
|
- Major improvements to the Fortran 90 MPI bindings:
|
||
|
- General improvements in compile/linking time and portability
|
||
|
between different F90 compilers.
|
||
|
- Addition of "trivial", "small" (the default), and "medium"
|
||
|
Fortran 90 MPI module sizes (v1.0.x's F90 module was
|
||
|
equivalent to "medium"). See the README file for more
|
||
|
explanation.
|
||
|
- Fix various MPI F90 interface functions and constant types to
|
||
|
match. Thanks to Michael Kluskens for pointing out the problems
|
||
|
to us.
|
||
|
- Allow short messagees to use RDMA (vs. send/receive semantics) to a
|
||
|
limited number peers in both the mvapi and openib BTL components.
|
||
|
This reduces communication latency over IB channels.
|
||
|
- Numerous performance improvements throughout the entire code base.
|
||
|
- Many minor threading fixes.
|
||
|
- Add a define OMPI_SKIP_CXX to allow the user to skip the mpicxx.h from
|
||
|
being included in mpi.h. It allows the user to compile C code with a CXX
|
||
|
compiler without including the CXX bindings.
|
||
|
- PERUSE support has been added. In order to activate it add
|
||
|
--enable-peruse to the configure options. All events described in
|
||
|
the PERUSE 2.0 draft are supported, plus one Open MPI
|
||
|
extension. PERUSE_COMM_REQ_XFER_CONTINUE allow to see how the data
|
||
|
is segmented internally, using multiple interfaces or the pipeline
|
||
|
engine. However, this version only support one event of each type
|
||
|
simultaneously attached to a communicator.
|
||
|
- Add support for running jobs in heterogeneous environments.
|
||
|
Currently supports environments with different endianness and
|
||
|
different representations of C++ bool and Fortran LOGICAL.
|
||
|
Mismatched sizes for other datatypes is not supported.
|
||
|
- Open MPI now includes an implementation of the MPI-2 One-Sided
|
||
|
Communications specification.
|
||
|
- Open MPI is now configurable in cross-compilation environments.
|
||
|
Several Fortran 77 and Fortran 90 tests need to be pre-seeded with
|
||
|
results from a config.cache-like file.
|
||
|
- Add --debug option to mpirun to generically invoke a parallel debugger.
|
||
|
|
||
|
|
||
|
1.0.3: Not released (all fixes included in 1.1)
|
||
|
-----------------------------------------------
|
||
|
|
||
|
- Fix a problem noted by Chris Hennes where MPI_INFO_SET incorrectly
|
||
|
disallowed long values.
|
||
|
- Fix a problem in the launch system that could cause inconsistent
|
||
|
launch behavior, particularly when launching large jobs.
|
||
|
- Require that the openib BTL find <sysfs/libsysfs.h>. Thanks to Josh
|
||
|
Aune for the suggestion.
|
||
|
- Include updates to support the upcoming Autoconf 2.60 and Libtool
|
||
|
2.0. Thanks to Ralf Wildenhues for all the work!
|
||
|
- Fix bug with infinite loop in the "round robin" process mapper.
|
||
|
Thanks to Paul Donohue for reporting the problem.
|
||
|
- Enusre that memory hooks are removed properly during MPI_FINALIZE.
|
||
|
Thanks to Neil Ludban for reporting the problem.
|
||
|
- Various fixes to the included support for ROMIO.
|
||
|
- Fix to ensure that MPI_LONG_LONG and MPI_LONG_LONG_INT are actually
|
||
|
synonyms, as defined by the MPI standard. Thanks to Martin Audet
|
||
|
for reporting this.
|
||
|
- Fix Fortran 90 configure tests to properly utilize LDFLAGS and LIBS.
|
||
|
Thanks to Terry Reeves for reporting the problem.
|
||
|
- Fix shared memory progression in asynchronous progress scenarios.
|
||
|
Thanks to Mykael Bouquey for reporting the problem.
|
||
|
- Fixed back-end operations for predefined MPI_PROD for some
|
||
|
datatypes. Thanks to Bert Wesarg for reporting this.
|
||
|
- Adapted configure to be able to handle Torque 2.1.0p0's (and above)
|
||
|
new library name. Thanks to Brock Palen for pointing this out and
|
||
|
providing access to a Torque 2.1.0p0 cluster to test with.
|
||
|
- Fixed situation where mpirun could set a shell pipeline's stdout
|
||
|
to non-blocking, causing the shell pipeline to prematurely fail.
|
||
|
Thanks to Darrell Kresge for figuring out what was happening.
|
||
|
- Fixed problems with leave_pinned that could cause Badness with the
|
||
|
mvapi BTL.
|
||
|
- Fixed problems with MPI_FILE_OPEN and non-blocking MPI-2 IO access.
|
||
|
- Fixed various InfiniBand port matching issues during startup.
|
||
|
Thanks to Scott Weitzenkamp for identifying these problems.
|
||
|
- Fixed various configure, build and run-time issues with ROMIO.
|
||
|
Thanks to Dries Kimpe for bringing them to our attention.
|
||
|
- Fixed error in MPI_COMM_SPLIT when dealing with intercommunicators.
|
||
|
Thanks to Bert Wesarg for identifying the problem.
|
||
|
- Fixed backwards handling of "high" parameter in MPI_INTERCOMM_MERGE.
|
||
|
Thanks to Michael Kluskens for pointing this out to us.
|
||
|
- Fixed improper handling of string arguments in Fortran bindings
|
||
|
for MPI-IO functionality
|
||
|
- Fixed segmentation fault with 64 bit applications on Solaris when
|
||
|
using the shared memory transports.
|
||
|
- Fixed MPI_COMM_SELF attributes to free properly at the beginning of
|
||
|
MPI_FINALIZE. Thanks to Martin Audet for bringing this to our
|
||
|
attention.
|
||
|
- Fixed alignment tests for cross-compiling to not cause errors with
|
||
|
recent versions of GCC.
|
||
|
|
||
|
|
||
|
1.0.2: 7 Apr 2006
|
||
|
-----------------
|
||
|
|
||
|
- Fixed assembly race condition on AMD64 platforms.
|
||
|
- Fixed residual .TRUE. issue with copying MPI attributes set from
|
||
|
Fortran.
|
||
|
- Remove unnecessary logic from Solaris pty I/O forwarding. Thanks to
|
||
|
Francoise Roch for bringing this to our attention.
|
||
|
- Fixed error when count = 0 was given for multiple completion MPI
|
||
|
functions (MPI_TESTSOME, MPI_TESTANY, MPI_TESTALL, MPI_WAITSOME,
|
||
|
MPI_WAITANY, MPI_WAITALL).
|
||
|
- Better handling in MPI_ABORT for when peer processes have already
|
||
|
died, especially under some resource managers.
|
||
|
- Random updates to README file, to include notes about the Portland
|
||
|
compilers.
|
||
|
- Random, small threading fixes to prevent deadlock.
|
||
|
- Fixed a problem with handling long mpirun app files. Thanks to Ravi
|
||
|
Manumachu for identifying the problem.
|
||
|
- Fix handling of strings in several of the Fortran 77 bindings.
|
||
|
- Fix LinuxPPC assembly issues. Thanks to Julian Seward for reporting
|
||
|
the problem.
|
||
|
- Enable pty support for standard I/O forwarding on platforms that
|
||
|
have ptys but do not have openpty(). Thanks to Pierre Valiron for
|
||
|
bringing this to our attention.
|
||
|
- Disable inline assembly for PGI compilers to avoid compiler errors.
|
||
|
Thanks to Troy Telford for bringing this to our attention.
|
||
|
- Added MPI_UNSIGNED_CHAR and MPI_SIGNED_CHAR to the allowed reduction
|
||
|
types.
|
||
|
- Fix a segv in variable-length message displays on Opterons running
|
||
|
Solaris. Thanks to Pierre Valiron for reporting the issue.
|
||
|
- Added MPI_BOOL to the intrinsic reduction operations MPI_LAND,
|
||
|
MPI_LOR, MPI_LXOR. Thanks to Andy Selle for pointing this out to us.
|
||
|
- Fixed TCP BTL network matching logic during MPI_INIT; in some cases
|
||
|
on multi-NIC nodes, a NIC could get paired with a NIC on another
|
||
|
network (typically resulting in deadlock). Thanks to Ken Mighell
|
||
|
for pointing this out to us.
|
||
|
- Change the behavior of orterun (mpirun, mpirexec) to search for
|
||
|
argv[0] and the cwd on the target node (i.e., the node where the
|
||
|
executable will be running in all systems except BProc, where the
|
||
|
searches are run on the node where orterun is invoked).
|
||
|
- Fix race condition in shared memory transport that could cause
|
||
|
crashes on machines with weak memory consistency models (including
|
||
|
POWER/PowerPC machines).
|
||
|
- Fix warnings about setting read-only MCA parameters on bproc systems.
|
||
|
- Change the exit status set by mpirun when an application process is
|
||
|
killed by a signal. The exit status is now set to signo + 128, which
|
||
|
conforms with the behavior of (almost) all shells.
|
||
|
- Correct a datatype problem with the convertor when partially
|
||
|
unpacking data. Now we can position the convertor to any position
|
||
|
not only on the predefined types boundaries. Thanks to Yvan Fournier
|
||
|
for reporting this to us.
|
||
|
- Fix a number of standard I/O forwarding issues, including the
|
||
|
ability to background mpirun and a loss of data issue when
|
||
|
redirecting mpirun's standard input from a file.
|
||
|
- Fixed bug in ompi_info where rcache and bml MCA parameters would not
|
||
|
be displayed.
|
||
|
- Fixed umask issues in the session directory. Thanks to Glenn Morris
|
||
|
for reporting this to us.
|
||
|
- Fixed tcsh-based LD_LIBRARY_PATH issues with --prefix. Thanks to
|
||
|
Glen Morris for identifying the problem and suggesting the fix.
|
||
|
- Removed extraneous \n's when setting PATH and LD_LIBRARY_PATH in the
|
||
|
rsh startup. Thanks to Glen Morris for finding these typos.
|
||
|
- Fixed missing constants in MPI C++ bindings.
|
||
|
- Fixed some errors caused by threading issues.
|
||
|
- Fixed openib BTL flow control logic to not overrun the number of
|
||
|
send wqes available.
|
||
|
- Update to match newest OpenIB user-level library API. Thanks to
|
||
|
Roland Dreier for submitting this patch.
|
||
|
- Report errors properly when failing to register memory in the openib
|
||
|
BTL.
|
||
|
- Reduce memory footprint of openib BTL.
|
||
|
- Fix parsing problem with mpirun's "-tv" switch. Thanks to Chris
|
||
|
Gottbrath for supplying the fix.
|
||
|
- Fix Darwin net/if.h configure warning.
|
||
|
- The GNU assembler unbelievably defaults to making stacks executable.
|
||
|
So when using gas, add flags to explicitly tell it to not make
|
||
|
stacks executable (lame but necessary).
|
||
|
- Add missing MPI::Request::Get_status() methods. Thanks to Bill
|
||
|
Saphir for pointing this out to us.
|
||
|
- Improved error messages on memory registration errors (e.g., when
|
||
|
using high-speed networks).
|
||
|
- Open IB support now checks firmware for how many outstanding RDMA
|
||
|
requests are supported. Thanks to Mellanox for pointing this out to
|
||
|
us.
|
||
|
- Enable printing of stack traces in MPI processes upon SIGBUS,
|
||
|
SIGSEGV, and SIGFPE if the platform supports it.
|
||
|
- Fixed F90 compilation support for the Lahey compiler.
|
||
|
- Fixed issues with ROMIO shared library support.
|
||
|
- Fixed internal accounting problems with rsh support.
|
||
|
- Update to GNU Libtool 1.5.22.
|
||
|
- Fix error in configure script when setting CCAS to ias (the Intel
|
||
|
assembler).
|
||
|
- Added missing MPI::Intercomm collectives.
|
||
|
- Fixed MPI_IN_PLACE handling for Fortran collectives.
|
||
|
- Fixed some more C++ const_cast<> issues. Thanks for Martin Audet
|
||
|
(again) for bringing this to our attention.
|
||
|
- Updated ROMIO with the version from MPICH 1.2.7p1, marked as version
|
||
|
2005-06-09.
|
||
|
- Fixes for some cases where the use of MPI_BOTTOM could cause
|
||
|
problems.
|
||
|
- Properly handle the case where an mVAPI does not have shared receive
|
||
|
queue support (such as the one shipped by SilverStorm / Infinicon
|
||
|
for OS X).
|
||
|
|
||
|
|
||
|
1.0.1: 12 Dec 2005
|
||
|
------------------
|
||
|
|
||
|
- Fixed assembly on Solaris AMD platforms. Thanks to Pierre Valiron
|
||
|
for bringing this to our attention.
|
||
|
- Fixed long messages in the send-to-self case.
|
||
|
- Ensure that when the "leave_pinned" option is used, the memory hooks
|
||
|
are also enabled. Thanks to Gleb Natapov for pointing this out.
|
||
|
- Fixed compile errors for IRIX.
|
||
|
- Allow hostfiles to have integer host names (for BProc clusters).
|
||
|
- Fixed a problem with message matching of out-of-order fragments in
|
||
|
multiple network device scenarios.
|
||
|
- Converted all the C++ MPI bindings to use proper const_cast<>'s
|
||
|
instead of old C-style casts to get rid of const-ness. Thanks to
|
||
|
Martin Audet for raising the issue with us.
|
||
|
- Converted MPI_Offset to be a typedef instead of a #define because it
|
||
|
causes problems for some C++ parsers. Thanks to Martin Audet for
|
||
|
bringing this to our attention.
|
||
|
- Improved latency of TCP BTL.
|
||
|
- Fixed index value in MPI_TESTANY to be MPI_UNDEFINED if some
|
||
|
requests were not MPI_REQUEST_NULL, but no requests finished.
|
||
|
- Fixed several Fortran MPI API implementations that incorrectly used
|
||
|
integers instead of logicals or address-sized integers.
|
||
|
- Fix so that Open MPI correctly handles the Fortran value for .TRUE.,
|
||
|
regardless of what the Fortran compiler's value for .TRUE. is.
|
||
|
- Improved scalability of MX startup.
|
||
|
- Fix datatype offset handling in the coll basic component's
|
||
|
MPI_SCATTERV implementation.
|
||
|
- Fix EOF handling on stdin.
|
||
|
- Fix missing MPI_F_STATUS_IGNORE and MPI_F_STATUSES_IGNORE
|
||
|
instanatiations. Thanks to Anthony Chan for pointing this out.
|
||
|
- Add a missing value for MPI_WIN_NULL in mpif.h.
|
||
|
- Bring over some fixes for the sm btl that somehow didn't make it
|
||
|
over from the trunk before v1.0. Thanks to Beth Tibbitts and Bill
|
||
|
Chung for helping identify this issue.
|
||
|
- Bring over some fixes for the iof that somehow didn't make it over
|
||
|
from the trunk before v1.0.
|
||
|
- Fix for --with-wrapper-ldflags handling. Thanks to Dries Kimpe for
|
||
|
pointing this out to us.
|
||
|
|
||
|
|
||
|
1.0: 17 Nov 2005
|
||
|
----------------
|
||
|
|
||
|
Initial public release.
|