754 Commits (852ee3e40e8e32abf354cc68738e5e3f9e43ae23)

Author SHA1 Message Date
Julian M. Kunkel 852ee3e40e AIORI POSIX use internal debug macros, include errno in msg. 2021-01-21 11:06:13 +00:00
Julian Kunkel d5f5cf974d
Merge pull request #318 from hpc/fix-comm
Refactoring: remove global comm_world communicator from utilities.c
2021-01-20 18:25:53 +00:00
Julian Kunkel 9e0a8c44d5
IOR: move verbose output of detailed errors to verbosity level 1. (#315)
By default, IOR outputs only the number of errors.
Reason: The volume of error messages can be overwhelming, particularly in a parallel program (gigabytes...).
A single -v raises the verbosity level to provide the extra details without adding too many other messages.
2021-01-20 15:06:45 +00:00
Julian M. Kunkel f345a78046 Removed dependency. 2021-01-20 15:00:33 +00:00
Julian M. Kunkel 970c5ef139 Adjust complex tests for changed -z behavior. 2021-01-20 14:57:21 +00:00
Julian M. Kunkel 40c6d97e72 Replaced MPI_COMM_WORLD where needed with testComm. 2021-01-20 14:35:06 +00:00
Julian M. Kunkel e4120d600d Reduce the dependency on the global MPI communicator 2021-01-20 14:06:05 +00:00
Julian Kunkel 58fbefbd33
Bugfix IOR offsetting (fix before was incomplete). (#317) 2021-01-19 23:19:01 +00:00
Julian Kunkel 19ad73568a
Bugfix ior stonewalling case. (#316)
2021-01-19 22:28:34 +00:00
Julian Kunkel 3daf7a2d0f
Bugfix: valgrind memory issues (#314) 2021-01-19 21:23:30 +00:00
Mohamad Chaarawi fa316d5d24
dfs: conditionally compile out svcl setting (#313)
The SVCL argument is being removed from the DAOS API, so conditionally
compile out while maintaining backwards compatibility to versions where
 it's still required.
2021-01-19 13:43:01 +00:00
Julian Kunkel b2089514e3
MDTest calculate min/max/mean across iterations #300. (#312)
The calculation per iteration first computes the value of the slowest process, i.e., highest time or lowest rate. This is then the value for the iteration.
Secondly, calculate the min/max/mean across iterations.
For tree operations, the value is identical to previous as only Rank 0 is involved.
2021-01-18 21:30:16 +00:00
Julian Kunkel 33b70ecbc8
Fix MDTest multidir option to work with stonewall file. (#308) 2021-01-14 17:38:11 +00:00
Julian Kunkel 4a96436928
MDTest convert FATAL to WARN (#307) 2021-01-14 16:41:56 +00:00
Julian Kunkel d339caa501
Updated test (illustration of an empty test) and distclean for make distcheck #304 (#305)
Resolves #304
2021-01-04 10:50:38 -08:00
Julian M. Kunkel 21bf5a5a12 NCMPI ported to current ADIO. Fixed autoconf #303. 2020-12-25 16:49:04 +00:00
Glenn K. Lockwood eb883f533a update release docs based on 3.3.0 release experience 2020-12-23 18:17:47 -08:00
Glenn K. Lockwood e579531970 update NEWS for 3.3 release 2020-12-23 17:06:56 -08:00
Glenn K. Lockwood f467903cb9
draft of a new simplified release process (#297) 2020-12-23 11:56:49 +00:00
Julian Kunkel 8de13884a7
HDFS module ported to current AIORI API and improved (#302)
* HDFS module ported to current AIORI API and extended
* Added instructions on how to build and run with HDFS.
* Replaced read with pread to support offsets.
* Implemented HDFS MDTest API
* Improved sync semantics
2020-12-23 11:51:31 +00:00
Julian Kunkel 3e4a0f69d9
Merge pull request #296 from mchaarawi/mschaara/fix_dfs_stat_rc
aiori-DFS: stat should not be fatal
2020-12-16 15:53:43 +00:00
Mohamad Chaarawi 6675cd50bf aiori-DFS: stat should not be fatal
ior now expects stat to not be fatal. update the DFS driver to not
exit if stat fails since the file can simply not exist.

Signed-off-by: Mohamad Chaarawi <mohamad.chaarawi@intel.com>
2020-12-16 15:41:58 +00:00
Frank 0bd52884e8
added region and location to libs3 (#295)
2020-12-11 13:05:24 +00:00
Julian Kunkel fbee4139d8
Merge pull request #292 from ofaaland/b-fix-mpi-checkf
Do not execute functions twice in MPI_CHECKF
2020-12-04 09:20:21 +00:00
Julian Kunkel 21237e019a
Merge pull request #294 from hpc/fix-calc-stonewall
Fix calc stonewall
2020-12-03 21:11:41 +00:00
Julian M. Kunkel 69e006dd89 IOR clarify return API. 2020-12-03 21:07:45 +00:00
Julian M. Kunkel 073da08292 IOR: print overwrite check only if a write operation is requested. 2020-12-03 20:54:51 +00:00
Olaf Faaland 231868505d Do not execute functions twice in MPI_CHECKF
Assigning MPI_STATUS to a local variable and then referring to the
local will ensure that the same value is used in both the conditional
expression and the call to MPI_Error_string.

Otherwise, when MPI_STATUS is a function call, like

  MPI_CHECKF(fubar(), "%s", "error in fubar");

fubar() is called twice.  If there are underlying intermittent errors,
the error code/message for the first call is lost, with confusing output
like this:

  read      2206.18    17.27      145.93      262144     131072
  0.272595   291.88     0.290829   292.41
  ERROR: cannot access explicit, noncollective, MPI MPI_SUCCESS: no errors, (aiori-MPIIO.c:451)
2020-12-03 12:21:06 -08:00
Julian Kunkel a8ab766483
Merge pull request #270 from hpc/fix-offset-integration
Fix offset integration
2020-12-03 12:08:53 +00:00
Julian Kunkel 09d2c3511b
Merge pull request #289 from hpc/fix-ior-no-warn
IOR add warning if file exists #273. To stop on warnings, use existing flag --warningAsErrors
2020-12-02 10:41:44 +00:00
Julian Kunkel ffbdfb9bec
Merge pull request #288 from hpc/fix-modules
Fix modules
2020-12-02 10:22:22 +00:00
Julian Kunkel b417505dfd
Merge branch 'master' into fix-modules 2020-12-02 10:21:59 +00:00
Julian M. Kunkel f71a144b19 IOR raise warning for existing file once if not filePerProc. 2020-12-02 10:14:28 +00:00
Julian Kunkel 464f562f44
Merge pull request #290 from mchaarawi/dfs_coll_open
DFS: make use of collective open to share file handle
2020-12-02 09:15:37 +00:00
Mohamad Chaarawi 938cf2771b DFS: make use of collective open to share file handle
- remove un-needed libdaos_common lib since DAOS driver is removed
- use default container object class when it's not specified

Signed-off-by: Mohamad Chaarawi <mohamad.chaarawi@intel.com>
2020-12-01 21:36:25 +00:00
Julian M. Kunkel da03ae5c9f IOR add warning if file exists #273. To stop on warnings, use existing flag --warningAsErrors 2020-12-01 18:24:09 +00:00
Julian Kunkel 691324f17b
Merge pull request #287 from hpc/fix-warning
Fix compiler issue with redefinition via typedef.
2020-12-01 17:44:14 +00:00
Julian M. Kunkel 9dcf9f79e4 Moved barriers to right location. 2020-12-01 14:47:57 +00:00
Julian M. Kunkel 546eb05d15 Rudimentary support for stonewall added, addresses #261. 2020-12-01 14:45:07 +00:00
Julian M. Kunkel 5825dbae35 Provide new option randomPrefill for random access (-z only) that prefills the file with the specified blocksize, e.g., 2m. See issue #270.
So far, this option works only without stonewalling!
2020-12-01 13:58:40 +00:00
Julian Kunkel d1db4cbfe0
Merge pull request #286 from rfmohr/fix-undeclared-unique_dir_per_task
mdtest: fix reference to unique_dir_per_task for builds --with-lustre
2020-12-01 09:40:39 +00:00
Richard Mohr 41184c69e2 mdtest: fix reference to unique_dir_per_task for builds --with-lustre 2020-12-01 01:22:31 -05:00
Julian Kunkel 4a3e4806bd
Merge pull request #281 from hpc/fix-mdtest-iter
Bugfix MDTest calculation of multiple iterations was incorrect.

Fix the bug reported by Rick to increase clarity. The previous offset calculation when using multiple iterations was:
for (i = start; i < stop; i++) // i = table position == test number
  for (k = 0; k < size; k++)
    for (j = 0; j < iterations; j++)
      value = all[(k * tableSize * iterations) + (j * tableSize) + i];

Note that the mean and min/max were then computed over these values.
But as the values were stored in memory in the order iteration, rank, table,
the correct term is: value = all[j * tableSize * size + k * tableSize + i];

Assume iterations = 2 and size = 3, the value for the test i=0 was computed from:
all[0 * 2 *tbl + 0 * tbl] = 0tbl
all[0 * 2 *tbl + 1 * tbl] = 1tbl
all[1 * 2 *tbl + 0 * tbl] = 2tbl
all[1 * 2 *tbl + 1 * tbl] = 3tbl
all[2 * 2 *tbl + 0 * tbl] = 4tbl
all[2 * 2 *tbl + 1 * tbl] = 5tbl

A clearer traversal would have been:
all[0 * 3 *tbl + 0 * tbl] = 0tbl
all[0 * 3 *tbl + 1 * tbl] = 1tbl
all[0 * 3 *tbl + 2 * tbl] = 2tbl
all[1 * 3 *tbl + 0 * tbl] = 3tbl
all[1 * 3 *tbl + 1 * tbl] = 4tbl
all[1 * 3 *tbl + 2 * tbl] = 5tbl

In that sense, it wasn't a functional bug but it decreased readability and now that we want to print the performance of the individual ranks, it is useful to fix this.
2020-11-30 14:17:42 +00:00
Julian M. Kunkel ae06908a93 Merge branch 'master' into fix-mdtest-iter 2020-11-30 14:16:41 +00:00
Julian Kunkel 312edaa99c
Merge pull request #283 from hpc/fix-mdtest-refactor
Refactoring: MDTest
This PR refactors the 50+ static variables to be part of a global option structure, removing ambiguity about which scope a variable belongs to.

At the moment, the code depends too heavily on global variables.
Ultimately, the goal was to make stonewalling work with the multi-dir approach, but it turned out that this requires much more refactoring, so it is being done piece by piece.
2020-11-30 14:11:22 +00:00
Julian M. Kunkel 11c784c8bd Integrate review feedback. 2020-11-28 10:40:41 +00:00
Julian M. Kunkel ae23523a70 Integrate review feedback 2020-11-28 10:34:20 +00:00
Julian M. Kunkel 5799e4ef3a MDTest remove unneeded variable. 2020-11-27 18:02:14 +00:00
Julian M. Kunkel fd51654393 Re-add check for mdtest for now. 2020-11-27 17:51:31 +00:00
Julian M. Kunkel fbf976351a MDTest refactoring: move all global static variables into the global static structure. 2020-11-27 17:49:45 +00:00