Version 0.14

Version 0.14.1

Trains

Features and Bug Fixes

  • Add OpenMPI/Slurm support, including the TRAINS_FORCE_MASTER_NODE environment variable, which forces a node to be treated as the master regardless of its MPI rank.
  • Add Task.get_models() and Task.models for retrieving stored models for previously executed tasks.
  • Support multiple EventWriter in TensorFlow eager mode (TF 2.0+) (Issue #26).
  • Support reusing models.
  • Fix support for not-a-number (NaN) and infinity (Inf) values in TensorFlow (Issue #118).
  • Fix joblib binding.
  • Fix metrics/files/models upload on shutdown.
  • Fix support for initializing and closing a Task multiple times.
  • Update manual_reporting.py example with Logger.report_table() example.
  • Deprecate Task.set_model_config(), Task.get_model_config_text() and Task.get_model_config_dict().
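
The master-node override described above can be pictured with a minimal sketch. It is not the Trains implementation: the function name is_master_node, the accepted "true" values, and the fallback order are assumptions made for illustration. Only TRAINS_FORCE_MASTER_NODE comes from these notes; OMPI_COMM_WORLD_RANK and SLURM_PROCID are the rank variables that OpenMPI and Slurm export.

```python
import os

# Assumption: which strings count as "true" for the override variable.
TRUE_VALUES = {"1", "true", "yes", "on"}


def is_master_node(environ=None):
    """Return True if this process should act as the master node.

    Hypothetical helper: the override wins, otherwise rank 0 is the master.
    """
    env = os.environ if environ is None else environ

    # An explicit override takes priority over any launcher-provided rank.
    forced = env.get("TRAINS_FORCE_MASTER_NODE", "").strip().lower()
    if forced in TRUE_VALUES:
        return True

    # Fall back to the rank exported by OpenMPI or Slurm, if any.
    for name in ("OMPI_COMM_WORLD_RANK", "SLURM_PROCID"):
        value = env.get(name)
        if value is None:
            continue
        try:
            return int(value) == 0
        except ValueError:
            continue  # Ignore a malformed rank and try the next variable.

    # No launcher detected: a single process is its own master.
    return True
```

Passing a plain dictionary instead of os.environ makes the decision easy to check without launching an MPI job.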

Trains Server

Features and Bug Fixes

  • Support default Trains Server Kubernetes Helm ports in the Trains WebApp.
  • Fix issue switching scalar plots between linear and logarithmic views (Issue #75).
  • Do not reset view filtering when switching tabs in experiment view.

Trains Agent

Features and Bug Fixes

  • Add daemon detached mode (--detached, -d), which runs the agent as a daemon in the background and returns immediately.
  • Auto mount ~/.git-credentials into a Docker container, if the file exists.
  • Add TRAINS_AGENT_EXTRA_PYTHON_PATH environment variable for adding an extra Python path during experiment execution (helpful when using additional untracked modules).
  • Fix "run as user" feature (using TRAINS_AGENT_EXEC_USER environment variable).
  • Fix PyTorch support to ignore minor versions when looking for a package to install/download.
  • Fix experiment execution output handling.

Version 0.14.0

Trains

Features and Bug Fixes

  • Add support for reporting tables (pandas.DataFrame, CSV files, URL to CSV file) using the Logger.report_table() method.
  • Add support for setting a Task's initial iteration to allow continuing previous runs using the Task.set_initial_iteration() and Task.get_initial_iteration() methods.
  • Add multiprocess task support (Issue #96).
  • Add trains-init support for config file override using the TRAINS_CONFIG_FILE environment variable or --file command line argument (Issue #105).
  • Add initial Slurm support.
  • Allow disabling repository detection when calling Task.init().
  • Add Jupyter Notebook server execution example.
  • Update examples.
  • Improve warning messages for storage errors and missing storage packages.
  • Fix HTTP link quoting in stored links.
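
A minimal sketch of how the table reporting and initial-iteration items above might be combined when continuing a run. The helper names, the CSV columns, and the title/series strings are made up for illustration. The task and logger objects are passed in rather than created here; in real use they would be expected to come from Task.init() and task.get_logger(). The keyword arguments used for Logger.report_table() (title, series, iteration, csv) are assumed from the method's documented usage and should be checked against the installed version.

```python
import csv


def write_metrics_csv(rows, path):
    """Write (epoch, accuracy) rows to a CSV file and return its path."""
    with open(path, "w", newline="") as handle:
        writer = csv.writer(handle)
        writer.writerow(["epoch", "accuracy"])  # Illustrative column names.
        writer.writerows(rows)
    return path


def resume_and_report(task, logger, last_iteration, csv_path):
    """Continue from a previous run and report a CSV file as a table.

    `task` and `logger` are expected to behave like a Trains Task and Logger.
    """
    # Offset later iterations so they continue where the previous run ended.
    task.set_initial_iteration(last_iteration)

    # Report the CSV file as a table; title and series are placeholders.
    logger.report_table(
        title="metrics",
        series="validation",
        iteration=0,
        csv=csv_path,
    )
```

Keeping the Trains objects as parameters means the helper can be exercised with simple stand-ins before it is pointed at a real server.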

Trains Server

Features and Bug Fixes

  • Add debug image viewing for an unlimited number of iterations, filtering by metric, and comparing debug images for different iterations.
  • Add debug image view zoom, fit to screen, resize, hover coordinates (Issue #68).
  • Add switching scalar plots between linear and logarithmic views (Issue #75).
  • Add scalar minimums and maximums to the experiment comparison view.
  • Add opening HTML artifact files in the browser.
  • Add WebApp notification for version updates.
  • Add toggle button to switch between full screen and standard views in the RESULTS tab.
  • Add a load more button to the global search results.
  • Add automatic detection of debug video type from the file extension.
  • Add Task.get_parameters_as_dict() and Task.set_parameters_as_dict().
  • Allow editing model description.
  • Allow editing experiment uncommitted changes and installed packages separately.
  • Allow deleting an input model from a draft experiment.
  • Fix dialog elements that were not accessible at non-100% screen scale (Issue #89).
  • Fix custom columns in the experiments table not properly loaded from browser store (Issue #106).
  • Fix issue with showing 2D graphs containing sub-plots (trains #slack).
  • Fix issues with vertical scrolling of experiment logs.
  • Fix issue with image plots sometimes not visible in experiments comparison.
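
The Task.get_parameters_as_dict() and Task.set_parameters_as_dict() pair listed above suggests a read-modify-write pattern, sketched below. The helper name override_parameters and the nested dictionary layout are assumptions for illustration; the actual structure returned by a Trains Task may differ, so treat this as a sketch rather than a reference.

```python
import copy


def _deep_update(target, updates):
    """Recursively merge `updates` into `target` in place."""
    for key, value in updates.items():
        if isinstance(value, dict) and isinstance(target.get(key), dict):
            _deep_update(target[key], value)
        else:
            target[key] = value


def override_parameters(task, overrides):
    """Read a task's parameters, apply overrides, and write them back.

    `task` is expected to behave like a Trains Task object.
    """
    # Copy first so the dictionary returned by the task is not mutated.
    params = copy.deepcopy(task.get_parameters_as_dict())
    _deep_update(params, overrides)
    task.set_parameters_as_dict(params)
    return params
```

For example, override_parameters(task, {"optimizer": {"lr": 0.01}}) would change a single nested value while leaving the other parameters untouched, assuming such a key exists in the task's parameters.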

Trains Agent

Features and Bug Fixes

  • Add support for trains-agent execute --id <experiment-id> --docker that allows executing a specific experiment inside a Docker container.
  • Add support for trains-agent execute --id <template-experiment-id> --clone that clones the provided experiment and executes the cloned experiment.
  • Add support for APIClient.models.delete() to allow programmatically deleting a model (Trains Server Issue #32).
  • Add daemon support for passing storage-related OS environment variables to experiments executed inside a Docker container (supported by trains>=0.13.3):
    • AWS: AWS_ACCESS_KEY_ID, AWS_SECRET_ACCESS_KEY and AWS_DEFAULT_REGION
    • Azure: AZURE_STORAGE_ACCOUNT and AZURE_STORAGE_KEY
    • Google: GOOGLE_APPLICATION_CREDENTIALS
  • Fix Git checkout with submodules (Issue #112).
  • A Docker image specified on the command line now takes precedence over the Docker image specified in an experiment.
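
To make the storage-variable pass-through above concrete, here is a minimal sketch that turns the listed variables into docker run arguments. The helper name docker_env_args is hypothetical and this is not the agent's code; it only relies on Docker's standard -e NAME=value option and the variable names given in the list above.

```python
import os

# Storage-related variables named in the release notes above.
STORAGE_ENV_VARS = (
    "AWS_ACCESS_KEY_ID",
    "AWS_SECRET_ACCESS_KEY",
    "AWS_DEFAULT_REGION",
    "AZURE_STORAGE_ACCOUNT",
    "AZURE_STORAGE_KEY",
    "GOOGLE_APPLICATION_CREDENTIALS",
)


def docker_env_args(environ=None):
    """Build `-e NAME=value` arguments for the storage variables that are set.

    Hypothetical helper for illustration only.
    """
    env = os.environ if environ is None else environ
    args = []
    for name in STORAGE_ENV_VARS:
        value = env.get(name)
        if value:  # Skip variables that are unset or empty.
            args.extend(["-e", "{}={}".format(name, value)])
    return args
```

The resulting list can be spliced into a docker run command line; variables that are not set on the host are simply omitted.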