Skip to content
GitLab
Explore
Sign in
Primary navigation
Search or go to…
Project
L
lammps
Manage
Activity
Members
Labels
Plan
Issues
Issue boards
Milestones
Wiki
Code
Merge requests
Repository
Branches
Commits
Tags
Repository graph
Compare revisions
Snippets
Build
Pipelines
Jobs
Pipeline schedules
Artifacts
Deploy
Releases
Container Registry
Model registry
Operate
Environments
Monitor
Incidents
Analyze
Value stream analytics
Contributor analytics
CI/CD analytics
Repository analytics
Model experiments
Help
Help
Support
GitLab documentation
Compare GitLab plans
Community forum
Contribute to GitLab
Provide feedback
Keyboard shortcuts
?
Snippets
Groups
Projects
Show more breadcrumbs
multiscale
lammps
Commits
9a8c0928
Commit
9a8c0928
authored
6 years ago
by
Axel Kohlmeyer
Browse files
Options
Downloads
Patches
Plain Diff
use Hyper-Threading consistently
parent
eca0487d
No related branches found
Branches containing commit
No related tags found
No related merge requests found
Changes
2
Hide whitespace changes
Inline
Side-by-side
Showing
2 changed files
doc/src/Speed_kokkos.txt
+7
-7
7 additions, 7 deletions
doc/src/Speed_kokkos.txt
doc/src/package.txt
+1
-1
1 addition, 1 deletion
doc/src/package.txt
with
8 additions
and
8 deletions
doc/src/Speed_kokkos.txt
+
7
−
7
View file @
9a8c0928
...
@@ -93,12 +93,12 @@ The "t Nt" option specifies how many OpenMP threads per MPI task to
...
@@ -93,12 +93,12 @@ The "t Nt" option specifies how many OpenMP threads per MPI task to
use with a node. The default is Nt = 1, which is MPI-only mode. Note
use with a node. The default is Nt = 1, which is MPI-only mode. Note
that the product of MPI tasks * OpenMP threads/task should not exceed
that the product of MPI tasks * OpenMP threads/task should not exceed
the physical number of cores (on a node), otherwise performance will
the physical number of cores (on a node), otherwise performance will
suffer. If
h
yper
t
hreading is enabled, then the product of MPI
tasks *
suffer. If
H
yper
-T
hreading
(HT)
is enabled, then the product of MPI
OpenMP threads/task should not exceed the physical number of
cores *
tasks *
OpenMP threads/task should not exceed the physical number of
hardware threads. The "-k on" switch also issues a
"package kokkos"
cores *
hardware threads. The "-k on" switch also issues a
command (with no additional arguments) which sets
various KOKKOS
"package kokkos"
command (with no additional arguments) which sets
options to default values, as discussed on the
"package"_package.html
various KOKKOS
options to default values, as discussed on the
command doc page.
"package"_package.html
command doc page.
The "-sf kk" "command-line switch"_Run_options.html will automatically
The "-sf kk" "command-line switch"_Run_options.html will automatically
append the "/kk" suffix to styles that support it. In this manner no
append the "/kk" suffix to styles that support it. In this manner no
...
@@ -149,7 +149,7 @@ Intel Knight's Landing (KNL) Xeon Phi:
...
@@ -149,7 +149,7 @@ Intel Knight's Landing (KNL) Xeon Phi:
KNL Intel Phi chips have 68 physical cores. Typically 1 to 4 cores are
KNL Intel Phi chips have 68 physical cores. Typically 1 to 4 cores are
reserved for the OS, and only 64 or 66 cores are used. Each core has 4
reserved for the OS, and only 64 or 66 cores are used. Each core has 4
h
yper
t
hreads,so there are effectively N = 256 (4*64) or N = 264 (4*66)
H
yper
-T
hreads,so there are effectively N = 256 (4*64) or N = 264 (4*66)
cores to run on. The product of MPI tasks * OpenMP threads/task should
cores to run on. The product of MPI tasks * OpenMP threads/task should
not exceed this limit, otherwise performance will suffer. Note that
not exceed this limit, otherwise performance will suffer. Note that
with the KOKKOS package you do not need to specify how many KNLs there
with the KOKKOS package you do not need to specify how many KNLs there
...
...
This diff is collapsed.
Click to expand it.
doc/src/package.txt
+
1
−
1
View file @
9a8c0928
...
@@ -363,7 +363,7 @@ specified with the OMP_NUM_THREADS environment variable or the {omp}
...
@@ -363,7 +363,7 @@ specified with the OMP_NUM_THREADS environment variable or the {omp}
keyword). The extra thread is dedicated for performing part of the
keyword). The extra thread is dedicated for performing part of the
"PPPM solver"_kspace_style.html computations and communications. This
"PPPM solver"_kspace_style.html computations and communications. This
can improve parallel performance on processors supporting
can improve parallel performance on processors supporting
Simultaneous Multithreading (SMT) such as Hyper
t
hreading on Intel
Simultaneous Multithreading (SMT) such as Hyper
-T
hreading
(HT)
on Intel
processors. In this mode, one additional thread is generated per MPI
processors. In this mode, one additional thread is generated per MPI
process. LAMMPS will generate a warning in the case that more threads
process. LAMMPS will generate a warning in the case that more threads
are used than available in SMT hardware on a node. If the PPPM solver
are used than available in SMT hardware on a node. If the PPPM solver
...
...
This diff is collapsed.
Click to expand it.
Preview
0%
Loading
Try again
or
attach a new file
.
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Save comment
Cancel
Please
register
or
sign in
to comment