An integrated dataset of near-surface Eulerian fields and Lagrangian trajectories from an ocean model

Elipot, Shane; Faigle, Eli; Arbic, Brian K.; Shriver, Jay F.

doi:10.1038/s41597-024-03813-z

Download PDF

Data Descriptor
Open access
Published: 29 August 2024

An integrated dataset of near-surface Eulerian fields and Lagrangian trajectories from an ocean model

Shane Elipot¹,
Eli Faigle ORCID: orcid.org/0009-0001-6723-531X¹,
Brian K. Arbic² &
…
Jay F. Shriver³

Scientific Data volume 11, Article number: 943 (2024) Cite this article

2629 Accesses
1 Citations
10 Altmetric
Metrics details

Subjects

Abstract

A dataset consisting of numerically simulated oceanic velocities and sea surface height changes, provided conjointly from Eulerian and Lagrangian points of view, is made available as cloud-optimized archives on a cloud storage platform for unrestricted access. The Eulerian component of the dataset comprises oceanic velocity components at 0 m and 15 m depth, as well as total and steric sea surface height changes, obtained at hourly time steps for one year, with an approximate horizontal resolution of 1/25 degree on an irregular global geographical spatial grid, from the HYbrid Coordinate Ocean Model. The Lagrangian component of the dataset comprises the trajectories of particles advected in the Eulerian velocity field of the model. The particles were advected forward and backward for 30 days from a regular 1/4 degree grid in order to achieve 60-day long trajectories at 0 m and 15 m depths, with start times separated by 30 days, in 11 releases. This integrated dataset may help to link Eulerian and Lagrangian observational perspectives.

A century-long eddy-resolving simulation of global oceanic large- and mesoscale state

Article Open access 11 November 2022

Lagrangian betweenness as a measure of bottlenecks in dynamical systems with oceanographic examples

Article Open access 16 August 2021

An ocean front dataset for the Mediterranean sea and southwest Indian ocean

Article Open access 21 October 2023

Background & Summary

The dataset presented¹ consists of numerically simulated oceanic velocities and sea surface height changes, provided conjointly from Eulerian and Lagrangian points of view. The overarching goal for these data is to be used by oceanographers and climate scientists who want to seek answers to questions related to near-surface ocean motions and sea level changes. Examples of such questions include: What are the spatial and temporal scales of oceanic currents variability? How does sea level vary in different oceanic regions? How do pollutants, plastics, organic or non-organic particles get dispersed and transported in the ocean? Another category of questions, which may be answered by this dataset because it provides both Eulerian and Lagrangian perspectives, is related to the way the global ocean is currently observed, or how it can be better observed in the future. For example, can we use freely drifting buoys to accurately estimate the kinetic energy of ocean currents? Can surface drifting buoys be equipped with global navigation satellite system to measure sea level changes²?

Because the global ocean cannot be observed everywhere all at once, oceanographic research notably relies on the data from numerical simulations to further understand the real ocean. The specific code, or ocean model, that generated the data comprising this current dataset, the HYbrid Coordinate Ocean Model (HYCOM), is publicly available (www.hycom.org). However, actually running the model and saving its Eulerian outputs, as well as generating the subsequent Lagrangian dataset described here, requires high performance computing and large physical storage, which are not commonly available. Therefore, this dataset aims at democratizing this advanced tool by being described here and by being available on a freely accessible cloud storage platform. A similar Eulerian dataset, emanating from another state-of-the-art ocean model (MITgcm, see³), is available through a data portal of the National Aeronautics and Space Administration (data.nas.nasa.gov/ecco/eccodata/llc_4320/). A comparison of these HYCOM and MITgcm simulations has been recently conducted to evaluate their respective strengths and limitations⁴.

This dataset presents an innovative integration of both Eulerian and Lagrangian perspectives on oceanographic data, thereby offering an enhanced framework for oceanographers to analyze and interpret ocean dynamics. The Eulerian view, derived from a conventional gridded representation native to HYCOM, and the Lagrangian view, based on simulated particle trajectories, are juxtaposed to provide comprehensive insights into oceanographic phenomena from fixed and mobile observation platforms. Such platforms include satellite observations and moored instruments, as well as drifting devices like surface buoys.

The dataset’s novelty extends beyond its availability to its true accessibility. The two components of the dataset, the Eulerian and the Lagrangian, are hosted on a single cloud platform (Amazon S3), and are formatted with zarr (zarr.readthedocs.io) to optimize cloud-based access and analyses. The Eulerian component of this dataset has already been available, yet from a University server requiring to log in and as 17,519 individual NetCDF files, and thus has not been amenable to facile access and analysis. In contrast, the Lagrangian component of the dataset represents a completely novel dataset component and has not been made available previously. This integrated dataset may help to link Eulerian and Lagrangian observational perspectives, and to enhance the accessibility and utility of oceanographic data for scientific research.

Methods

HYCOM simulation

The ocean velocity components (zonal and meridional) at 0 m and 15 m depth, and sea surface height (SSH) variables (total and steric), are from the outputs of a 1-year simulation (calendar year 2014) of the HYbrid Coordinate Ocean Model (HYCOM), run in a non-assimilative configuration. Previous publications describe the underlying dynamical core⁵ and the details of the simulation used here⁴ and are succinctly reported here. This HYCOM simulation is global, with an approximate horizontal resolution of 1/25 degree (4.4 km at the Equator), includes 41 hybrid vertical layers, and is integrated with a 75-second baroclinic time step⁵. HYCOM utilizes terrain-following coordinates in coastal regions and isopycnal coordinates in open ocean regions. In the upper open ocean, HYCOM comprises 14 z-coordinates layers with thickness varying between 1.00 m and 6.87 m within the top 30 m of the ocean. Ocean velocity at the surface, or 0 m, actually refers to velocity of the top layer of the model which has a thickness of 1 m. The 15 m ocean velocities were derived by interpolating linearly the velocities of the two layers centered at 13.185 m and 18.55 m depths. Such depth levels were originally chosen with the purpose of comparing the modeled velocities to the data of the Global Surface Drifter Array/NOAA Global Drifter Program^4,6. This program maintains an array of approximately 1300 surface drifters equipped with a drogue centered at 15 m depth and their displacements are expected to be representative of oceanic velocities at that depth. Once the drogue of a drifter detaches, its displacements are closer to be representative of oceanic velocities at the surface⁷.

HYCOM uses the K profile parameterization (known as KPP) for vertical mixing within the ocean surface boundary layer⁸. This HYCOM simulation is forced by 3-hour atmospheric fields from the U.S. Navy Global Environmental Model⁹, converted to surface fluxes using bulk formulas¹⁰. The wind stress is calculated using 10-m wind speed relative to the model surface ocean speed. This HYCOM simulation includes tidal forcing with the two largest diurnal components (K₁ and O₁) and the three largest semidiurnal components (M₂, S₂, and N₂). The Self-attraction and Loading forcing term¹¹ is applied from the altimetry-constrained TPXO8 barotropic tide model¹². The HYCOM parameterized topographic wave drag scheme¹³ is tuned to minimize the M₂ surface elevation errors with respect to TPXO8.

HYCOM uses a discretization of the globe with a C-grid for which ocean velocity components are defined on staggered grids, and SSH variables are defined on a grid interspaced between the velocity grids. For this dataset, however, outputs of the model were interpolated and archived on a common (X, Y) spatial grid with dimensions (9000, 7055). The latitude and longitude of the points on this grid are non-regular, placing two “north poles” on the American and Eurasian continental masses. As such, the latitude is not constant along the X direction at high Y values, and the longitude is not constant along the Y direction at high and low Y values. Figure 1 displays the latitude, longitude, and model depth values on this geographically non-regular grid.

Particle trajectory simulations

Release strategy and trajectory assembly

Using the Ocean Parcels Python software version 2.4.1.dev9^14,15, we released numerical particles in the 0 m and 15 m hourly velocity fields of the HYCOM simulation. We devised a release strategy in order to minimize sampling biases that may occur in horizontally divergent regions of the ocean. Using hourly velocity fields, the particles were advected with an integration time step of 20 minutes forward in time for 30 days, as well as backward in time for 30 days, from common release points. These points define a regular geographical grid at 1/4 degree intervals, from −180 degrees E to 179.75 degrees E in longitude, and from −70 degrees N to 75 degrees N in latitude. Forward and backward advections were conducted at both 0 m and 15 m depths, starting from 2014-01-31 01:00:00 and subsequently every 30 days until 2014-11-27 01:00:00, thus forming 11 releases. Particle positions were output and saved at the top of each hour. Next, we assembled the forward and backward trajectories to form 60-day long (or 1440-hour long) physically-consistent particle trajectories. As a consequence, after 30 days from the common date and time start of each release experiment, the assembled trajectories are such that all particles are simultaneously located on the regular 1/4 degree grid. As the particles were being advected by Ocean Parcels, they sampled the model depth and the time-varying model SSH and steric SSH. Particles velocities were subsequently calculated at every time step with a forward difference scheme using the clouddrift Python software¹⁶.

The number of particles successfully and originally released at 0 m and at 15 m for each of the 11 releases differs slightly because of the hypsometric changes of the model between these two depth levels, and because particles were retained upon release depending on their initial speed (Eulerian speed interpolated at release locations less than 1 × 10⁻¹² m s⁻¹ was used to detect and discard particles effectively released on land). More particles were successfully released at 0 m compared to 15 m because the ocean surface area is larger at 0 m compared to 15 m. We formed a consistent dataset within each release depth by keeping only the particles for which we found common identification numbers (id variable, as assigned by Ocean Parcels) across all 11 temporal sets. We identified 593,292 and 587,225 trajectories with common id numbers at 0 m and 15 m, respectively. All particle ids in the 15-m sets can also be found in the 0-m sets because numerical ids are assigned by Parcels based on the common release arrangement for the forward and backward experiments. As an example, because the effective release location of the particles is the middle step of the assembled trajectories (720 out of 1440), all particles with id 105763 are located at longitude −134.5 and latitude −64.75 at time step 720 (719 for 0-based indices) of their respective 1440-step trajectories.

Mask definition for grounded particles

We include with the particle trajectory data a determination of particle grounding, also known as beaching, when a particle is advected with infinitesimally decreasing speed towards a coastal boundary of the HYCOM model where the velocity is null. This occurred in our experiments because we did not implement any specific boundary condition along coastal grid cells to prevent grounding. Because the particle advections were conducted forward in time for 30 days and backward in time for 30 days from a regular grid on the sphere, particles could become grounded either at the beginning or at the end of the 60-day long assembled trajectories. We proceed to determine grounding as follows, based on a trial and error approach. For grounding occurring during the 30-day forward advection period, we look for the first period longer than 24 hours during which a particle speed is less than 0.003 m s⁻¹, equivalent to a displacement less than 259.2 m over 24 hours. We then take the first time step of that 24-hour period to be the grounding time. This time and all subsequent times are therefore flagged as grounded. For grounding occurring during the 30-day backward advection period, we proceed with the same algorithm but reversed in time: we look for the last period longer than 24 hours during which a particle speed is less than 0.003 m s⁻¹. We then take the last time step of that 24-hour period to be the grounding time. This time and all previous times are therefore flagged as grounded. Relatively few particle locations in the dataset are flagged as grounded following the scheme just described. Grounding occurs in about 5% of the trajectories for 1.4% of the total number of data points at 0 m, and in about 2.5% of the trajectories for 0.7% of the total number of data points at 15 m.

Data Records

The dataset version 1.0 has been deposited on Zenodo at https://doi.org/10.5281/zenodo.12796468¹. The data files are hosted on the Registry of Open Data on Amazon Web Services (AWS), for which the RODA page is https://registry.opendata.aws/hycom-global-drifters/. The files are stored in an AWS S3 bucket with Amazon Resource Name (ARN) arn:aws:s3:::hycom-global-drifters. A “README” file, directly accessible at https://hycom-global-drifters.s3.amazonaws.com/README.txt, provides descriptions of the files within the S3 bucket.

Eulerian files

The Eulerian fields of velocity components and total and steric SSH variables are originally available as NetCDF files through the OSiRIS/Globus infrastructure¹⁷. From this archive, 8759 individual files for velocity and 8759 individual files for SSH can be downloaded, totaling approximately 28TB. Each file corresponds to one hourly time step, from 2014-01-01 01:00:00 to 2014-12-31 23:00:00, with each variable provided on the non-regular (X, Y) grid. Because this archival structure is not amenable to facile analyses along the time dimension, we reformatted these data utilizing the capabilities of the zarr file format (zarr.readthedocs.io) which has been designed for the storage of chunked and compressed multidimensional data. This format is well-suited for cloud computing and storage systems such as the AWS S3 bucket used for this dataset.

Python scripts utilizing the xarray package (docs.xarray.dev) were used to divide and combine the individual NetCDF files from OSiRIS¹⁷ into 12 zarr stores for the velocity variables and 12 zarr stores for the SSH variables. Specifically, we combined the 8759 hourly time steps of the data into 11 consecutive sets of 720 hour or 30 days, and one final and twelfth set of 839 hours (34 days and one hour) to complete the time period. Zarr stores can hold large data arrays by breaking them up in smaller arrays, or chunks, which are contiguously stored in computer memory, and which can be handled independently when performing parallel computations. As such, the zarr stores were further processed using the Python rechunker package (rechunker.readthedocs.io). From the original chunking of the NetCDF files, we achieved in general (720, 1, 1, 9000) chunks for variables with dimensions (time, Depth, Y, X) and (720, 1, 9000) chunks for variables with dimensions (time, Y, X). As such, the individual zarr stores are optimized for analysis along the time and X dimensions.

Table 1 lists the characteristics and data variables of the Eulerian zarr stores with names hycom12-x-rechunked-corr.zarr and hycom12-ssh-x-rechunked-corr.zarr, with x from 1 to 12. For these, Table 2 lists the first and last date and time, the number of steps, and the 1-based indices.

Table 1 Characteristics and data variables of the Eulerian zarr stores.

Full size table

Table 2 Date and time characteristics of the Eulerian zarr stores.

Full size table

The combined size of the velocity and SSH zarr stores is approximately 5.6 TB, which corresponds to a 65% size reduction compared to the OSiRIS archive. Note that the NetCDF files for velocity on OSiRIS (2024) contain two additional variables, the zonal and meridional components of ocean bottom velocity, which are not included in the zarr archives in the AWS S3 bucket.

A single zarr store hycom12-bathy.zarr is provided for the model on the (X, Y) grid. Grid points located on land are indicated by Not-a-Number (NaN) values (Fig. 1c).

Lagrangian files

The Lagrangian dataset is constituted of 22 zarr stores. There are 11 stores for numerical particles advected at the surface of the model (0 m) in stores global_hycom_0m_step_x.zarr with x from 1 to 11, and 11 stores for numerical particles advected at 15 m depth in stores global_hycom_15m_step_x.zarr with x from 1 to 11. Each zarr store contains either 593,292 (0 m) or 587,225 (15 m) particle trajectory data at 1440 hourly time steps forming 60 days. The characteristics and data variables of the Lagrangian zarr stores are listed in Table 3. The start, median, and end dates and times of the 11 sets of trajectories at each depth are listed in Table 4.

Table 3 Characteristics and data variables of the Lagrangian zarr stores.

Full size table

Table 4 Start, middle, and end dates for the 11 Lagrangian sets of particle data at each depth.

Full size table

The zarr chunks for the data variables are (23301, 1440) except for grounding which are (93206, 1440) and time which are (9176, 45). As such, the data are generally optimized for analysis along the obs dimension, that is along trajectories (or equivalently time).

The coordinate variable id is a tagging number assigned to a particle by the Ocean Parcels software. The same id numbers are all found in each of the 11 sets at 0 m, and the same is true for the 11 sets at 15 m. There are 587,225 common ids between a set at 0 m and a set at 15 m.

Technical Validation

This HYCOM Eulerian dataset of 0 m and 15 m velocities has been previously compared to the same depth velocities of another global numerical model of comparable characteristics (the Massachusetts Institute of Technology general circulation model or MITgcm), as well as against in situ estimates of oceanic velocities from the drogued and undrogued drifters of the NOAA Global Drifter Program⁴. This three-way comparisons, two numerical global models and one in situ global dataset, cannot provide a strict and complete validation of the HYCOM velocity dataset for reasons too numerous to list here. Yet, the comparison of⁴ demonstrated the reliability and usefulness of the HYCOM Eulerian velocity dataset to investigate a variety of oceanic processes.

We now turn to validating the Lagrangian dataset. Figure 2 displays the count of Lagrangian particle locations, or particle density, per 1/4 degree bins, on average, for the 11 sets of trajectory, at 0 m and at 15 m depth. Because the particles within each set were released on a regular 1/4 degree grid, if the particles remained immobile over the 1440 time steps of the trajectories, the density would uniformly be 1440. It is not so because the particles were obviously advected by the simulated ocean currents of the model, yet the distribution of the spatial density values peaks at 1440 for both depth levels (distribution were calculated for grid cells between 70°S and 75.25°N). The relative spatial uniformity of these maps is a result of the Lagrangian release scheme and the assembly of forward and backward trajectories. The largest departures from the 1440-value density are found in known energetic oceanic regions with intense horizontal divergence and convergence, such as western boundary currents and along the Equator.

We now examine the particle velocities calculated from displacements by comparing the velocity variance at both depths, from the Lagrangian particles on one hand, to the velocity variance from the Eulerian velocity fields on the other hand (Fig. 3). The velocity variance is calculated from either all Lagrangian velocities or every other three Eulerian velocities, contained within 1/2 degree bins for the time period 2014-01-01 01:00:00 to 2014-03-02 00:00:00, which corresponds to the first sets of Lagrangian data at 0 m and 15 m. Within that figure, visual examination of panels a and d versus panels d and e shows how the Lagrangian velocities correspond closely to the Eulerian ones. The correlation and regression coefficients between the Lagrangian and Eulerian variance values are 0.95 and 0.78 at 0 m and 0.93 and 0.86 at 15 m (similar statistics are found for the other 60-day periods). The correspondence between Lagrangian and Eulerian velocities is further shown in panels c and f that display two-dimensional histograms of the Eulerian versus Lagrangian binned variance values at 0 m and 15 m, respectively. Velocity variance estimates from Eulerian and Lagrangian perspectives are not expected to strictly correspond, for kinematic reasons that are beyond the scope of this data descriptor paper¹⁸. However, the close relationships illustrated here constitute an attestation of the ability of the Lagrangian velocity dataset to capture the overall dynamics simulated by HYCOM.

Usage Notes

We have written a number of Jupyter notebooks in Python to demonstrate some possible usages of this dataset. The notebooks provided leverage the zarr nature of the archives stored in an AWS S3 bucket, which when combined with the capabilities of the xarray package, allows users to “lazily” open the datasets without the need to download the data to their local computer, until the data are required for some explicit computations.

Seven notebooks are provided in a single GitHub repository, the same repository containing the Python scripts that were used to generate and assemble this dataset, located at https://github.com/selipot/hycom-oceantrack, and archived on Zenodo¹⁹. The notebooks are succinctly described below.

tutorials/hycom_eulerian_tutorial.ipynb: In this notebook tutorial, one can learn how to open the Eulerian data from the AWS S3 bucket and how to conduct some simple analysis of the velocity and SSH data. This notebook also provides the necessary code to generate a figure similar to Fig. 1.
tutorials/hycom_lagrangian_tutorial.ipynb: In this notebook tutorial, one can learn how to access the Lagrangian data and how to conduct some simple analysis of the velocity and SSH data. This notebook also shows how to utilize the grounding mask for particles and subsequently use the clouddrift Python package²⁰ to convert the original two-dimensional Lagrangian dataset into a ragged array dataset for further Lagrangian analyses.
tutorials/generate_lagrangian_animation.ipynb: In this notebook, one can learn how to generate a simple animation of particle motions for 60-days.
tutorials/calculate_velocity_variance.ipynb: In this notebook, one can learn how to generate maps of Eulerian and Lagrangian kinetic energy such as the ones displayed in Fig. 3 of this paper.
paper-figures/figure1.ipynb: This notebook generates Fig. 1 of this paper.
paper-figures/figure2.ipynb: This notebook generates Fig. 2 of this paper.
paper-figures/figure3.ipynb: This notebook generates Fig. 3 of this paper.

Code availability

The general numerical code for HYCOM is available from the a consortium which maintains the source code at a GitHub repository at https://github.com/HYCOM.

For this HYCOM-OceanTrack dataset¹, we have created a dedicated and public GitHub repository at https://github.com/selipot/hycom-oceantrackand archived on Zenodo¹⁹. This repository contains the following:

• A suite of Python scripts used to create the cloud-optimized files containing the Eulerian fields of this dataset (See Data Records).

• An example Python script showing how the Lagrangian particle advection and field sampling of the model Eulerian fields were conducted using the Ocean Parcels software (See Methods).

• An example Python script showing how the cloud-optimized files containing the Lagrangian data of this dataset (See Methods) were created.

• A suite of Jupyter notebooks written in Python to illustrate possible usages of the dataset (See Usage Notes).

References

Elipot, S., Faigle, E., Arbic, B. K. & Shriver, J. F. HYCOM-OceanTrack: Integrated HYCOM Eulerian Fields and Lagrangian Trajectories Dataset, v1.0, Zenodo https://doi.org/10.5281/zenodo.12796468 (2024).
Elipot, S. Measuring global mean sea level changes with surface drifting buoys. Geophysical Research Letters 47, e2020GL091078, https://doi.org/10.1029/2020GL091078 (2020).
Article ADS Google Scholar
Arbic, B. K. et al. A primer on global internal tide and internal gravity wave continuum modeling in hycom and mitgcm. New frontiers in operational oceanography https://doi.org/10.17125/gov2018.ch13 (2018).
Arbic, B. K. et al. Near–Surface Oceanic Kinetic Energy Distributions From Drifter Observations and Numerical Models. J. Geophys. Res. Oceans127, https://doi.org/10.1029/2022JC018551 (2022).
Bleck, R. An oceanic general circulation model framed in hybrid isopycnic-cartesian coordinates. Ocean modelling 4, 55–88, https://doi.org/10.1016/S1463-5003(01)00012-9 (2002).
Article ADS Google Scholar
Lumpkin, R. & Pazos, M.Measuring surface currents with Surface Velocity Program drifters: the instrument, its data, and some recent results, 39-67 (Cambridge University Press, 2007).
Laurindo, L. C., Mariano, A. J. & Lumpkin, R. An improved near-surface velocity climatology for the global ocean from drifter observations. Deep Sea Research Part I: Oceanographic Research Papers 124, 73–92, https://doi.org/10.1016/j.dsr.2017.04.009 (2017).
Article ADS Google Scholar
Large, W. G., McWilliams, J. C. & Doney, S. C. Oceanic vertical mixing: a review and a model with a nonlocal boundary layer parameterization. Rev. Geophys. 32, 363–404, https://doi.org/10.1029/94RG01872 (1994).
Article ADS Google Scholar
Hogan, T. F. et al. The Navy global environmental model. Oceanography 27, 116–125 (2014).
Article Google Scholar
Kara, A. B., Rochford, P. A. & Hurlburt, H. E. Efficient and accurate bulk parameterizations of air–sea fluxes for use in general circulation models. J. Atmos. Oceanic Technol. 17, 1421–1438, https://doi.org/10.1175/1520-0426(2000)017_1421:EAABPO_2.0.CO;2 (2000).
Article ADS Google Scholar
Ray, R. Ocean self-attraction and loading in numerical tidal models. Marine Geodesy 21, 181–192, https://doi.org/10.1080/01490419809388134 (1998).
Article ADS Google Scholar
Egbert, G. D. & Erofeeva, S. Y. Efficient Inverse Modeling of Barotropic Ocean Tides. J. Atmos. Oceanic Technol. 19, 183–204, https://doi.org/10.1175/1520-0426(2002)019_0183:EIMOBO_2.0.CO;2 (2002).
Article ADS Google Scholar
Jayne, S. R. & St. Laurent, L. C. Parameterizing tidal dissipation over rough topography. Geophys. Res. Lett. 28, 811–814, https://doi.org/10.1029/2000GL012044 (2001).
Article ADS Google Scholar
Lange, M. & van Sebille, E. Parcels v0.9: prototyping a Lagrangian ocean analysis framework for the petascale age. Geoscientific Model Development 10, 4175–4186, https://doi.org/10.5194/gmd-10-4175-2017 (2017).
Article ADS Google Scholar
Delandmeter, P. & van Sebille, E. The Parcels v2.0 Lagrangian framework: new field interpolation schemes. Geoscientific Model Development 12, 3571–3584, https://doi.org/10.5194/gmd-12-3571-2019 (2019).
Article ADS Google Scholar
Elipot, S., Miron, P., Curcic, M., Santana, K. & Lumpkin, R. Clouddrift: a Python package to accelerate the use of Lagrangian data for atmospheric, oceanic, and climate sciences. Journal of Open Source Software 9, 6742, https://doi.org/10.21105/joss.06742 (2024).
Article Google Scholar
OSiRIS. Non-assim HYCOM 1/25 SSH and Surface velocities for 2014, https://app.globus.org/file-manager?origin_id=e658b2aa-c5b5-455c-8bd9-49d361ac0bda&origin_path=%2F (2024).
Davis, R. E. On relating Eulerian and Lagrangian velocity statistics: single particles in homogeneous flows. Journal of fluid mechanics 114, 1–26 (1982).
Article ADS MathSciNet Google Scholar
Elipot, S. & Faigle, E. selipot/hycom-oceantrack: v1.0.2, Zenodo https://doi.org/10.5281/zenodo.13327379 (2024).
Elipot, S., Miron, P., Curcic, M., Santana, K. & Lumpkin, R. Cloud-drift/clouddrift: v0.40.0, Zenodo https://doi.org/10.5281/zenodo.12800936 (2024).

Download references

Acknowledgements

Shane Elipot and Eli Faigle were supported by NSF award number 1851166. Shane Elipot was additionally supported by a Provost Research Award from the University of Miami. Brian K. Arbic was supported by the Office of Naval Research (ONR) NISKINE Grant N00014-18-1-2544. Jay F. Shriver acknowledges funding support from Office of Naval Research (ONR) Grant N0001423WX01413, which is a component of the Global Internal Waves project of the National Oceanographic Partnership Program (https://nopp-giw.ucsd.edu/). This is NRL contribution NRL/JA/7320-24-6441 and has been approved for public release.

Author information

Authors and Affiliations

Rosenstiel School of Marine, Atmospheric, and Earth Science, University of Miami, Miami, FL 33149, USA
Shane Elipot & Eli Faigle
Department of Earth and Environmental Sciences, University of Michigan, Ann Arbor, MI 48109, USA
Brian K. Arbic
Ocean Sciences Division, Naval Research Laboratory, Stennis Space Center, MS 39529, Mississippi, USA
Jay F. Shriver

Authors

Shane Elipot
View author publications
Search author on:PubMed Google Scholar
Eli Faigle
View author publications
Search author on:PubMed Google Scholar
Brian K. Arbic
View author publications
Search author on:PubMed Google Scholar
Jay F. Shriver
View author publications
Search author on:PubMed Google Scholar

Contributions

S.E. devised the dataset, wrote and edited the manuscript, conducted the generation of zarr stores for the Eulerian dataset, conducted the Lagrangian seeding experiment with the Ocean Parcels Software on the Triton supercomputer of the University of Miami, conducted the generation of the zarr stores for the Lagrangian dataset, and co-wrote the Python notebooks listed in Usage Notes. E.F. edited the manuscript, conducted the analyses for the technical validation and produced Figs. 2 and 3, and co-wrote the Python notebooks described in Usage Notes. B.A. edited the manuscript. J.S. led the project of conducting the original HYCOM simulation and edited the manuscript.

Corresponding author

Correspondence to Shane Elipot.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Elipot, S., Faigle, E., Arbic, B.K. et al. An integrated dataset of near-surface Eulerian fields and Lagrangian trajectories from an ocean model. Sci Data 11, 943 (2024). https://doi.org/10.1038/s41597-024-03813-z

Download citation

Received: 15 May 2024
Accepted: 20 August 2024
Published: 29 August 2024
Version of record: 29 August 2024
DOI: https://doi.org/10.1038/s41597-024-03813-z