Articles | Volume 17, issue 1
https://doi.org/10.5194/ms-17-141-2026
https://doi.org/10.5194/ms-17-141-2026
Research article
 | 
26 Feb 2026
Research article |  | 26 Feb 2026

A novel neural network model for rolling linear guide pair optimization design

Chenghao Song, Weiqi Du, Shuxin Li, and Junjun Han
Abstract

This paper addresses the inefficiency of conventional design methods for rolling linear guide pairs, which rely on finite-element analysis and large data demands. This study pioneers a few-shot learning framework based on meta-learning, employing a model-agnostic meta-learning strategy to train an inverted-bottleneck residual fully connected network. The network achieves high prediction accuracy (R2>0.91) with only 126 samples. An integrated parametric platform reduces modelling time from 50 to 2–3 min, significantly improving efficiency. Optimization via sequential least squares quadratic programming demonstrates a 57 % reduction in vertical deformation. However, the current work focuses on static performance optimization, leaving dynamic aspects for future research. This approach offers an efficient and data-effective paradigm for precision mechanical component design.

Share
1 Introduction

The rolling linear guide is a core transmission component in precision mechanical equipment. Its structural performance directly determines the positioning accuracy and service life of high-end equipment, such as computer numerical control (CNC) machine tools and industrial robots (Yang et al., 2020; Wang et al., 2018; Zha et al., 2024; Quan and Zhao, 2024). The traditional guide design method relies much on empirical trial and error (Phuyal et al., 2020; Sahoo and Lo, 2022; Liu et al., 2023; Staroszyk et al., 2024). Designers repeatedly adjust geometric parameters and conduct the computer-aided engineering (CAE) analysis to improve mechanical performance, which is quite time-consuming (Zou and Wang, 2015; Tong et al., 2020). Therefore, a rapid and effective design method should be developed for the structural optimization of a rolling linear guide.

Several studies have been conducted for the investigation of rolling linear guide pairs. For instance, Shimizu (1998) and Ohta and Hayashi (2000) were dedicated to developing rigid models for rolling linear guides based on Hertz contact theory. Cheng (2021) introduced a novel stiffness matrix for roller linear guides, investigating the relationship between non-uniform load distribution and contact states. Xu et al. (2023) developed an analytical dynamic model for flexible linear guides, investigating the vibration behaviour of carriages under external periodic excitations in multiple directions. Compared to numerical analysis methods, the finite-element method (FEM) provides higher computational efficiency and superior adaptability. For example, Sun et al. (2015) established a high-fidelity finite-element model accounting for ball-raceway contact. Wei et al. (2022) used FEM to characterize the impact of structural parameters on guide pair mechanical performance. Tong et al. (2019) employed FEM to develop a 5-degree-of-freedom model for calculating the fully occupied stiffness matrix of linear ball guides. Up until now, few studies have paid attention to the structural optimization of rolling linear guides. In addition, structural optimization design with the results of FEM analysis is quite time-consuming and labour-intensive because FEM analysis cannot directly give the explicit relationships between structure and performance.

Traditional optimization algorithms like PSO and GA are highly effective in handling complex problems. However, they require thousands of computations on high-precision models, and so the cycle is very time-consuming. For example, Tong et al. (2020) employed a particle swarm optimization algorithm, using 2000 particles and running for 150 iterations to optimize the design of the linear guide. Wang et al. (2016) utilized finite-element analysis and a genetic algorithm model, conducting 1580 calculations to optimize the structure of a wind turbine blade. In recent years, machine learning techniques (Ramu et al., 2022) have offered another option for designers. For instance, Kazem et al. (2025) proposed a RAGN-R (surrogate multi-subject ensemble machine learning method) for estimating the mechanical properties of advanced materials. This method can directly establish the connection between structure and performance, enabling highly accurate predictions. Up to now, there have been many types of surrogate models. For instance, Fishwick (1989) used a multi-layer neural network model as a simulation model for a basic ballistic model. Jiang et al. (2025) proposed a multi-fidelity fully connected neural network (FCNN) model to optimize the fin structure of rocket projectiles. Roy et al. (2019) developed a support vector regression (SVR) model for the reliability estimation of engineering structure. Nagendra et al. (2005) developed a response surface model based on the optimal noise, vibration, and harshness (NVH) response sensitivity data for optimizing tailored rolled blank (TRB) components. Machine learning, particularly surrogate models, provides a novel solution for the optimization of rolling linear guide pairs. It directly maps the complex nonlinear relationships between geometric parameters (contact angle and curvature ratio) and mechanical properties (including stiffness and stress). It reduces FEM time from hours to minutes, saving significant iterative computation costs for optimization design.

However, surrogate models typically demand a large number of samples to construct datasets, resulting in substantial computational and workload burdens. To enhance efficiency, it is necessary to reduce sample size requirements without losing the accuracy. Therefore, few-shot learning methods (Huisman et al., 2021) have been developed, possessing strong generalization capabilities, minimal data requirements, and high learning efficiency. Su et al. (2022) developed a novel DRHRML (data reconstruction hierarchical recurrent meta-learning) method for few-shot learning in intelligent bearing fault detection. Liu et al. (2016) proposed a support vector machine (SVM) gait controller trained with limited samples for stable bipedal robotic walking. In the field of few-shot learning, model-agnostic meta-learning (MAML) learns a highly task-sensitive model initialization, enabling rapid adaptation to new tasks with minimal examples (Fallah et al., 2020). To validate the algorithm selection rationale in this study, a comparative analysis was conducted between MAML and other algorithms. Compared to the Reptile algorithm (Nichol and Schulman, 2018), which employs first-order approximations for improved computational efficiency, its predictive accuracy may be inadequate for complex nonlinear regression problems. In contrast, meta-SGD (Li et al., 2017) introduces learnable learning rates, which increases the risk of overfitting when handling extremely small datasets. Transfer learning requires a large and relevant source task, which is often difficult to find in mechanical engineering. Therefore, this study selects MAML for application in the optimization design of rolling linear guide pairs. With only 126 sample sets, it achieves a prediction accuracy with R2 exceeding 0.91.

Rolling linear guide pairs serve as core transmission components in precision mechanical equipment. The design and performance metrics must satisfy the stringent requirements of high-end application scenarios. The details are outlined below.

  1. High positioning accuracy and motion smoothness. High stiffness and low deformation are core requirements for ensuring precision.

  2. High load capacity and long service life. Controlling the maximum contact stress is the key to ensuring service life.

  3. High reliability and stability. The design must ensure consistent and predictable performance under various working conditions, preventing performance degradation caused by improper parameter matching.

This paper proposes a meta-learning-based inverted-bottleneck residual fully connected network model. The surrogate models for guide pair stiffness and maximum contact stress were established, and the SLSQP optimization algorithm was employed to solve structural parameter optimization. In addition, this study develops a parametric design platform for guide pairs, which enhances the efficiency of dataset construction through parametric modelling and automated FEM.

The structure of this paper is organized as follows: Sect. 2 establishes the response surface model for predicting the mechanical performance of guide pairs, detailing the few-shot learning process. Section 3 presents the developed parametric modelling platform and couples it with automated FEM, establishing an integrated design–simulation workflow. In Sect. 4, comprehensive analysis of the results is conducted. Finally, Sect. 5 summarizes the study.

2 Meta-learning-based inverted-bottleneck residual fully connected network model

Few-shot learning is a machine learning paradigm that achieves efficient learning with limited training samples. This section integrates the inverted-bottleneck structure with a fully connected network and employs the model-agnostic meta-learning (MAML) algorithm to train this model, achieving rapid cross-task adaptation capability.

2.1 Enhanced Latin hypercube sampling for input sample set construction

To ensure the coverage of the parameter space, a sampling strategy is introduced in this section. The Latin hypercube sampling (LHS) method is employed to ensure a uniform and representative distribution of input samples within the parameter space. In addition, a Euclidean distance constraint is introduced to prevent samples from being excessively close to each other.

Before conducting LHS, the value ranges of all parameters must be normalized to the interval [0,1]. For a raw parameter xj [aj,bj], the normalized value xj is calculated as follows:

(1) x j = x j - a j b j - a j .

Specifically, the value range of each dimension is uniformly divided into N equal subintervals. For each dimension j, a random permutation πj is generated, where πj(i) denotes the subinterval number assigned to the i-th sample in the j-th dimension. Based on this, the specific value Xij for each sample point can be obtained through the following calculation:

(2) X i j = π j i - u i j N ,

where uij is a random number sampled from the uniform distribution U(0,1). Following LHS sampling, the parameters must undergo de-normalization processing, linearly mapping Xij from [0,1] to the actual parameter range [aj,bj] for each variable:

(3) S i j = a j + b j - a j X i j ,

where Sij denotes the actual parameter values output by the final Latin hypercube sampling. When adding new samples, a new sample point must not lie too close to any existing samples to avoid spatial clustering and data redundancy. Therefore, when adding new samples, a Euclidean distance (Parhizkar, 2015) is introduced to ensure sample dispersion:

(4) D g p , X = min k = 1 , N p - x k 2 > δ g ,

where p denotes the new candidate sample, X represents the set of existing samples, N indicates the sample count, 2 is the Euclidean norm, and δg=0.5 denotes the global distance threshold. The global threshold δg=0.5 was set based on empirical rules commonly used in experimental design. This setting aims to ensure a sufficient minimum Euclidean distance between sample points in the normalized parameter space. It effectively prevents sample clustering while guaranteeing spatial-filling properties. However, it should be noted that, for parameters with smaller value ranges (such as the ball-positioning angle θ and the interference fit δ in Sect. 4.2, which have small value ranges), a threshold of 0.5 is evidently excessive and inappropriate. Thus, these parameters should be separately assigned a threshold of δ=0.05.

2.2 Inverted-bottleneck residual fully connected network model

Deep neural networks (DNNs) are a significant branch of artificial neural networks (Sze et al., 2017). This section utilizes a fully connected deep neural network (FCNN). By integrating an inverted-bottleneck structure and residual connections, a novel deep residual multi-layer perceptron (MLP) architecture is developed. The cascade architecture – comprising an input layer, multiple residual blocks, and an output layer – effectively captures nonlinear mappings between rolling linear guide design parameters and mechanical performance.

The structural design of neural networks is a pivotal factor in determining their performance. In traditional bottleneck architectures (e.g. ResNet's Bottleneck), a “wide–narrow–wide” pattern is typically adopted. High-dimensional inputs are compressed into a low-dimensional space for feature extraction and then are expanded back to the original high-dimensional space. In contrast, inverted-bottleneck structures (e.g. MobileNetV2) employ the opposite “narrow–wide–narrow” paradigm: low-dimensional inputs are expanded into a high-dimensional space for feature extraction and then return to low dimensionality (Sandler et al., 2018).

Figure 1 illustrates the dimensional transformation schematic of the bottleneck structure and the inverted-bottleneck structure. The inverted-bottleneck structure overcomes the limitations of feature extraction in low-dimensional spaces. This enables the model to possess stronger representational capabilities and capture higher-level features more readily.

https://ms.copernicus.org/articles/17/141/2026/ms-17-141-2026-f01

Figure 1Comparison of dimension transformation with and without inverted-bottleneck structure.

Download

In inverted-bottleneck structures, the ReLU activation function is typically positioned after the dimensionality expansion layer and preceding the compression layer. Its primary role is to introduce nonlinear transformations, enabling the model to learn more complex functions. Without the ReLU function, multiple linear transformations would be equivalent to a single linear transformation, severely limiting the model's representational capacity. Figure 2 depicts the inverted-bottleneck residual fully connected network model (IB-ResFCN) proposed in this study. Table 1 specifies the detailed parameters of the model, which consists of one input layer, six residual blocks, and two regression output layers.

https://ms.copernicus.org/articles/17/141/2026/ms-17-141-2026-f02

Figure 2Schematic diagram of the inverted-bottleneck residual fully connected network architecture.

Download

Table 1Layer parameters of the fully connected neural network.

Download Print Version | Download XLSX

Residual connections directly link each residual block and effectively prevent gradient vanishing in MLP. Their mathematical expression is as follows:

(5) y = x + i = 1 6 F ( x ) ,

where y denotes the final output, x represents the input, and F(x) signifies the transformation applied to the current input by the network layers.

Residual blocks create a shortcut path, enabling gradients to rapidly back propagate. The first three residual blocks gradually extract higher-dimensional features through progressively increasing dimensions, thereby enhancing the model's expressive capability. The last three residual blocks employ the inverted-bottleneck structure to achieve dimensional expansion, thereby refining the features.

2.3 MAML for supervised learning

Based on the IB-ResFCN model, meta-learning is applied to enhance prediction accuracy. Meta-learning includes three stages: meta-training, meta-validation, and meta-testing. According to the stages, the dataset is divided into the ratio of training set (Dtrain) : validation set (Dval) : test set (Dtest)=6:2:2.

Based on distinct task types, meta-learning can be categorized into supervised meta-learning (SML) and meta-reinforcement learning (Meta-RL). Few-shot learning has been extensively researched in the supervised-task domain. Its objective (Finn et al., 2017) is to learn a new function from a limited number of input–output pairs using prior data from similar tasks through meta-learning. This paper employs the model-agnostic meta-learning (MAML) algorithm applied to SML.

In SML, the training set is divided into T meta-tasks, each including K samples. These samples are divided into support and query sets at a 7:3 ratio. Figure 3 illustrates the classification method for the training set. The support set is used for the inner loop, and the query set is used for the outer loop.

https://ms.copernicus.org/articles/17/141/2026/ms-17-141-2026-f03

Figure 3Schematic diagram of training-set partitioning.

Download

As shown in Fig. 4, at the beginning of the meta-training, three critical parameters are defined: (1) initial meta-parameters θ0, (2) inner-loop learning rate α and (3) outer-loop learning rate η. Then multiple epochs of meta-training will be conducted. In each epoch, the training set is divided into N meta-tasks: Ti (i=1,2,...,n)∈Dtrain. Each task is assigned a unique meta-learner Mi and its corresponding initial parameters θi. By using the corresponding task Ti, task-specific fast weights θ1 are computed. Subsequently, the gradient and loss 𝓛 for the current round are computed. Then, the meta-learner for the next round Mi+1 and the parameters θi+1 are updated. We repeat the above process until all rounds of meta-training are completed. Finally, the meta-initialization parameters θn are acquired. Table 2 shows the parameter settings for MAML training.

https://ms.copernicus.org/articles/17/141/2026/ms-17-141-2026-f04

Figure 4Meta-parameter optimization framework for meta-training process.

Download

Table 2MAML parameter settings.

Download Print Version | Download XLSX

Considering only a single gradient update step, this section explains how θi is updated to θi+1 in task Ti+1. Based on θi and the support set data, θi+1 is computed via gradient descent.

(6) θ i = θ i - α θ L S θ i

In the above, S represents the loss function of the support set. The purpose of the inner loop is to rapidly adapt the model parameters of the support set. In the query set, the loss LQ(θi+1) is computed using the fast weights θi+1:

(7) θ L Q θ i + 1 = δ L Q θ i + 1 δ θ i + 1 δ θ i + 1 δ θ i .

Through the chain rule, the relation of θi+1 in θi is incorporated into the gradient computation. Based on this, we update θi to θi+1:

(8) θ i + 1 = θ i - η θ L Q θ i + 1 .

The aim of the outer loop is to optimize θi, ensuring that the generated fast weights θi+1 achieve optimal performance in the query set. After N task iterations, the final meta-initial parameters θn can be calculated.

(9) min θ i = 1 n L Q θ i = i = 1 n L Q θ i - 1 - α θ L S θ i - 1

(10) θ n = θ 0 η i = 1 n θ L Q θ I

SML can address both classification and regression problems. In regression analysis, a loss function (θ) is designed to measure the error between predicted outputs and true values. The (θ) for supervised regression is the mean squared error (MSE), formulated as follows:

(11) L θ = x ( j ) , y ( j ) T i | | f θ x ( i ) y ( j ) | | 2 2 ,

where x(j) and y(j) denote the input value and the true value of a sample within Ti. Algorithm 1 is the MAML algorithm for supervised learning. https://ms.copernicus.org/articles/17/141/2026/ms-17-141-2026-g01

2.4 Fine-tuning phase

The preceding section explained the algorithm for SML and the process of obtaining θn. However, θn cannot serve as a new starting point for rapid task adaptation of the model. Due to insufficient training samples per meta-task during meta-training, the model fails to adequately learn task-specific patterns. Thus, after meta-training, fine-tuning is performed using the entirety of the training-set samples. This process ensures higher model-learning efficiency. We perform K training iterations on θn to update and obtain θk:

(12) θ k = θ k - 1 α T θ L train θ k - 1 ,

where αT denotes the learning rate during the fine-tuning phase, and train represents the loss function of the training set. The value of αT is set to 1/10 of η since θn is already situated within a well-performing region. A smaller learning rate ensures stable and precise convergence, which prevents missing of the optimal solution or divergence due to excessively large step sizes.

The parameter θk is integrated into the model, and the validation loss is computed using validation set samples and Eq. (11). The training process is configured for 100 rounds, with the θk parameters obtained in each round being evaluated for their loss in the validation set. If the loss decreases, the parameters θk obtained in that training round are saved as θtest. If the loss shows no improvement, the early-stopping mechanism is triggered. When the validation loss ceases to decrease for 10 consecutive training epochs, the training is halted, and the process reverts to the model parameters that achieved the minimum validation loss. The θtest is integrated into the model, serving as a new starting point for rapid task adaptation.

https://ms.copernicus.org/articles/17/141/2026/ms-17-141-2026-f05

Figure 5Overall workflow of meta-learning.

Download

Figure 5 illustrates the overall workflow of meta-learning, primarily divided into the training phase and the evaluation phase: (1) first, establish the dataset using LHS; (2) separate the dataset based on distinct functional purposes; (3) start training the IB-ResFCN to obtain θtest; and (4) assign the learned θtest to the IB-ResFCN and evaluate it based on the test set.

3 Development of a parametric assisted design platform

Dataset construction requires modelling and finite-element analysis (FEA) for each sample. Sequential manual modelling for different samples entails a significant workload and low efficiency. Currently, parametric modelling is widely adopted, effectively reducing the time required to generate models. However, existing research rarely couples parametric modelling with FEA software for rapid modelling and analysis. This study develops an integrated platform coupling parametric modelling with FEA, significantly enhancing the efficiency of dataset construction.

The platform was developed within the Visual Studio development environment. Through API integration, the platform achieves direct invocation and collaborative control of SolidWorks and Abaqus, demonstrating robust system integration capabilities and cross-platform compatibility. The platform's overall architecture follows a “three-tier coupling, closed-loop-driven” technical approach. It comprises three core functional modules: the interaction layer, the logic layer, and the execution layer. The platform seamlessly integrates user input, data processing, and simulation computation, forming a complete closed-loop design–simulation cycle.

The interaction layer serves as the human–machine interface of the platform, implementing a graphical interface based on C# and WinForm technology. Users can intuitively perform parameter configuration and process control. As illustrated in Fig. 6, the overall interface consists of a parameter input zone, a structure preview zone, and a data management zone. The parameter input zone includes 25 critical fundamental parameters such as ball radius, guide rail dimensions, and mesh size. The structural schematic is embedded centrally, displaying key geometric features and dimension annotations. The data management zone enables custom file path configuration, automated saving strategies, and systematic organization of simulation results.

https://ms.copernicus.org/articles/17/141/2026/ms-17-141-2026-f06

Figure 6Schematic diagram of the parametric-platform interface. Dimensions in mm, loads in MPa, and mesh units in mm2. The structural schematic references Structural Diagram 2 as an example.

Download

Building upon parameter acquisition in the interaction layer, the logic layer acts as an intermediary bridge. It maps fundamental user inputs (e.g. ball diameter d, ball-positioning angle θ, initial contact angle β) into structural feature quantities required for modelling (guide rail dimensions, ball positions, assembly relationships, etc.). This mapping relationship can be mathematically formulated as follows:

(13) S = f ( P ) ,

where P= (d, θ, β,...) denotes the input parameter set; S represents the structural feature set required for modelling; and the mapping function f () is implemented through function fitting, lookup tables, or polynomial interpolation. To more clearly demonstrate the mapping relationship between S and P, the linear guide shown in Fig. 7 is taken as an example, with Table 3 providing specific mappings of key parameters. Through the mapping mechanism in the logic layer, automated modelling of structural-logic construction algorithms driven by dimensional chains is achieved.

https://ms.copernicus.org/articles/17/141/2026/ms-17-141-2026-f07

Figure 7Parameters for parametric modelling of the linear guide.

Download

Table 3Mapping relationship between S and P for the linear guide.

Download Print Version | Download XLSX

The execution layer covers two core processes: 3D modelling and FEA. Upon model completion, we select the FEM option in Fig. 6 to initiate FEA. As shown in Fig. 8, the STEP file exported from SolidWorks retains complete assembly information of the guide rail mechanism. Upon reading the file, the platform automatically imports the model into Abaqus. Subsequently, the pre-configured Python script file (py.) is automatically retrieved. This file contains command streams for preprocessing tasks and job submission modules. The platform automatically controls Abaqus to execute the complete workflow, including defining properties, meshing, setting boundary conditions, and so on.

https://ms.copernicus.org/articles/17/141/2026/ms-17-141-2026-f08

Figure 8Automated FEA workflow diagram.

Download

Upon completion of the solving process, the platform automatically archives both CAE simulation files and ODB result files. This integrated system achieves rapid modelling and automated FEA, substantially enhancing the efficiency of dataset construction. As shown in Table 4, an experiment was conducted to evaluate the efficiency and reliability of the parametric platform. This study compared the traditional design method with the parametric-platform design method.

Table 4Performance comparison between traditional design and parametric platform.

Download Print Version | Download XLSX

Initially, 100 sets of guide samples were randomly generated for modelling and simulation on the platform. Subsequently, 10 sets of models were randomly selected from the 100 sets of 3D models for manual modelling. The results indicate that, although a very small number of models failed due to parameter interference, the modelling success rate reached 98 %. With high reliability, the platform boosts single-iteration efficiency dozens of times over traditional methods.

4 Result analysis

4.1 Validation of FEA results vs. experimental results

In FEA, many components of the guide rail are non-load-bearing; it is thus necessary to simplify the overall model. The rolling linear guide pair is divided into several independent structural units, with each representing a repetitive structure of the guide assembly. Each unit includes three parts: the guide rail block, the slider block, and the upper- and lower-ball sets.

In the preprocessing module, the elastic modulus and Poisson's ratio for the ball–groove contact are set to E=2.06×105 MPa ν=0.3. For mesh generation, the multi-zone partitioning method is employed to obtain hexahedral meshes, with mesh refinement being applied to the contact regions. Surface-to-surface contact is adopted to ensure proper interaction at the ball–groove interface. A load is applied to the top of the slider, while full constraints are imposed on the bottom of the guide rail.

Vertical stiffness is a critical metric for assessing guide rail pair performance. It quantifies vertical deformation under vertical loads, defined by the following:

(14) K v = P Δ h v ,

where P denotes the external load, Kv represents the approximate vertical stiffness, and Δhv indicates the deformation magnitude in the vertical direction. For the FEM of an independent structural unit, the schematic diagram of its simulation model is shown in Fig. 9, with the load distribution given by P0=P/n. β denotes the contact angle, defined by the oblique line connecting the curvature centres of the guide groove, the ball, and the slider. Since the groove radii of the guide rail and slider are identical, their vertical deformations are approximately equal. The total vertical deformation is given by Δhv=Δh1+Δh2.

https://ms.copernicus.org/articles/17/141/2026/ms-17-141-2026-f09

Figure 9Schematic diagram of the simulation model.

Download

Through this method, the vertical deformation of the guide rail pair can be obtained. Substituting this value into Eq. (14) yields the vertical stiffness. However, inherent errors in FEA preclude achieving exact vertical-stiffness values, yielding only approximate results. To verify the accuracy of FEA results. This study takes the D45 guide pair as the subject, conducting static experiments to compare the Δhv from FEA.

The full-parameter test bench for rolling linear guide pairs is illustrated in Fig. 10, primarily composed of a measurement system and a control system, a hydraulic system, and a mechanical system. Figure 10b displays the mechanical system comprising both loading and measurement components. The measurement components primarily include a contact displacement sensor (linearity error ≤0.5 % F.S), an S-shaped tension–compression load cell (linearity error ≤0.5 % F.S), signal amplifiers, and communication modules. The contact displacement sensor converts internal movement caused by operational accuracy variations into electrical signals. These signals are then transmitted to the control system via an amplifier, a communication module, and RS232 serial communication. The S-shaped tension–compression load cell generates resistance signals in response to external force variations, which are converted into digital communication signals by a signal amplifier and then are transmitted to the control system via RS232 serial communication. Once the signals enter the control system, the acquisition program collects, displays, and saves them.

https://ms.copernicus.org/articles/17/141/2026/ms-17-141-2026-f10

Figure 10Full-parameter test bench for rolling linear guide pairs. Panel (a) shows the test bench, and panel (b) shows the mechanical system.

Download

The testing procedure is as follows: install the rolling linear guide on the test bench. Apply load through the hydraulic system, increasing stepwise from 0 to 20 kN in 5 kN increments. First, obtain the force and displacement vectors at each node under the initial load condition. Next, calculate the variation in force and displacement at each measurement point. This variation is relative to the initial load condition during each force-holding period. After averaging the data, the resulting values represent the force and displacement for that time period. Finally, apply the least squares method to fit the force–displacement curves across all time periods, thereby deriving the vertical displacement.

https://ms.copernicus.org/articles/17/141/2026/ms-17-141-2026-f11

Figure 11Comparative results of experimental data and FEM.

Download

The obtained experimental results are compared with FEM, with the comparative outcomes being illustrated in Fig. 11. To quantitatively evaluate the prediction accuracy of the finite-element model, multiple statistical error metrics were calculated based on the aforementioned data, as shown in Table 5.

Table 5Statistical errors of finite-element model predictions.

Download Print Version | Download XLSX

Results demonstrate a high degree of consistency between FEA and experimental outcomes, confirming the accuracy and validity of the finite-element model. This validated model can be reliably utilized for subsequent research tasks.

4.2 Influence of structural parameters on mechanical performance

Using the finite-element model in Fig. 8, we analyse the influence of different parameters on the mechanical performance of the guide rail pair. The parameters and their value ranges are as follows:

  1. initial contact angle β (30–60°) – complementary angle of the load direction relative to the raceway-transmitted resultant force on rolling elements;

  2. curvature ratio f (0.5–0.6) – the ratio of the groove radius of the guide pair to the ball diameter;

  3. ball-positioning angle θ (27–37°) – angle between the upper-left/lower-right raceway centre line and the lower-left/upper-right raceway centre line;

  4. interference fit δ (5–50 µm) – the value of interference between the balls and the grooves of the guide rail and slider.

Taking vertical deformation as the research object, FEA is conducted under loads of 5–20 kN, with β, f, θ, and δ as single variables. The influence of contact parameters on the vertical deformation of guide rail pairs is demonstrated, as detailed in Fig. 12.

https://ms.copernicus.org/articles/17/141/2026/ms-17-141-2026-f12

Figure 12Influence patterns of contact parameters on guide rail pairs.

Download

Analysis reveals that variations in contact parameters profoundly alter the mechanical performance of rolling linear guide pairs. Furthermore, this study systematically quantified the degree of influence of various parameters on the Δhv of the guide pair. As shown in Fig. 13, the samples were collected based on LHS. The multi-factor global sensitivity analysis method (Lamboni and Kucherenko, 2021) was employed, systematically analysing the main effects and total effects of each parameter.

https://ms.copernicus.org/articles/17/141/2026/ms-17-141-2026-f13

Figure 13Multi-factor global sensitivity analysis.

Download

The results indicate that the interference fit is the dominant parameter affecting vertical displacement (0.839). The ball-positioning angle and curvature ratio also exert moderate influences, while the contact angle demonstrates a minimal effect (<0.1). Furthermore, the total effects of all four parameters consistently exceed their respective main effects. This finding reveals significant interaction effects among the parameters, where their synergistic interplay collectively governs the vertical displacement. Therefore, the geometric parameters of the contact zone can serve as inputs, with vertical deformation and maximum contact stress as outputs. A neural network with four inputs and one output is constructed as shown in Fig. 14. This network is trained to predict mechanical performance.

https://ms.copernicus.org/articles/17/141/2026/ms-17-141-2026-f14

Figure 14Neural network inputs and outputs for parameter optimization.

Download

The prediction models for vertical deformation and contact stress share identical architectures, differing only in terms of parameters such as sample sets, data partitioning, and meta-learning rates.

4.3 Comparative analysis of network models

This section validates the structural rationality of the IB-ResFCN model and its prediction accuracy. Comparative validation experiments involving multiple models were conducted.

Taking the vertical deformation prediction model as an example, comparative experiments were designed for three key aspects: models with and without meta-learning, with and without inverted-bottleneck structures, and with and without residual connections. Experiments compared the proposed model against a conventional FCNN. Mean absolute error (MAE), root mean square error (RMSE), and coefficient of determination (R2) serve as evaluation metrics.

Figure 15 presents the results of comparative experiments. To ensure statistical reliability, this study adopts the 5-fold cross-validation method. The complete dataset was randomly partitioned into five similarly sized subsets, with each subset sequentially serving as the test set for performance evaluation to avoid chance results arising from specific data partitioning. Table 6 presents the statistical results of the 5-fold cross-validation, including the mean values of three metrics for each model. Furthermore, based on the 5-fold cross-validation results, the 95 % confidence intervals for each metric were calculated using the t distribution (with t=4).

https://ms.copernicus.org/articles/17/141/2026/ms-17-141-2026-f15

Figure 15Comparative experiments. The black lines in both subfigures indicate median values, while the dotted lines represent mean values. Subfigure (a) displays the RMSE and MAE values for each model, and subfigure (b) shows their corresponding R2 values.

Download

Table 6The 5-fold cross-validation performance metric statistics.

Download Print Version | Download XLSX

Overall, the improved models show significant advantages over traditional FCNN in terms of both accuracy and stability. In terms of prediction accuracy, the FCNN + IB model achieved the highest coefficient of R2 among all compared models. Regarding stability, the FCNN + MAML model demonstrates optimal performance in error control, achieving the lowest values in terms of RMSE. Statistically, all improved models show superior performance compared to the baseline model, with their 95 % CIs showing minimal overlap or complete separation. This demonstrates the effectiveness of the model improvements, their statistical significance, and the strong robustness across different data partitions.

Furthermore, this study conducted ablation experiments to investigate the impact of dimensional design in the IB-ResFCN. Based on the network framework structure described in Sect. 2.2, a systematic comparison was conducted of the model's predictive performance across different dimensions.

As shown in Table 7, dimensional variations directly impact the model's final prediction accuracy. Based on the comparison of three accuracy metrics, the dimensionality (64–32/48/64–64) was determined to be the optimal configuration. This outcome is the prediction result for the vertical deformation model. Figure 16 presents the prediction results of both the vertical deformation and contact stress models.

Table 7Comparison of ablation experiment data.

Download Print Version | Download XLSX

https://ms.copernicus.org/articles/17/141/2026/ms-17-141-2026-f16

Figure 16Prediction results. The stress prediction model accuracy metrics are as follows: MAE = 83.42, RMSE = 99.23, and R2= 0.924.

Download

https://ms.copernicus.org/articles/17/141/2026/ms-17-141-2026-f17

Figure 17Loss curve during meta-training.

Download

Furthermore, Fig. 17 illustrates the loss evolution curve during the meta-training phase. After approximately 30 training cycles, the meta-training loss reached a plateau. This phenomenon is related to the designed distribution of meta-tasks: during each meta-iteration, the model processes a batch containing N=20 distinct tasks, where each task is constructed by randomly sampling K=15 data points from the training set. These are divided into support sets (10 samples) and query sets (5 samples) for meta-gradient computation. This random sampling strategy ensures the diversity and representativeness of training tasks. The initial rapid decline in loss stems from the model learning a robust set of initialization parameters. The observed plateau suggests successful learning of the underlying task distribution patterns. Additional meta-training iterations provide negligible improvements. With the loss value stabilizing, the model exhibits rapid adaptation ability for tasks from comparable distributions.

4.4 Optimization solving results

The sensitivity analysis in Sect. 4.2 demonstrates how structural parameters affect vertical stiffness. These parameters interact in complex ways and not through simple linear addition. To address this challenge, the SLSQP algorithm from SciPy's optimization library was implemented. This method (Gill et al., 2005) proves to be highly effective for constrained optimization problems characterized by smooth nonlinear functions in both objective and constraint formulations.

4.4.1 Constraint conditions and convergence criteria

The SLSQP algorithm transforms nonlinear constrained problems into sequential quadratic programming subproblems. This iterative approach efficiently converges to optimal solutions. This approach integrates two classical optimization algorithms: sequential quadratic programming (SQP) and least squares (LS). This algorithm proves to be effective for solving the constrained optimization problem in this study:

(15) minimize x R 4 Δ h v ( x ) subject to g ( x ) [ σ ] - σ max ( x ) 0 x l x x u , x l = π 6 , 5 , 1 2 27 π 180 , x u = π 3 , 50 , 3 5 37 π 180 ,

where the optimization variable xR4 represents the structural parameters from Sect. 4.2; Δhv(x) is the minimization objective function; g(x) denotes the constraint function; and “l” and “u” respectively define the lower and upper bounds of the optimization variables. Within the constraint conditions, the maximum allowable stress is [σ]=800 MPa, and σmax(x) represents the predicted maximum stress value.

The SLSQP algorithm handles these constraints through an internal Lagrangian formulation. This formulation consistently maintains solution feasibility throughout the optimization process rather than employing an external penalty method. The convergence criteria are specified as follows:

  1. function value tolerance – ftol =1×10-6 (relative change in objective function);

  2. parameter step tolerance – xtol =1×10-6 (maximum change in optimization variables);

  3. maximum iterations – 300 (computational budget limit).

These complementary tolerances ensure convergence stability. The optimization process terminates when any condition is met. This approach ensures both computational efficiency and high-quality solutions. Figure 18 illustrates the dual-network framework implementing this algorithm.

https://ms.copernicus.org/articles/17/141/2026/ms-17-141-2026-f18

Figure 18Schematic of dual-network joint architecture.

Download

4.4.2 Multi-start optimization results and sensitivity analysis

To ensure optimization reliability and to mitigate local optima risks, this study implements a comprehensive multi-start strategy. Starting from 100 randomly generated initial points within the feasible region, the SLSQP algorithm was executed independently for each starting configuration. Figure 19 displays the distribution histogram of vertical deformation values obtained from 100 optimization runs.

https://ms.copernicus.org/articles/17/141/2026/ms-17-141-2026-f19

Figure 19Distribution histogram of vertical deformation values.

Download

Results indicate an optimization interval of [10.54, 14.50], and the success rate is 94 %. Over 70 % of results exceed the mean value, clustering tightly near the optimum. The optimal solution is reachable from diverse initial conditions. These results further confirm the stability of the optimization algorithm. Additionally, this study conducted a sensitivity analysis of the initial conditions. As shown in Table 8, we calculate the coefficient of variation (CV) for all optimal parameters. The Δhv, β, f, and θ all exhibit CV values below 10 %. These are classified as low-sensitivity parameters. The δ demonstrates a CV value between 10 %–20 %, classifying it as moderately sensitive. This necessitates strict tolerance control during actual manufacturing.

Table 8Sensitivity analysis of optimized parameters.

Download Print Version | Download XLSX

For clearer demonstration of performance metric trade-offs across the design space, Fig. 20 shows the distribution of all feasible solutions within the design space. In the figure, the red line indicates the stress constraint, while the magenta line represents the overall variation trend. The results show a positive correlation between vertical deformation and maximum contact stress. This trend indicates that, under the stress constraint condition, pursuing high stiffness typically requires reducing contact stress. Therefore, optimizing structural parameters improves these two critical performance metrics concurrently.

https://ms.copernicus.org/articles/17/141/2026/ms-17-141-2026-f20

Figure 20Distribution of feasible solutions in the performance space.

Download

Table 9 demonstrates that the optimized guide rail pair achieves a 57 % reduction in vertical deformation, a significant enhancement in vertical stiffness, and a declining trend in maximum stress values.

Table 9Initial parameters vs. optimized parameters.

Download Print Version | Download XLSX

To ensure the accuracy of optimization results, the optimized parameters are input into a parametric platform for modelling and simulation validation, with the outcomes illustrated in Fig. 21. Validation shows a mere 2.9 % deviation in maximum stress values and only 9.7 % variation in extracted vertical deformation. Consequently, the optimization results demonstrate high accuracy.

https://ms.copernicus.org/articles/17/141/2026/ms-17-141-2026-f21

Figure 21Finite-element validation diagram.

Download

Compared with traditional FEA optimization methods, this study develops a workflow encompassing neural network modelling, dual-network collaborative frameworks, and optimization solving to derive an optimal set of structural parameters.

5 Conclusions

This study addresses the multi-objective optimization design problem for rolling linear guide rail pairs. An intelligent optimization framework based on a meta-learning surrogate model is proposed in this study. This method significantly relieves the over-reliance on costly FEM in traditional design processes and effectively addresses the challenge of constructing high-precision models with limited sample data. This study conducts the following research:

  1. A platform integrating parametric modelling and automated FEA was developed. This approach dramatically enhances the efficiency of sample data generation, providing a highly reliable dataset for surrogate models.

  2. An IB-ResFCN model was constructed by integrating the inverted-bottleneck residual structure with FCNN. Through a MAML training strategy, the model achieves over 90 % prediction accuracy for mechanical performance under few-shot conditions.

  3. Based on the constructed surrogate model, the SLSQP algorithm is employed for multi-objective optimization of key structural parameters. Results demonstrate that the optimized structure achieves significantly enhanced mechanical performance, fully validating the effectiveness and engineering value of the proposed optimization method.

Code availability

The code developed for this study is not publicly available due to the confidential nature of the research project.

Data availability

The data supporting this study are not publicly available due to the confidential nature of the research project.

Author contributions

CS developed the parametric design platform, performed the finite-element analysis and experimental validation, and contributed to the model development and optimization algorithms. WD participated in the platform development, provided methodological guidance, and supervised the project. SL and JH supported funding acquisition and result validation. All of the authors reviewed and approved the final paper.

Competing interests

The contact author has declared that none of the authors has any competing interests.

Disclaimer

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. The authors bear the ultimate responsibility for providing appropriate place names. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Acknowledgements

We are grateful for the financial support from the Zhejiang Jianbing & Lingyan Key Technologies R & D Programme (grant no. 2024C01245(SD2)). In addition, this study was also sponsored by the Ningbo Key Research and Development Program (grant no. 2023T013), the 2024 Cixi Key Research and Development Program (grant no. CZ2025012), and the Ningbo Key Technologies R & D Programme (grant no. 2025Z006).

Financial support

This research has been supported by the Zhejiang Jianbing & Lingyan Key Technologies R & D Programme (grant no. 2024C01245(SD2)), the Ningbo Key Research and Development Program (grant no. 2023T013), the 2024 Cixi Key Research and Development Program (grant no. CZ2025012) and Ningbo Key Technologies R & D Programme (grant no. 2025Z006).

Review statement

This paper was edited by Zhiwei Zhu and reviewed by two anonymous referees.

References

Cheng, S. W.: Analysis of non-uniform load distribution and stiffness for a preloaded roller linear motion guide, Mech. Mach. Theory, 164, 104407, https://doi.org/10.1016/j.mechmachtheory.2021.104407, 2021. 

Fallah, A., Mokhtari, A., and Ozdaglar, A.: On the convergence theory of gradient-based model-agnostic meta-learning algorithms, arXiv [preprint], https://doi.org/10.48550/arXiv.1908.10400, 2020. 

Finn, C., Abbeel, P., and Levine, S.: Model-agnostic meta-learning for fast adaptation of deep networks, in: Proceedings of the 34th International Conference on Machine Learning, edited by: Precup, D. and Teh, Y. W., PMLR, 70, 1126–1135, https://doi.org/10.48550/arXiv.1703.03400, 2017. 

Fishwick, P. A.: Neural network models in simulation: a comparison with traditional modeling approaches, in: Proceedings of the 21st conference on Winter simulation, ACM, New York, NY, USA, 702–709, https://doi.org/10.1145/76738.76828, 1989. 

Gill, P. E., Murray, W., and Saunders, M. A.: SNOPT: An SQP algorithm for large-scale constrained optimization, SIAM Rev., 47, 99–131, https://doi.org/10.1137/S0036144504446096, 2005. 

Huisman, M., van Rijn, J. N., and Plaat, A.: A survey of deep meta-learning, Artif. Intell. Rev., 54, 4483–4541, https://doi.org/10.1007/s10462-021-10004-4, 2021. 

Jiang, Y., Cao, Q., Liu, R., Wu, W. T., He, Y., and Yan, H.: Rapid optimization of rocket projectile tailfins based on multi-fidelity neural networks with transfer learning, Comput. Struct., 315, 107801, https://doi.org/10.1016/j.compstruc.2025.107801, 2025. 

Kazemi, F., Çiftçioğlu, A. Ö., Shafighfard, T., Asgarkhani, N., and Jankowski, R.: RAGN-R: a multi-subject ensemble machine-learning method for estimating mechanical properties of advanced structural materials, Comput. Struct., 308, 107657, https://doi.org/10.1016/j.compstruc.2024.107657, 2025. 

Lamboni, M. and Kucherenko, S.: Multivariate sensitivity analysis and derivative-based global sensitivity measures with dependent variables, Reliab. Eng. Syst. Safe., 212, 107519, https://doi.org/10.1016/j.ress.2021.107519, 2021. 

Li, Z., Zhou, F., Chen, F., and Li, H.: Meta-SGD: learning to learn quickly for few-shot learning, arXiv [preprint], arXiv:1707.09835, https://doi.org/10.48550/arXiv.1707.09835, 2017. 

Liu, W. J., Zhang, S., Lin, J., Jiang, S. N., Chen, Z., Zhu, H., and Chen, G.: Investigation of static characteristics for linear rolling guide with considering geometric errors, Tribol. Int., 187, 108698, https://doi.org/10.1016/j.triboint.2023.108698, 2023. 

Liu, Z., Wang, L., Zhang, Y., and Chen, C. L. P.: A SVM controller for the stable walking of biped robots based on small sample sizes, Appl. Soft Comput., 38, 738–753, https://doi.org/10.1016/j.asoc.2015.10.029, 2016. 

Nagendra, S., Staubach, J. B., Suydam, A. J., Ghunakikar, S. J., and Akula, V. R. K.: Optimal rapid multidisciplinary response networks: RAPIDDISK, Struct. Multidiscip. O., 29, 213–231, https://doi.org/10.1007/s00158-004-0472-2, 2005. 

Nichol, A. and Schulman, J.: Reptile: a scalable metalearning algorithm, arXiv [preprint], arXiv:1803.02999, https://doi.org/10.48550/arXiv.1803.02999, 2018.  

Ohta, H. and Hayashi, E.: Sound of linear guideway type recirculating linear ball bearings, J. Sound Vib., 235, 847–861, https://doi.org/10.1006/jsvi.2000.2950, 2000. 

Parhizkar, R.: Euclidean distance matrices: properties, algorithms and applications, IEEE Signal Proc. Mag., 32, 12–30, https://doi.org/10.1109/MSP.2015.2398954, 2015. 

Phuyal, S., Bista, D., and Bista, R.: Challenges, opportunities and future directions of smart manufacturing: a state of art review, Sustainable Futures, 2, 100023, https://doi.org/10.1016/j.sftr.2020.100023, 2020. 

Quan, L. and Zhao, W.: A review on positioning uncertainty in motion control for machine tool feed drives, Precis. Eng., https://doi.org/10.1016/j.precisioneng.2024.03.003, 2024. 

Ramu, P., Thananjayan, P., Acar, E., Bayrak, G., Park, J. W., and Lee, I.: A survey of machine learning techniques in structural and multidisciplinary optimization, Struct. Multidiscip. O., 65, 1–31, https://doi.org/10.1007/s00158-022-03369-9, 2022. 

Roy, A., Manna, R., and Chakraborty, S.: Support vector regression based metamodeling for structural reliability analysis, Probabilist. Eng. Mech., 55, 78–89, https://doi.org/10.1016/j.probengmech.2018.11.001, 2019. 

Sahoo, S. and Lo, C. Y.: Smart manufacturing powered by recent technological advancements: a review, J. Manuf. Syst., 64, 236–250, https://doi.org/10.1016/j.jmsy.2022.06.008, 2022. 

Shimizu, S.: Stiffness analysis of linear motion rolling guide, J. Jpn. Soc. Precis. Eng., 64, 1573–1576, 1998. 

Staroszyk, D., Müller, J., and Ihlenfeldt, S.: Increasing the service life prediction accuracy of linear guideways considering real operating conditions, Mech. Mach. Theory, 201, 105734, https://doi.org/10.2139/ssrn.4806567, 2024. 

Su, H., Xiang, L., Hu, A., Xu, Y., and Yang, X.: A novel method based on meta-learning for bearing fault diagnosis with small sample learning under different working conditions, Mech. Syst. Signal Pr., 169, 108765, https://doi.org/10.1016/j.ymssp.2021.108765, 2022. 

Sun, W., Kong, X., Wang, B., and Li, X.: Statics modeling and analysis of linear rolling guideway considering rolling balls contact, P. I. Mech. Eng. C-J. Mech., 229, 168–179, https://doi.org/10.1177/0954406214531943, 2015. 

Sze, V., Chen, Y. H., Yang, T. J., and Emer, J. S.: Efficient processing of deep neural networks: a tutorial and survey, P. IEEE, 105, 2295–2329, https://doi.org/10.1109/JPROC.2017.2761740, 2017. 

Sandler, M., Howard, A., Zhu, M., Zhmoginov, A., and Chen, L. C.: Mobilenetv2: inverted residuals and linear bottlenecks, in: Proceedings of the IEEE Conference on CVPR, 4510–4520, https://doi.org/10.1109/CVPR.2018.00474, 2018. 

Tong, V. C., Khim, G., Hong, S. W., and Park, C. H.: Construction and validation of a theoretical model of the stiffness matrix of a linear ball guide with consideration of carriage flexibility, Mech. Mach. Theory, 140, 123–143, https://doi.org/10.1016/j.mechmachtheory.2019.05.021, 2019. 

Tong, V. C., Khim, G., Park, C. H., and Hong, S. W.: Linear ball guide design optimization considering stiffness, friction force, and basic dynamic load rating using particle swarm optimization, J. Mech. Sci. Technol., 34, 1313–1323, 2020. 

Wang, L., Kolios, A., Nishino, T., Delafin, P.-L., and Bird, T.: Structural optimisation of vertical-axis wind turbine composite blades based on finite element analysis and genetic algorithm, Compos. Struct., 153, 123–138, https://doi.org/10.1016/j.compstruct.2016.06.003, 2016. 

Wang, W., Li, C., Zhou, Y., Wang, H., and Zhang, Y.: Nonlinear dynamic analysis for machine tool table system mounted on linear guides, Nonlinear Dynam., 94, 2033–2045, https://doi.org/10.1007/s11071-018-4473-x, 2018. 

Wei, J., Li, C., and Ma, Y.: Finite element model for static characteristic analysis of rolling linear guide, P. I. Mech. Eng. C-J. Mech., 236, 1721–1732, https://doi.org/10.1177/09544062211021443, 2022. 

Xu, M., Zhang, H., Li, C., Yao, G., Zhang, Y., and Gao, Y.: An analytical nonlinear dynamic model for linear guide with carriage flexibility, Int. J. Nonlin. Mech., 148, 104251, https://doi.org/10.1016/j.ijnonlinmec.2022.104251, 2023.  

Yang, H., Wang, Z., Zhang, T., and Du, F.: A review on vibration analysis and control of machine tool feed drive systems, Int. J. Adv. Manuf. Tech., 107, 503–525, https://doi.org/10.1007/s00170-020-05041-2, 2020. 

Zha, J., Cheng, K., Xue, F., and Lu, Y.: Hydrostatic guideways for precision machines: the state-of-the-art and future perspectives, Tribol. Int., 110060, https://doi.org/10.1016/j.triboint.2024.110060, 2024. 

Zou, H. T. and Wang, B. L.: Investigation of the contact stiffness variation of linear rolling guides due to the effects of friction and wear during operation, Tribol. Int., 92, 472–484, https://doi.org/10.1016/j.triboint.2015.07.005, 2015. 

Download
Short summary
Rolling guides are essential parts of precision machines, yet their design often depends on slow and costly simulations. This study presents a computer model that learns from a small number of examples to predict and improve guide performance. Combined with an automated design platform, it enables faster, more accurate, and efficient optimisation of mechanical components.
Share