1. Introduction
The fiber optic gyroscope, based on the Sagnac effect, represents a new generation of high-performance inertial sensors. It has gained prominence as the preferred inertial instrument for medium- and high-precision inertial navigation systems due to its outstanding advantages, such as high accuracy, solid-state design, and high reliability. It finds extensive applications in various domains, including marine, terrestrial, aerial, and space sectors [
1,
2]. Currently, there are two primary technical approaches for fiber optic gyroscopes: the traditional fiber optic gyroscope with discrete packaging of core optical components and the integrated optical fiber gyroscope. With the rapid development of optoelectronics, microelectronics, and micro/nano-processing technologies, integrated optical chips that combine various functions such as emission, coupling, modulation, and detection have made significant breakthroughs. These integrated optical chips are widely used in the field of optical communications, especially silicon-based photonic devices that are compatible with traditional CMOS (Complementary Metal-Oxide-Semiconductor) processes. They offer advantages like compact size, high performance, and cost-effectiveness, making them a key technological solution for future photonic integration [
3,
4]. Whether it is the traditional discrete fiber optic gyroscope or the integrated optical fiber gyroscope, the optical fiber coil serves as the sensitive core component, and its performance directly impacts the engineering precision of fiber optic gyroscopes [
5].
The temperature performance of the optical fiber coil has the most significant impact on the zero-bias error of the fiber optic gyroscope [
6]. When the fiber optic gyroscope is used in high-temperature environments, temperature effects can cause geometric deformation of the optical fiber coil, deviating from its original output characteristics, resulting in thermal errors. For sensing devices whose working principles depend on geometric characteristics, thermal errors are an unavoidable source of error [
7,
8]. Currently, thermal error compensation is an effective means to reduce thermal deformation errors. It calculates the thermal error magnitude at various temperatures through algorithms and provides feedback compensation. Its advantage lies in not requiring hardware modifications; it can reduce thermal errors through software alone [
9]. The key to thermal error compensation is the ability to obtain accurate thermal error values. However, for fiber optic gyroscopes, thermal errors are difficult to directly separate from the measured values. Therefore, it is feasible to indirectly predict thermal errors based on temperature. However, this approach requires the establishment of a mathematical model between thermal errors and temperature for the fiber optic gyroscope. In theory, if the thermal expansion coefficient of the optical fiber coil is known, it is possible to derive the additional phase error caused by temperature based on its working principle. However, the material composition of fiber optic gyroscopes is complex, including not only the core but also cladding, stress zones, inner coatings, and outer layers [
10]. This complexity makes it difficult to determine the comprehensive thermal expansion coefficient when various materials are combined. Additionally, due to differences in manufacturing processes for different fiber optic gyroscopes and variations between production batches, it is challenging to establish a quantitative thermal error model for fiber optic gyroscopes purely through theoretical derivation.
In addition to pure theoretical derivation, data-driven methods are also an effective approach for modeling complex objects. For thermal deformation errors in fiber optic gyroscopes, it is possible to directly measure phase errors at different temperatures and model them based on data. This method has been widely applied in the study of thermal errors in numerical control machine tools [
11], with the common practice being the establishment of a model between key points’ temperatures on the machine tool and thermal errors. Multiple regression benefits from its simplicity and fast computation speed, making it the most commonly used method. Mayr [
12] and Zimmermann [
13,
14] utilized multiple regression algorithms to propose a thermal error compensation mechanism that updates the model online. It assesses the model’s effectiveness through periodic accuracy checks, supplements new measurement data if the model becomes ineffective, and rebuilds the model. Wei [
15], using the Gaussian process regression algorithm, established an adaptive thermal error model that adjusts according to environmental changes. Machine tool structures vary widely, and some exhibit complex thermal deformation error patterns. Therefore, neural networks have also found extensive application. Li [
16] combined stochastic theory, genetic algorithms, and the radial basis function neural network (RBFNN) algorithm to create a comprehensive model that considers thermal errors, temperature, and random disturbance factors, thereby improving the robustness of thermal error prediction. Liu [
17,
18] leveraged the long-term memory properties of LSTM neural networks to establish thermal error models, allowing multiple time data points to contribute to thermal error prediction and enhancing prediction accuracy. Additionally, some researchers [
19] have based their work on convolutional neural networks, using deep learning to directly establish mathematical models between the overall machine tool temperature field and thermal errors.
In the aforementioned research on modeling thermal errors in numerical control machine tools, despite using different algorithms, they all employed the same fundamental strategy: increasing the number of modeling samples to enhance the model’s generalization properties and reduce overfitting. Overfitting [
20] can lead the model to fit irrelevant information from the training data, such as errors, leading to a significant deterioration in the model’s predictive performance on data not included in the modeling process. The scarcity of modeling data samples is one of the reasons for the overfitting problem, and it is also a central challenge in modeling thermal deformation errors in fiber optic gyroscopes. Modeling thermal deformation errors in fiber optic gyroscopes is challenging because the measurement of these errors is difficult. Each temperature point measurement requires a relatively long wait time to allow the temperature field to reach equilibrium. For mass-produced products, it is challenging to perform intensive temperature interval sampling. Additionally, the performance of sensors can be highly unstable under frequent temperature fluctuations, necessitating frequent compensation and correction of the sensor’s own thermal errors. All these factors lead to data-driven modeling of thermal deformation errors in fiber optic gyroscopes, and it is necessary to overcome the overfitting problem caused by a small sample size.
In response to these challenges, this paper introduces a modeling algorithm that combines theoretical models with modeling data. Taking into account the stability of the fiber optic gyroscope’s material, it is possible to deduce the comprehensive thermal expansion coefficient of the optical fiber material and establish a theoretical model for thermal deformation errors in the fiber optic gyroscope through a single intensive temperature experiment. Subsequently, for different batches of fiber optic gyroscopes, while there may be process variations, the resulting characteristic changes can be distributed around the theoretical model following a certain probability density function. If a confidence probability is set, and it is believed that the occurrence of characteristics outside the confidence probability is a rare event, then the distribution of characteristic variations remains within a limited range around the theoretical model. This indicates that the theoretical model still contains valuable information. Consequently, when modeling, the theoretical model can be used as a regularization term, combined with small-sample data to address overfitting issues. Based on this, the algorithm proposed in this paper is referred to as the TD-model modeling algorithm, and its effectiveness is validated through experiments. Based on this approach, the algorithm proposed in this paper is referred to as the TD-model modeling algorithm, and its effectiveness has been experimentally validated.
2. The Principle of a Fiber Optic Gyroscope
Figure 1 shows two beams of light, A and B, propagating along a fiber optic ring in different directions. In this setup, fiber optic A travels through the fiber optic ring in a clockwise direction, while fiber optic B travels in a counterclockwise direction. Both fiber optic beams follow a path with a radius of R within the fiber optic ring.
The geometric distance traveled by two optical fibers within a stationary optical fiber ring is equal, as follows.
Now, the optical fiber ring rotates at a certain angular velocity Ω, such as clockwise, as shown in
Figure 2.
According to the Sagnac effect, the geometric distance traveled by light beams A and B around the optical fiber ring is no longer equal, as follows.
In which, and represent the geometric distances traveled by light beams A and B around the optical fiber ring for one complete rotation, and and denote the time spent.
Therefore, when the optical fiber ring starts to rotate, there is a certain optical path difference between the two light beams at the exit position, resulting in a phase difference . By measuring the phase difference between the two light beams, the rotational angular velocity Ω of the optical fiber ring can be calculated.
According to the Lorentz–Einstein velocity transformation formula, the propagation velocities of light beams A and B in the rotating optical fiber ring no longer have a simple proportionality relationship with the refractive index. Specifically, it is as follows.
where
and
are the propagation velocities of light beams A and B within the optical fiber ring,
is the speed of light in a vacuum, and
n is the refractive index of the optical fiber ring.
Combining Equations (2) and (3), it can be seen that
Because the frequency of light remains constant while propagating through a medium, therefore
where
,
, and
represent the wavelengths of light beams A, B, and in vacuum, respectively.
Furthermore, it is possible to calculate the number of wavelengths experienced by light beams A and B after one rotation around the optical fiber ring, denoted as
and
, respectively. As follows:
Subsequently, when light beams A and B converge, the difference in the number of wavelengths between them is
Equation (8) can be simplified to
Therefore, the phase difference is
For an
N-turn optical fiber, every turn generates a phase difference ∆Φ, and consequently, the total phase difference for an
N-turn optical fiber ring is
4. Verification Experiment
To validate the effectiveness of the fiber optic gyroscope’s thermal error model, temperature variation experiments were conducted on the fiber optic gyroscope for detection.
4.1. Experimental Plan
Place the fiber optic gyroscope with dimensions of 50 × 50 in a temperature chamber, and the optical fiber phase variation was measured within the temperature range of −18 to 65 °C and the angular velocity range of −860 to 860°/h. The experimental setup, as shown in
Figure 3 and
Table 1, involved mounting the gyroscope under test inside a temperature-controlled chamber with a rotating platform. The temperature was gradually reduced to −18 °C at a rate of 2 °C/min, followed by a 2 h insulation period. The rotating platform was then tested at different speeds: ±860, ±500, ±360, ±180, ±90, ±30, ±10, ±1, ±0.5, ±0.3, and ±0.1°/h, and so on, at 0 °C, 25 °C, 40 °C, and 65 °C, replicating the same tests at each temperature point.
4.2. Experimental Data
As shown in
Figure 4, these are the phase errors of the fiber optic gyroscope at each temperature test point.
For better visualization, the raw data in this study have been processed.
Figure 4 represents the difference between the phase measurement values at each test point and the theoretical values under ideal conditions at 20 °C. These differences are referred to as phase errors. The theoretical values are calculated by substituting the angular velocity values into Equation (12).
It can be observed that as the temperature gradually increases, the phase error exhibits clear non-linearity but maintains a monotonous changing trend. The slope of the phase error with respect to angular velocity changes from negative values to positive values gradually, and with the rise in temperature, the rate of increase also becomes greater.
4.3. Experimental Results
To validate the effectiveness of the thermal error modeling, this study chose to build the model using data from three temperature test points: 0 °C, 25 °C, and 40 °C. Subsequently, the data from two temperature test points, −18 °C and 65 °C, were used for prediction, serving as a test of the model’s predictive accuracy for data not included in the modeling process.
First, based on Equation (25), the comprehensive coefficient of thermal expansion for the optical fiber ring was estimated to be . This value was then used in Equation (16) to calculate the model’s theoretical values. Subsequently, the theoretical values and measurement data were jointly used in Equation (28) to establish the model, referred to as the Theoretical-Data-Driven model or TD-model. For the purpose of comparison, this study directly performed regression on the measurement data to establish a control model known as the Pure Data model or OD-model.
Subsequently, based on the original measurement data, the input values required for the model predictions, namely,
,
, and Ω, as in Equation (16), were computed. These values were separately used in the TD-model and OD-model to obtain model predictions, which were then compared with the actual measurement values, as shown in
Figure 5.
It can be observed that the TD-model significantly outperforms the OD-model, especially in the −18 °C environment where the OD-model exhibits predictive results that are completely opposite to the measurement values.
To quantitatively assess the predictive performance of the two modeling approaches, the relative root mean square error (
for each model’s predictions of the measurement data was calculated. The
calculation method is shown in Equation (29).
where
represents the model’s prediction, and
represents the measured result. According to Equation (11),
RMSE reflects the sum of squares of differences between the model’s predicted results and the actual measurement results. The larger the differences, the larger the
RMSE. Using the range of measured values as the range, the relative
RMSE as a percentage of the full-scale range was calculated, as shown in
Table 2.
The data indicate that the predictive accuracy of the TD-model is improved by 58% compared to the OD-model, demonstrating the effectiveness of the proposed algorithm.
5. Discussion
Discussing and analyzing the experimental results.
One of the most significant controversies encountered during the research process is why the predictive results of the OD-model at −18 °C are opposite to the actual measurement values, while the TD-model provides accurate results.
The modeling data used for this study come from measurements taken at temperatures of 0 °C, 25 °C, and 40 °C. As observed in
Figure 4, examining the variation in phase errors at these three temperature points reveals that if we fix the gyroscope at a certain rotational speed, the phase error decreases non-linearly and monotonically with respect to temperature. For example, at a rotational speed of −860°/h, plotting the variation curve of phase error with temperature and fitting it with the quadratic model results in the pattern shown in
Figure 6.
It can be observed that although the measurement data exhibit a monotonically decreasing trend, the fitting results suggest that the most appropriate quadratic curve assumes that the phase error is monotonically increasing with temperature near 0 °C, thereby leading to an overfitting issue. For other angular velocities, the same conclusion can be drawn, and thus, further elaboration is not necessary.
This issue is one of the reasons for introducing the TD-model. Fiber optic gyroscope testing experiments are relatively complex, making it difficult to conduct a large number of experiments, resulting in limited data. These limited data can easily lead to overfitting. The TD-model introduces a theoretical model to correct the distortion caused by overfitting. By observing Equation (28), it can be seen that the TD-model’s calculation formula is actually a variant of ridge regression [
21]. If
is set to 0, it becomes ridge regression. According to the properties of ridge regression [
22], a slight increase in the value of η, although causing a slight deviation of the model’s expectation from the true value, leading from an unbiased regression to a biased regression, can effectively suppress overfitting. This significantly increases the probability of the model falling close to the true value. One difference between the TD-model and ridge regression is that if ridge regression increases the value of η, the model coefficients tend to approach zero, whereas the TD-model makes the model coefficients tend to be closer to the theoretical values of the true values. Thus, while suppressing overfitting, the TD-model significantly reduces the degree to which the model’s expectation deviates from the true value, allowing for the establishment of a more effective model.
Furthermore, this paper analyzes the stability of sensitivity and zero drift of the fiber optic gyroscope after compensation. Sensitivity and zero drift are calculated using the phase signals before and after compensation, respectively. The signal before compensation is the original phase measurement value, and the signal after compensation is the original phase measurement value minus the phase error predicted by the TD-model.
For the fiber optic gyroscope, sensitivity is defined in this paper as the phase change caused by unit rotational speed variation, as shown in Equation (30), where the output is the phase signal of the fiber optic gyroscope, and the input is the rotational speed. The specific calculation method involves the least squares fitting coefficients between the output and input at each temperature point.
Zero drift refers to the phase error when the rotational speed is 0°/h, and in this paper, 0.1°/h phase error is taken as the zero drift.
As shown in
Figure 7, the data illustrate the variations in sensitivity and zero drift with temperature before and after compensation.
As shown in
Figure 7, after compensation, the temperature stability of sensitivity has significantly improved, but the improvement in zero drift is not apparent. According to Equations (12) and (30), it can be determined that the theoretical value of sensitivity is
Furthermore, theoretically, sensitivity will be influenced by the deformation of the fiber structure. However, the theoretical value of zero drift, obtained by setting Ω = 0 and substituting into Equation (12), is a constant 0, indicating that it theoretically will not be affected by structural changes. Therefore, compared to zero drift, sensitivity is more susceptible to the influence of temperature, and after compensation, stability can also be improved.
6. Conclusions
This paper has conducted modeling and analysis of the thermal deformation error in fiber optic gyroscopes. By combining theoretical derivation with experimental data, a new thermal error modeling algorithm has been proposed, significantly improving the modeling accuracy of thermal deformation error in fiber optic gyroscopes. The specifics are as follows.
Based on the working principles of fiber optic gyroscopes, a theoretical model was derived for the phase error induced by thermal deformation in the fiber optic ring and its relationship with temperature and angular velocity. The model reveals that the phase error in fiber optic gyroscopes is quadratically related to temperature. However, obtaining a quantitative theoretical model relies on accurate thermal expansion coefficients of the materials used in the fiber optic ring. Since the fiber optic ring is composed of multiple materials in combination, determining the thermal expansion coefficients theoretically becomes challenging.
Thermal deformation error measurement experiments were conducted on a specific model of a fiber optic ring. Phase errors relative to the ideal environment at 20 °C were measured at different angular velocities and temperatures. The experimental results revealed a nonlinear correlation between phase error and temperature, aligning with theoretical expectations. However, due to the complexity of the experiments and the difficulty in conducting them on a large scale, direct modeling using experimental data resulted in overfitting issues and poor predictive accuracy of the model. This paper introduces a novel modeling algorithm that combines the theoretically derived model with experimental data, effectively mitigating the overfitting problem. The model established is referred to as the Theoretical-Data-Driven model, abbreviated as TD-model. It has been validated that the TD-model significantly enhances the predictive accuracy of phase errors induced by thermal deformation in fiber optic gyroscopes. Compared to a model based solely on experimental data, the TD-model demonstrates a 58% improvement in predictive accuracy, thus confirming the model’s effectiveness.