Synthetic Data: Would you use it to develop trading strategies?

I’ve been exploring the use of synthetic data as input data for the development of systematic trading strategies. Some of the benefits that made me interested in this are:

  • Mitigating Overfitting: Synthetic data helps in creating more robust trading strategies by mitigating the issue of overfitting that often arises when using limited historical data. This allows strategies to be more adaptable and effective in various market conditions​​.
  • Simulating Market Conditions: Synthetic data includes a range of market scenarios, including rare events and crises that may not be well-represented in historical data. This exposure helps in developing trading strategies that are robust to different market conditions and periods of uncertainty​​.
  • Software Testing: Synthetic financial data is used to test and validate trading algorithms and software systems without using real data. This limits the problems that arise when using historical data mulitple times.

In theory this would mean that synthetic data helps creating more robust trading strategies, and allows for a whole new area of innovation in terms of what kind of strategies you can create.

Does anyone have any practical experience with synthetic data?

Hi there, I don’t know what will be your methodology to generate data.

I share a bit what I found using AI to generate trading strategy.

What I see, trading system using statistical model tend to be more profit. It can survive more to market dynamic.
I’ve tried neural network, the result is not good after a long while. You need to train the model again. Sometime it get over fit the system becomes dull. If you are using in your real account, it wont be good thing.

I haven’t found good methodology to predict / forecast market. As long as I know all trading system that claim using AI are statistical based system.

I’ve been using GANs to generate data, and they are able to recreate characterstics as thicker tails and volatility clustering, and are statistically similar to real data. The idea is to generate an X amount of new data streams and use those as input data. So the trading strategies trained on synthetic data have seen X times more data than trading strategies trained on real data, including different market dynamics.

I have doubt GAN can generate realistic data. You may look at USDCHF 2014, GBPUSD 2022.

In the end all depend on your AI model. For me, I had tried NN model 5 layer depth. What I found deeper the model, worse the result.

Not in the field of Forex trading, but in other fields, yes. Here is what my AI friend says:

You

what is synthetic data and how can it be used in forex EA testing?

Copilot

Certainly! Let’s explore synthetic data and its application in Forex Expert Advisor (EA) testing:

  1. What Is Synthetic Data?
  1. Using Synthetic Data in Forex EA Testing:
  • Backtesting: Backtesting involves applying trading strategies to historical data. Synthetic data can be used to create additional scenarios beyond the available historical data.
  • Scenario Testing: Synthetic data allows testing under various market conditions, including extreme scenarios that may not occur frequently in real data.
  • Stress Testing: Simulating extreme market events (e.g., flash crashes, sudden volatility spikes) using synthetic data helps assess EA performance.
  • Model Validation: Synthetic data helps validate EAs against known market patterns and behaviors.
  • Robustness Testing: EAs can be tested against synthetic data with different noise levels, gaps, or missing data.
  • Parameter Optimization: Synthetic data aids in fine-tuning EA parameters for optimal performance.
  1. Generating Synthetic Data for EA Testing:

Remember that while synthetic data provides flexibility, it should be used judiciously. Combining synthetic and real data ensures comprehensive EA testing and robustness assessment324.

Learn more

1multiple.dev2quant.stackexchange.com3eatradingacademy.com4admiralmarkets.com5elitecurrensea.com+3 more

1 Like

Thank you for that! How was your experience with synthetic data? Did it improve your business operations?

Hi, my experience was a good outcome. It’s application was in sizing oil reservoirs for recoverable oil using pulse and interference testing.