Results 1 to 7 of 7
  1. #1
    MeiHua's Avatar
    MeiHua is online now FX-Men Honorary Member
    Join Date
    Jul 2011
    Location
    In the Market
    Posts
    1,239

    Default Synthetic Data, never run out of OOS data again.

    Here is another simple exercise anyone can use. its great because it generates unlimited amounts of data for you to back test over, practice on, evaluate a system, or just look at. The reason why this method which is a spin off of what Tuschar Chande does is because it uses the entire range of data you set instead of random sampling. With random sampling you could resample moves, here each move is only used once. Also this does a lot of the same work as my data mining system. It maintains the same distribution of movement as the market actually had over that period, even though it is using a randomize that is normal in distribution. So if you ever wanted more practice on a single pair but have used all the historical data your provider gave you, you can generate more that will maintain that specifics markets personality.

    Here is a picture of some real vs synthetic data I generated its the 1st 100 bars of a 15 minute chart.


    You can see that in the synthetic price patter you still have, dojis, engulfing bars, quiet times as well as large range bars. This is a true test because if your system can still come out positive in this environment it is able to pick out something that is NON-RANDOM and there is no chance for you to curve fit, as you can recycle this data a million times and arrive at a million different price structures.

    here a pic of the spreadsheet you can make it yourself its easy enough.




    Instructions:
    1) export and cut and paste the time and price data
    2) calculate the differences between the OHLC of the bar to the C of the previous bar
    3) copy values and add rand() function
    4) sort values by rand() column
    5) use the close of the 1st candle of real data as reference, calculate new bars based on the scrambled movements
    6) save and format to be imported in to the charting package.
    7) enjoy

    ps: I got the spreadsheet layout from a different forum. I just thought I would share it, but i cant link it.
    Last edited by MeiHua; 08-18-2012 at 06:40 PM.


  2. #2
    ClarkFX's Avatar
    ClarkFX is online now FX-Men Honorary Member
    Join Date
    Sep 2010
    Location
    Calgary, Alberta
    Posts
    1,296
    You're going to have a hard time getting accurate testing results from randomly generated data because it does not completely reflect true market behaviours. Rather than thinking the market is completely random, I'd like to think the market is pseudo-random.

  3. #3
    MeiHua's Avatar
    MeiHua is online now FX-Men Honorary Member
    Join Date
    Jul 2011
    Location
    In the Market
    Posts
    1,239
    Quote Originally Posted by ClarkFX View Post
    You're going to have a hard time getting accurate testing results from randomly generated data because it does not completely reflect true market behaviours. Rather than thinking the market is completely random, I'd like to think the market is pseudo-random.
    I completely agree with you. Because i isolated the volatility, this is not random, it remains as fat tailed as the original distribution. The only difference is the order in which these events occur, so your edge should still remain assuming that your exploiting some thing that exists that is non random, that occurs in the market no matter what order the candles are given.

  4. #4
    Jezzode's Avatar
    Jezzode is offline FX-Men Honorary Member
    Join Date
    Sep 2010
    Location
    London Tower Bridge, UK
    Posts
    1,137
    Excuse me for being old fashioned, but it's far easier to use real historical data. Nothing better than the real thing! :P
    30 pairs, 10 years on each...who would need any more data!
    You learn more looking for the answer to a question than you do being told the answer straight away.

  5. #5
    MeiHua's Avatar
    MeiHua is online now FX-Men Honorary Member
    Join Date
    Jul 2011
    Location
    In the Market
    Posts
    1,239
    That point is taken. 7,5 years In sample, 2.5 out of sample is usually my standard deal. Using uncorrelated or correlated pairs is also useful. But I could always use more with the same dynamics as the market I am trying to trade it on. In general probably no ones going to use this. But when i make something i need 30+ System years of solid data before i make a decision.

  6. #6
    ClarkFX's Avatar
    ClarkFX is online now FX-Men Honorary Member
    Join Date
    Sep 2010
    Location
    Calgary, Alberta
    Posts
    1,296
    Quote Originally Posted by MeiHua View Post
    That point is taken. 7,5 years In sample, 2.5 out of sample is usually my standard deal. Using uncorrelated or correlated pairs is also useful. But I could always use more with the same dynamics as the market I am trying to trade it on. In general probably no ones going to use this. But when i make something i need 30+ System years of solid data before i make a decision.
    There's a few recent articles stating that the average life of a positive return algorithm is around 3 months. Trying to create a robust system across multiple markets and such large samples is tough. Also, with regime shifts I find it redundant to test on such large samples.

  7. #7
    MeiHua's Avatar
    MeiHua is online now FX-Men Honorary Member
    Join Date
    Jul 2011
    Location
    In the Market
    Posts
    1,239
    Quote Originally Posted by ClarkFX View Post
    There's a few recent articles stating that the average life of a positive return algorithm is around 3 months. Trying to create a robust system across multiple markets and such large samples is tough. Also, with regime shifts I find it redundant to test on such large samples.
    Do your systems not incorporate a regime shift mechanism? Or do you just turn some on and some off? Assuming those regimes return in some point in the futures, as there is some cyclical/seasonal effect? I would love to check those articles if you could link em. For me its about robustness and portfolio diversification. To be honest I am pretty green behind the ears here since i have never traded live with an algo. So any advice would be good. Just taking what i see out there, and what makes sense.

Similar Threads

  1. Replies: 0
    Last Post: 05-28-2012, 06:23 PM
  2. Dollar Rallies on Strong Data, but Watch Out for a Big Week
    By DailyFx in forum The Analyst Arena
    Replies: 0
    Last Post: 07-25-2008, 04:40 PM
  3. Replies: 0
    Last Post: 12-10-2007, 07:10 PM
  4. Replies: 0
    Last Post: 08-08-2007, 05:40 PM
  5. Replies: 0
    Last Post: 05-10-2007, 05:30 PM

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •  
"The secret to creativity is knowing how to hide your sources."
Albert Einstein