hi
just read the last couple of post on your journal, about backtesting on data...
... I must confess that i don't know how your backtesting is done, but, if you can import data from excel to backtest on in any way, some thing you might like to try is called "bootstrapping" (slightly different from the bootstrapping that goes on in FI pricing);
Say you have a series of 100 pieces of data (returns), and you are looking to backtest for a period of 20 days. A couple of ways to go about generating a data series is by:
* picking 20 of the 100 data points at random (i.e. each piece of data has probability 1/100 of being picked), string them together, and backtest your idea on this series; do that over and over until the cows come home, then average the results (bootstrapping is a little like a MonteCarlo simulation, except that is uses actual data, including the "outlying" days that MC's can miss
** Another way of doing a similar thing is to pick a starting data point and set this as "0". Then, draw a normal distrubition around "0" (i.e. with 0 as the mean) and pick your next data point, at random, from the distribution...
... I am not explaining it very well. Imagine your 100 pieces of data lined up vertically; set the middle point as "0" - and label prices up and down from 0 - 50 and 0 - -50 respectively. Then start at 0, draw up a normal distribution around it, and pick "1" - i.e. you try and "cluster" a series of 20 data points that were relatively close together in time - i mean, with the first method, you might get a series that starts with 19 oct 87 - big moves - followed by the last day of trading before christmas, with bugger all happening; it's not representative of realistic market conditions. Instead, you pick 19/10/87 as "0", then pick - at random - the next date's returns, which between 1 S.D. might be from 10th - 28th october; again, do this until the cows come home, and look for extremes / average results.
All of this you can do in excel, then (with any luck) run your backtesting software on the data - hope it's of some use to you.
You mention that you are using per tick data for your algorithm; can I ask why you elected for tick data instead of, say, 15 min intervals? The reason I ask is, at a tick by tick level, i assume the big boys will wipe the floor with you? I dont mean offence, but at that kind of granularity, massive processing speed, bare minimum of latency and so on are surely necessary to survive?
just read the last couple of post on your journal, about backtesting on data...
... I must confess that i don't know how your backtesting is done, but, if you can import data from excel to backtest on in any way, some thing you might like to try is called "bootstrapping" (slightly different from the bootstrapping that goes on in FI pricing);
Say you have a series of 100 pieces of data (returns), and you are looking to backtest for a period of 20 days. A couple of ways to go about generating a data series is by:
* picking 20 of the 100 data points at random (i.e. each piece of data has probability 1/100 of being picked), string them together, and backtest your idea on this series; do that over and over until the cows come home, then average the results (bootstrapping is a little like a MonteCarlo simulation, except that is uses actual data, including the "outlying" days that MC's can miss
** Another way of doing a similar thing is to pick a starting data point and set this as "0". Then, draw a normal distrubition around "0" (i.e. with 0 as the mean) and pick your next data point, at random, from the distribution...
... I am not explaining it very well. Imagine your 100 pieces of data lined up vertically; set the middle point as "0" - and label prices up and down from 0 - 50 and 0 - -50 respectively. Then start at 0, draw up a normal distribution around it, and pick "1" - i.e. you try and "cluster" a series of 20 data points that were relatively close together in time - i mean, with the first method, you might get a series that starts with 19 oct 87 - big moves - followed by the last day of trading before christmas, with bugger all happening; it's not representative of realistic market conditions. Instead, you pick 19/10/87 as "0", then pick - at random - the next date's returns, which between 1 S.D. might be from 10th - 28th october; again, do this until the cows come home, and look for extremes / average results.
All of this you can do in excel, then (with any luck) run your backtesting software on the data - hope it's of some use to you.
You mention that you are using per tick data for your algorithm; can I ask why you elected for tick data instead of, say, 15 min intervals? The reason I ask is, at a tick by tick level, i assume the big boys will wipe the floor with you? I dont mean offence, but at that kind of granularity, massive processing speed, bare minimum of latency and so on are surely necessary to survive?