| Version: | 2018.5-1 |
| Date: | 2018-05-12 |
| Title: | Manage Tick-by-Tick Transaction Data |
| Author: | Francesco Calvori, Fabrizio Cipollini, Giampiero M. Gallo and 'gzstream' authors. |
| Maintainer: | Fabrizio Cipollini <cipollini.fabrizio@gmail.com> |
| Depends: | R (≥ 3.0-2) |
| LinkingTo: | Rcpp (≥ 0.11.0) |
| SystemRequirements: | zlib headers and library |
| Description: | Manager of tick-by-tick transaction data that performs 'cleaning', 'aggregation' and 'import' in an efficient and fast way. The package engine, written in C++, exploits the 'zlib' and 'gzstream' libraries to handle gzipped data without need to uncompress them. 'Cleaning' and 'aggregation' are performed according to Brownlees and Gallo (2006) <doi:10.1016/j.csda.2006.09.030>. Currently, TAQMNGR processes raw data from WRDS (Wharton Research Data Service, https://wrds-web.wharton.upenn.edu/wrds/). |
| License: | GPL-2 | GPL-3 [expanded from: GPL (≥ 2)] |
| URL: | https://cran.r-project.org/package=TAQMNGR |
| NeedsCompilation: | yes |
| Packaged: | 2018-05-20 05:32:34 UTC; cipollini |
| Repository: | CRAN |
| Date/Publication: | 2018-05-20 06:37:40 UTC |
TAQ Manager
Description
Manage tick-by-tick transaction data
Usage
TAQ.CleanTickByTick(dirInput, dirOutput,
window = 80, deltaTrimmed = 0.10, granularity = 0.04, useCleaned = TRUE)
TAQ.Aggregate(dirInput, symbol, bin, useAggregated = TRUE)
TAQ.Report(dirInput, symbol)
TAQ.Read(dirInput, symbol, import = NULL, startDate, endDate, bin)
Arguments
dirInput |
A character scalar: the input directory. |
dirOutput |
A character scalar: the output directory.
It must be different from |
window |
A numeric integer scalar: the window size for the trimming procedure of data clean (see Details). |
deltaTrimmed |
A numeric scalar into $(0,1)$: the trimming proportion (see Details). |
granularity |
A numeric positive scalar: the granularity parameter (see Details). |
useCleaned |
A logical scalar: if |
useAggregated |
A logical scalar: if |
symbol |
A character (vector in |
startDate |
A numeric integer scalar: the start date in the yyyymmdd format. |
endDate |
A numeric integer scalar: the end date in the yyyymmdd format. |
bin |
A numeric integer scalar: the bin size (in seconds) for aggregating data. |
import |
A character: the list of fields to be imported. One or more among: |
Details
The meaning of the arguments window, deltaTrimmed, and granularity is detailed in the reference below.
References
Brownlees, C. T., and Gallo, G. M. (2006). Financial Econometric Analysis at Ultra–High Frequency: Data Handling Concerns, Computational Statistics and Data Analysis 51, 2232–2245.
Examples
#### A fake dataset for running the example can be downloaded at
#### 'http://local.disia.unifi.it/cipollini/webpage-new/data/data_sample.txt.gz'
## Input
# dirInput <- "path of the input folder"
# dirOutput <- "path of the output folder" ## Must be different from 'dirInput'
## Clean
# TAQ.CleanTickByTick(dirInput = dirInput, dirOutput = dirInput)
## Make the report (1 at a time)
# TAQ.Report(dirInput = dirOutput, symbol = c("DOG")) ## A scalar symbol
# TAQ.Report(dirInput = dirOutput, symbol = c("GNU")) ## A scalar symbol
## Aggregate
# TAQ.Aggregate(dirInput = dirOutput, symbol = c("DOG", "GNU"), bin = 300,
# useAggregated = TRUE)
## Import data
# dog <- TAQ.Read(dirInput = dirOutput, symbol = "DOG",
# startDate = 00010101, endDate = 20141231, bin = 300)
TAQ Manager
Description
The package manages tick-by-tick transaction data, performing cleaning, aggregation and import.
Details
The package manages tick-by-tick transaction data, performing cleaning, aggregation and import in an efficient and fast way (the package engine is developed in C++).
Cleaning and Aggregation are performed according to Brownlees and Gallo (2006).
| Package: | TAQMNGR |
| Type: | Package |
| Version: | 2015.2-1 |
| Date: | 2015-02-21 |
| License: | GPL (>= 2) |
Note
Currently, the package processes raw data from WRDS (Wharton Research Data Service).
They have to satisfy the following requirements:
all fields have to be included (select the '
Check All' button at the WRDS downloading page);
select the fixed-width text and '
G zip' as output format and compression type, respectively, at the WRDS downloading page.
An example with fake raw data can be downloaded at http://local.disia.unifi.it/cipollini/webpage-new/data/data_sample.txt.gz.
The package uses the following libraries: 'Gzstream' (available at 'http://www.cs.unc.edu/Research/compgeom/gzstream/' under LGPL license), and 'zlib' (freely available at 'http://www.zlib.net/').
Author(s)
Francesco Calvori <francesco.calvori@gmail.com>,
Fabrizio Cipollini <cipollini.fabrizio@gmail.com>,
Giampiero M. Gallo <giampiero.gallo@gmail.com>.
Maintainer: <fabrizio.cipollini@gmail.com>
References
Brownlees, C. T., and Gallo, G. M. (2006). Financial Econometric Analysis at Ultra–High Frequency: Data Handling Concerns, Computational Statistics and Data Analysis 51, 2232–2245.