D. Abadi, R. Agrawal, A. Ailamaki, M. Balazinska, P. A. Bernstein et al., The beckman report on database research, Communications of the ACM, vol.59, issue.2, pp.92-9910, 1145.
DOI : 10.1145/2845915

S. Abiteboul, B. André, and D. Kaplan, Managing your digital life, Communications of the ACM, vol.58, issue.5, pp.32-35, 2015.
DOI : 10.1145/2670528

URL : https://hal.archives-ouvertes.fr/hal-01068006

S. Abiteboul, P. Bourhis, and V. Vianu, Comparing workflow specification languages: A matter of views, ACM Trans. Database Syst, vol.37, issue.2, p.10, 2012.
URL : https://hal.archives-ouvertes.fr/inria-00539579

S. Abiteboul, G. Miklau, J. Stoyanovich, and G. Weikum, Data, responsibly (dagstuhl seminar 16291) Dagstuhl Reports, pp.42-71, 2016.

N. Foto, J. D. Afrati, and . Ullman, Optimizing multiway joins in a map-reduce environment, IEEE Trans. Knowl. Data Eng, vol.23, issue.9, pp.1282-1298, 2011.

O. Agarwal, M. Chapelle, J. Dudik, and . Langford, A reliable effective terascale linear learning system, Journal of Machine Learning Research, vol.15, pp.1111-1133, 2014.

U. Akdere, M. Cetintemel, E. Riondato, S. B. Upfal, and . Zdonik, The case for predictive database systems: Opportunities and challenges, Conference on Innovative Data Systems Research (CIDR), pp.167-174, 2011.

A. Amarilli, P. Bourhis, and P. Senellart, Provenance Circuits for Trees and Treelike Instances, International Colloquium on Automata, Languages, and Programming (ICALP), pp.56-68, 2015.
DOI : 10.1007/978-3-662-47666-6_5

URL : https://hal.archives-ouvertes.fr/hal-01178399

J. Ameloot, G. Geck, B. Ketsman, F. Neven, and T. Schwentick, Parallel-correctness and transferability for conjunctive queries, Proceedings of the 34th ACM Symposium on Principles of Database Systems, PODS 2015, pp.47-58, 2015.
DOI : 10.1145/2745754.2745759

URL : http://arxiv.org/pdf/1412.4030

J. Angwin, J. Larson, S. Mattu, and L. Kirchner, URL: https://www.propublica.org/article/ machine-bias-risk-assessments-in-criminal-sentencing, Machine bias. ProPublica, 2016.

M. Aref, T. J. Balder-ten-cate, B. Green, D. Kimelfeld, E. Olteanu et al., Design and Implementation of the LogicBlox System, Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, SIGMOD '15, pp.1371-1382, 2015.
DOI : 10.1007/BF01940876

M. Arenas, P. Barceló, L. Libkin, and F. Murlak, Foundations of Data Exchange, 2014.
DOI : 10.1017/CBO9781139060158

M. Arenas, G. Gottlob, and A. Pieris, Expressive languages for querying the semantic web, Proceedings of the 33rd ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, PODS '14, pp.14-26, 2014.
DOI : 10.1145/2594538.2594555

M. Arenas, B. C. Grau, and E. Kharlamov, Sarunas Marciuska, and Dmitriy Zheleznyakov. Faceted search over RDF-based knowledge graphs, J. Web Sem, vol.37, pp.55-74, 2016.

M. Arenas and F. Maturana, Cristian Riveros, and Domagoj Vrgoc. A framework for annotating CSV-like data, Proceedings of the VLDB Endowment, p.2016

A. Artale, R. Kontchakov, V. Ryzhikov, and M. Zakharyaschev, A Cookbook for Temporal Conceptual Data Modelling with Description Logics, ACM Transactions on Computational Logic, vol.15, issue.3, pp.1-25
DOI : 10.1016/j.ipl.2009.06.005

A. Atserias, M. Grohe, and D. Marx, Size Bounds and Query Plans for Relational Joins, SIAM Journal on Computing, vol.42, issue.4, pp.1737-1767, 2013.
DOI : 10.1137/110859440

URL : http://arxiv.org/pdf/1711.03860

J. Baget, M. Leclère, M. Mugnier, and E. Salvat, On rules with existential variables: Walking the decidability line, Artificial Intelligence, vol.175, issue.9-10, pp.9-101620, 2011.
DOI : 10.1016/j.artint.2011.03.002

URL : https://hal.archives-ouvertes.fr/lirmm-00587012

E. Balan, T. Milo, and T. Sterenzy, BP-Ex, Proceedings of the 13th International Conference on Extending Database Technology, EDBT '10, pp.713-716, 2010.
DOI : 10.1145/1739041.1739134

V. Bárány, B. Balder-ten-cate, D. Kimelfeld, Z. Olteanu, and . Vagena, Declarative probabilistic programming with datalog, International Conference on Database Theory (ICDT), volume 48 of LIPIcs, pp.1-7, 2016.

S. Barocas and A. D. Selbst, Big Data's Disparate Impact, SSRN Electronic Journal, vol.104, 2016.
DOI : 10.2139/ssrn.2477899

P. Beame, D. Koutris, and . Suciu, Communication steps for parallel query processing, Symposium on Principles of Database Systems (PODS), pp.273-284, 2013.
DOI : 10.1145/3125644

URL : http://dl.acm.org/ft_gateway.cfm?id=3125644&type=pdf

M. Benedikt, W. Fan, and F. Geerts, XPath satisfiability in the presence of DTDs, J. ACM, vol.55, issue.2, 2008.

Y. Bengio, A. C. Courville, and P. Vincent, Representation Learning: A Review and New Perspectives, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.35, issue.8, pp.1798-1828, 2013.
DOI : 10.1109/TPAMI.2013.50

URL : http://www.cs.princeton.edu/courses/archive/spring13/cos598C/Representation Learning - A Review and New Perspectives.pdf

L. Bertossi, Database Repairing and Consistent Query Answering, Synthesis Lectures on Data Management, vol.3, issue.5, 2011.
DOI : 10.1016/j.ipl.2010.07.021

J. Bex, F. Neven, T. Schwentick, and S. Vansummeren, Inference of concise regular expressions and DTDs, ACM Transactions on Database Systems, vol.35, issue.2, p.2010
DOI : 10.1145/1735886.1735890

C. E. Bhattacharya, R. Gerede, R. Hull, J. Liu, and . Su, Towards Formal Analysis of Artifact-Centric Business Process Models, International Conference on Business Process Management (BPM), pp.288-304, 2007.
DOI : 10.1007/978-3-540-75183-0_21

M. Bienvenu, C. Balder-ten-cate, F. Lutz, and . Wolter, Ontology-based data access: A study through Disjunctive Datalog, CSP, and MMSNP, ACM Trans. Database Syst, vol.3933, issue.4, pp.1-33
URL : https://hal.archives-ouvertes.fr/hal-01117583

S. Borgwardt, F. Distel, and R. Peñaloza, The limits of decidability in fuzzy description logics with general concept inclusions, Artificial Intelligence, vol.218, pp.23-55
DOI : 10.1016/j.artint.2014.09.001

M. J. Cafarella, D. Suciu, and O. Etzioni, Navigating extracted data with schema discovery, International Workshop on the Web and Databases (WebDB), 2007.

Z. Cai, L. L. Vagena, S. Perez, P. J. Arumugam, C. M. Haas et al., Simulation of database-valued markov chains using SimSQL, Proceedings of the 2013 international conference on Management of data, SIGMOD '13, pp.637-648, 2013.
DOI : 10.1145/2463676.2465283

D. Calvanese, G. D. Giacomo, D. Lembo, M. Lenzerini, and R. Rosati, Tractable Reasoning and Efficient Query Answering in Description Logics: The DL-Lite Family, Journal of Automated Reasoning, vol.104, issue.1,2, pp.385-429, 2007.
DOI : 10.1007/s10817-007-9078-x

D. Calvanese, G. D. Giacomo, and M. Lenzerini, Conjunctive query containment and answering under description logic constraints, ACM Transactions on Computational Logic, vol.9, issue.3, pp.22-23, 2008.
DOI : 10.1145/1352582.1352590

URL : http://www.dis.uniroma1.it/~degiacom/papers/2008/calv-degi-lenz-TOCL-2008.pdf

D. Calvanese, G. D. Giacomo, and M. Montali, Foundations of data-aware process analysis, Proceedings of the 32nd symposium on Principles of database systems, PODS '13, pp.1-12, 2013.
DOI : 10.1145/2463664.2467796

S. Cebiric, F. Goasdoué, and I. Manolescu, Query-oriented summarization of RDF graphs, Proceedings of the VLDB Endowment, pp.2012-2015, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01178140

S. Chu, M. Balazinska, and D. Suciu, From Theory to Practice, Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, SIGMOD '15, pp.63-78, 2015.
DOI : 10.1145/2501928.2501929

N. 38-moustapha-cissé, T. Usunier, P. Artieres, and . Gallinari, Robust Bloom filters for large multilabel classification tasks, Advances in Neural Information Processing Systems (NIPS), 2013.

F. Codd, Understanding relations (installment #7), FDT -Bulletin of ACM SIGMOD, vol.7, issue.3, pp.23-28, 1975.

W. Czerwinski, W. Martens, P. Parys, and M. Przybylko, The (Almost) Complete Guide to Tree Pattern Containment, Proceedings of the 34th ACM Symposium on Principles of Database Systems, PODS '15, pp.117-130, 2015.
DOI : 10.14778/2212351.2212355

C. J. Date, Database in Depth ? Relational Theory for Practitioners Research directions for Principles of Data Management 42 Amit Datta, Michael Carl Tschantz, and Anupam Datta Automated experiments on ad privacy settings, PoPETs, vol.2015, issue.1, pp.92-112, 2005.

B. Susan, J. Davidson, and . Freire, Provenance and scientific workflows: Challenges and opportunities, International Conference on Management of Data (SIGMOD), pp.1345-1350, 2008.

U. Dayal, M. Castellanos, A. Simitsis, and K. Wilkinson, Data integration flows for business intelligence, Proceedings of the 12th International Conference on Extending Database Technology Advances in Database Technology, EDBT '09, pp.1-11, 2009.
DOI : 10.1145/1516360.1516362

URL : http://www.edbt.org/Proceedings/2009-StPetersburg/edbt/papers/p0001-Dayal.pdf

W. Dembczynski, E. Cheng, and . Hüllermeier, Bayes optimal multilabel classification via probabilistic classifier chains, International Conference on Machine Learning (ICML), pp.279-286, 2010.

D. Deutch and T. Milo, A quest for beauty and wealth (or, business processes for database researchers), Proceedings of the 30th symposium on Principles of database systems of data, PODS '11, pp.1-12, 2011.
DOI : 10.1145/1989284.1989286

A. Deutsch, R. Hull, and V. Vianu, Automatic Verification of Database-Centric Systems, ACM SIGMOD Record, vol.43, issue.3, pp.5-17, 2014.
DOI : 10.1007/978-3-642-19345-3

P. 48-zlatan-dragisic, E. Lambrix, and . Blomqvist, Integrating ontology debugging and matching into the eXtreme design methodology, Workshop on Ontology and Semantic Web Patterns (WOP), volume 1461 of CEUR Workshop Proceedings, 2015. URL: http: //ceur-ws.org

M. Drosou and E. Pitoura, DisC diversity, Proceedings of the VLDB Endowment, pp.13-24, 2012.
DOI : 10.14778/2428536.2428538

C. Dwork, M. Hardt, T. Pitassi, O. Reingold, and R. S. Zemel, Fairness through awareness, Proceedings of the 3rd Innovations in Theoretical Computer Science Conference on, ITCS '12, pp.214-226, 2012.
DOI : 10.1145/2090236.2090255

URL : http://arxiv.org/pdf/1104.3913.pdf

T. Eiter, T. Lukasiewicz, and L. Predoiu, Generalized consistent query answering under existential rules, International Conference on Principles of Knowledge Representation and Reasoning (KR), pp.359-368, 2016.

C. Elsenbroich, O. Kutz, and U. Sattler, A case for abductive reasoning over ontologies, International Workshop on OWL (OWLED), volume 216 of CEUR Workshop Proceedings, 2006.

R. Fagin, B. Kimelfeld, F. Reiss, and S. Vansummeren, Document Spanners, Journal of the ACM, vol.62, issue.2, p.12, 2015.
DOI : 10.1007/978-3-642-59136-5_2

J. Feldman, S. Muthukrishnan, A. Sidiropoulos, C. Stein, and Z. Svitkina, On distributing symmetric streaming computations, Symposium on Discrete Algorithms (SODA), pp.710-719, 2008.
DOI : 10.1145/1824777.1824786

URL : http://www.mit.edu/~tasos/papers/mud_soda2008.pdf

E. Franconi, P. Guagliardo, M. Trevisan, and S. Tessaris, Quelo: an ontology-driven query interface, Workshop on Description Logics (DL), volume 745 of CEUR Workshop Proceedings, 2011.

N. D. Goodman, The principles and practice of probabilistic programming, Symposium on Principles of Programming Languages (POPL), pp.399-402, 2013.

G. Gottlob, S. Kikot, R. Kontchakov, V. V. Podolskii, T. Schwentick et al., The price of query rewriting in ontology-based data access, Artificial Intelligence, vol.213, pp.42-59, 2014.
DOI : 10.1016/j.artint.2014.04.004

G. Gottlob, C. Koch, and R. Pichler, Efficient algorithms for processing XPath queries, ACM Transactions on Database Systems, vol.30, issue.2, pp.444-491, 2005.
DOI : 10.1145/1071610.1071614

URL : http://www.cs.cornell.edu/~koch/download/vldb2002-post.pdf

G. Gottlob, G. Orsi, and A. Pieris, Query Rewriting and Optimization for Ontological Databases, ACM Transactions on Database Systems, vol.39, issue.3, pp.1-25
DOI : 10.1007/s13740-012-0017-6

URL : http://arxiv.org/pdf/1405.2848

G. Gottlob and P. Senellart, Schema mapping discovery from data instances, Journal of the ACM, vol.57, issue.2
DOI : 10.1145/1667053.1667055

URL : https://hal.archives-ouvertes.fr/inria-00537238

B. Groz, T. Milo, and S. Roy, On the complexity of evaluating order queries with the crowd, IEEE Data Eng. Bull, vol.38, issue.3, pp.44-58, 2015.

J. Peter, J. M. Haas, and . Hellerstein, Ripple joins for online aggregation, International Conference on Management of Data (SIGMOD), pp.287-298, 1999.

J. M. Hellerstein, P. J. Haas, and H. J. Wang, Online aggregation, International Conference on Management of Data (SIGMOD), pp.171-182, 1997.

M. Hepp, The Web of Data for E-Commerce: Schema.org and GoodRelations for Researchers and Practitioners, International Conference on Web Engineering (ICWE), pp.723-727, 2015.
DOI : 10.1007/978-3-319-19890-3_66

X. Hu and K. Yi, Towards a Worst-Case I/O-Optimal Algorithm for Acyclic Joins, Proceedings of the 35th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems, PODS '16, 2016.
DOI : 10.1145/2594538.2594552

T. Imielinski and W. Lipski, Incomplete Information in Relational Databases, Journal of the ACM, vol.31, issue.4, pp.761-791, 1984.
DOI : 10.1145/1634.1886

K. Jasinska, K. Dembczynski, R. Busa-fekete, K. Pfannschmidt, T. Klerx et al., Extreme F-measure maximization using sparse probability estimates, International Conference on Machine Learning (ICML). JMLR.org, 2016.

A. Kumar, J. , and D. Suciu, Probabilistic databases with MarkoViews, Proceedings of the VLDB Endowment, pp.1160-1171, 2012.

M. Kaminski and E. V. Kostylev, Beyond well-designed SPARQL, International Conference on Database Theory (ICDT), volume 48 of LIPIcs, pp.1-5, 2016.

A. Kandel, J. M. Paepcke, J. Hellerstein, and . Heer, Enterprise Data Analysis and Visualization: An Interview Study, IEEE Transactions on Visualization and Computer Graphics, vol.18, issue.12, pp.2917-2926, 2012.
DOI : 10.1109/TVCG.2012.219

URL : http://vis.stanford.edu/files/2012-EnterpriseAnalysisInterviews-VAST.pdf

P. 72-paraschos-koutris, D. Beame, and . Suciu, Worst-case optimal algorithms for parallel query processing, International Conference on Database Theory (ICDT), volume 48 of LIPIcs, pp.1-8, 2016.

D. Lembo, J. Mora, R. Rosati, D. F. Savo, and E. Thorstensen, Mapping Analysis in Ontology-Based Data Access: Algorithms and Complexity, International Semantic Web Conference (ISWC), pp.217-234
DOI : 10.1007/978-3-642-41335-3_35

M. Lenzerini, Data integration, Proceedings of the twenty-first ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems , PODS '02, pp.233-246, 2002.
DOI : 10.1145/543613.543644

J. Lerman, Big Data and Its Exclusions, SSRN Electronic Journal, vol.66, 2013.
DOI : 10.2139/ssrn.2293765

F. Li, B. Wu, K. Yi, and Z. Zhao, Wander Join, Proceedings of the 2016 International Conference on Management of Data, SIGMOD '16, pp.615-629, 2016.
DOI : 10.1145/2588555.2588579

. Libkin, SQL???s Three-Valued Logic and Certain Answers, ACM Transactions on Database Systems, vol.41, issue.1, 2016.
DOI : 10.1145/126482.126487

L. Libkin, Certain answers as objects and knowledge, Artificial Intelligence, vol.232, pp.1-19, 2016.
DOI : 10.1016/j.artint.2015.11.004

R. Liu, R. Vaculín, Z. Shan, A. Nigam, and F. Y. Wu, Business artifactcentric modeling for real-time performance monitoring, International Conference on Business Process Management (BPM), pp.265-280, 2011.

R. Marin, R. Hull, and . Vaculín, Data Centric BPM and the Emerging Case Management Standard: A Short Survey, Business Process Management Workshops, pp.24-30, 2012.
DOI : 10.1007/978-3-642-36285-9_4

W. Martens, F. Neven, and S. Vansummeren, SCULPT, Proceedings of the 24th International Conference on World Wide Web, WWW '15, pp.702-720, 2015.
DOI : 10.14778/2002938.2002939

Z. Moffitt, J. Stoyanovich, S. Abiteboul, and G. Miklau, Collaborative access control in WebdamLog, International Conference on Management of Data (SIGMOD), pp.197-211, 2015.

D. Morales and A. Bifet, SAMOA: Scalable advanced massive online analysis, Journal of Machine Learning Research, vol.16, pp.149-153, 2015.

C. Muñoz and M. Smith, Big data: A report on algorithmic systems, opportunity, and civil rights Executive Office of the President, The White House URL: https://www.whitehouse.gov/sites, 2016.

H. Q. Ngo, E. Porat, C. Ré, and A. Rudra, Worst-case optimal join algorithms, Proceedings of the 31st symposium on Principles of Database Systems, PODS '12, pp.37-48, 2012.
DOI : 10.1145/2213556.2213565

URL : http://www.cse.buffalo.edu/%7Ehungngo/papers/paper49.Ngo.pdf

N. Ngo, M. Ortiz, and M. Simkus, Closed predicates in description logics: Results on combined complexity, International Conference on the Principles of Knowledge Representation and Reasoning (KR), pp.237-246, 2016.

N. S. Nigam and . Caswell, Business artifacts: An approach to operational specification, IBM Systems Journal, vol.42, issue.3, 2003.
DOI : 10.1147/sj.423.0428

D. Olteanu and J. Závodný, Size Bounds for Factorised Representations of Query Results, ACM Transactions on Database Systems, vol.40, issue.1, pp.10-1145, 2015.
DOI : 10.14778/2536222.2536232

F. Pezoa, J. L. Reutter, and F. Suarez, Martín Ugarte, and Domagoj Vrgoc. Foundations of JSON schema, International Conference on World Wide Web (WWW), pp.263-273, 2016.

A. Poggi, D. Lembo, D. Calvanese, G. D. Giacomo, M. Lenzerini et al., Linking Data to Ontologies, J. on Data Semantics, pp.133-173, 2008.
DOI : 10.1007/978-3-540-77688-8_5

URL : http://www.dis.uniroma1.it/~degiacom/papers/2008/JODS08.pdf

Y. Prabhu and M. Varma, FastXML, Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining, KDD '14, pp.263-272, 2014.
DOI : 10.1145/2623330.2623651

M. Riondato, M. Akdere, U. Cetintemel, S. B. Zdonik, and E. Upfal, The VC-Dimension of SQL Queries and Selectivity Estimation through Sampling, European Conference on Machine Learning and Knowledge Discovery in Databases (ECML/PKDD), pp.661-676, 2011.
DOI : 10.1145/375663.375724

R. Salakhutdinov and G. E. Hinton, Semantic hashing, International Journal of Approximate Reasoning, vol.50, issue.7, pp.969-978, 2009.
DOI : 10.1016/j.ijar.2008.11.006

URL : https://doi.org/10.1016/j.ijar.2008.11.006

A. Das-sarma, A. G. Parameswaran, and J. Widom, Towards Globally Optimal Crowdsourcing Quality Management, Proceedings of the 2016 International Conference on Management of Data, SIGMOD '16, pp.47-62, 2016.
DOI : 10.1145/1559845.1559870

M. Schleich, D. Olteanu, and R. Ciucanu, Learning Linear Regression Models over Factorized Joins, Proceedings of the 2016 International Conference on Management of Data, SIGMOD '16, pp.3-18
DOI : 10.14778/2809974.2809991

URL : https://hal.archives-ouvertes.fr/hal-01330113

A. Schölkopf and . Smola, Learning with Kernels: Support Vector Machines, Regularization , Optimization, and Beyond, 2001.

W. Shen, A. Doan, J. F. Naughton, and R. Ramakrishnan, Declarative information extraction using datalog with embedded extraction predicates, International Conference on Very Large Data Bases (VLDB), pp.1033-1044, 2007.

J. Shin, S. Wu, F. Wang, C. D. Sa, C. Zhang et al., Incremental knowledge base construction using deepdive URL: http://www.vldb.org/pvldb/vol8/p1310-shin.pdf. 100 Slawek Staworko Complexity and expressiveness of shex for RDF, Proceedings of the VLDB Endowment International Conference on Database Theory (ICDT), volume 31 of LIPIcs Schloss Dagstuhl ? LZI, pp.1310-1321, 2015.

S. Staworko, J. Chomicki, and J. Marcinkowski, Prioritized repairing and consistent query answering in relational databases, Annals of Mathematics and Artificial Intelligence, vol.30, issue.3, pp.209-246, 2012.
DOI : 10.1145/1093382.1093385

URL : https://hal.archives-ouvertes.fr/hal-00643104

J. Stoyanovich, S. Abiteboul, and G. Miklau, Data responsibly: Fairness, neutrality and transparency in data analysis, International Conference on Extending Database Technology (EDBT), pp.718-719, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01290695

D. Suciu, D. Olteanu, C. Ré, and C. Koch, Probabilistic Databases. Synthesis Lectures on Data Management, 2011.
DOI : 10.1145/1388240.1388260

J. Sun, J. Su, and . Yang, Universal Artifacts, ACM Transactions on Management Information Systems, vol.7, issue.1, p.2016
DOI : 10.1007/11914853_10

L. Sweeney, Discrimination in online ad delivery, Communications of the ACM, vol.56, issue.5, pp.44-54
DOI : 10.1145/2447976.2447990

V. Balder-ten-cate, P. G. Dalmau, and . Kolaitis, Learning schema mappings, ACM Trans. Database Syst, vol.38, issue.4, p.28, 2013.

F. Tschorsch and B. Scheuermann, Bitcoin and Beyond: A Technical Survey on Decentralized Digital Currencies, IEEE Communications Surveys & Tutorials, vol.18, issue.3, 2015.
DOI : 10.1109/COMST.2016.2535718

G. Leslie and . Valiant, A bridging model for parallel computation 110 LG. Valiant. A theory of the learnable, Commun. ACM Commun. ACM, vol.33, issue.1711 111, pp.103-1111134, 1984.

L. Todd and . Veldhuizen, Triejoin: A simple, worst-case optimal join algorithm, International Conference on Database Theory (ICDT), pp.96-106, 2014.

Q. Weinberger, A. Dasgupta, J. Langford, A. Smola, and J. Attenberg, Feature hashing for large scale multitask learning, Proceedings of the 26th Annual International Conference on Machine Learning, ICML '09, pp.1113-1120, 2009.
DOI : 10.1145/1553374.1553516

URL : http://arxiv.org/pdf/0902.2206