Jacques Robin's Past Professional Activities (other than teaching)

Research

Advising

Software Development

Technology Transfer

References

Research

After early exposure to research working on expert systems based on temporal logic, rules and frames as a software engineer trainee at IBM and Dassault research centers in France, my research career per-se started at Columbia University, New York where I tackled three key issues in Natural Language Generation (NLG):

(1)     the generation of coordinated text and graphics in multimedia tutorials and data summaries,

(2)     the generation of concise and semantically dense news summaries, by opportunistically fitting background information as short modifying phrases in carefully chosen locations inside a draft focusing only on the key new facts to report, and

(3)     a methodology for knowledge acquisition and quantitative output evaluation of NLG systems based on model text corpus analysis. This work resulted in 12 publications including in one in Artificial Intelligence  and one in Computational Linguistics, in addition to my PhD. thesis (cited in 55 publications indexed on Google Scholar).

Once a post-doctoral researcher and then professor at Universidade Federal de Pernambuco (UFPE) in Brazil, I initially investigated how to integrate NLG with other cutting-edge technologies to provide innovative, user-friendly decision support services. In the first project on this track, NLG was used to summarize insights resulting from mining OLAP data warehouses in short textual reports directly usable by executives for decision making. This work resulted in 5 publications, one PhD. thesis and 4 Master's theses. In the second project on this track, Model-Driven Engineering (MDE) and web services were used to seamlessly integrate NLG with recommendation systems and VXML voice portals to ubiquitously deliver personalized, dynamically generated, voice-browsable, entertainment tips through mobile phones. This work resulted in one PhD. thesis and 3 publications so far.

Over the same period, I also worked on parallel tracks addressing the following issues:

(1)     Improving web search by automatically expanding queries with keyword synonyms from a thesaurus,

(2)     Engineering methodology and CASE tools for agent teams (e.g., in robotic soccer simulations) that leveraged the complementary strengths of object-oriented modeling and logic programming (using UML, OCL and the Flora-2 inference engine based on Transaction Frame Logic [16] [17] [18] [19] [20]), and

(3)     Inductive Object-Oriented Logic Programming to learn Flora-2 programs from training data.  Work on theses tracks resulted in 3 Master's Theses and 4 publications, including a Best Paper Award.

Albeit diverse, all the above decision support systems, web search and multi-agent simulations projects shared two aspects. First, they all attempted to engineer highly innovative, practical IT services by aggregating some form of cutting-edge Automated Reasoning (AR) to technologies from other fields (databases, distributed systems, software engineering). Second, their technology transfer from brittle proof of concepts prototypes to profitable industrial applications all hit one common stumbling block: the AR tools they relied upon were prohibitively hard to learn, use, interoperate, customize, and extend, mainly due to their monolithic software architecture, unfriendly interfaces and non-mainstream execution platforms.

Over the last four years, I thus refocused my research on removing this stumbling block, by elaborating the long-term project ORC2AS (Objects, Rules, Constraints and Components for Agents and Simulations).
It studies the low-cost engineering of easy to learn, use, debug, customize, extend and assemble Java or .Net components that encapsulate the most versatile yet scalable AR techniques. It covers the methodological (artifacts, process) technological (languages, components) and theoretical (formal semantics) axes of such study. Its key idea it to integrate in bi-directional synergy:

a.       Component-Based, Aspect-Oriented MDE leveraging OMG standards and

b.       Object-Oriented Rule-Based Constraint Programming.

Due to the ambitious scope and trans-disciplinary nature of this project, I first focused my efforts on creating an international network of collaborations to carry it out. This network currently aggregates a 25 strong team (9 senior researchers, 8 PhD. students and 8 MSc. students) in 7 institutions: UFPE, Universidade de Pernambuco, Université Pierre et Marie Curie, Universität Mannheim, Universität Ulm, Fraunhofer Gesellschaft, and INRIA. Parts of this project have been funded by CAPES, FACEPE, COFECUB, DAAD and INRIA. It has so far resulted in 8 publications.

Advising

PhD. Theses (sole supervisor)

1.       Franklin Ramalho. MODELOG: Model-Oriented Development with Executable Logic Object Generation. Centro de Informática, Universidade Federal de Pernambuco (CIn-UFPE), Recife, Brazil. 2007.

2.       Hendrik Macedo. A Software Architecture for Ubiquitous Web Browsing with Application to Recommendation Systems. CIn-UFPE, Recife, Brazil. 2006.

3.       Eloi Favero. Generating Hypertext Summaries of Quantitative Data in Multi-Dimensional Databases.
CIn-UFPE,
Recife, Brazil. 2000.

PhD. Thesis (co-supervisor)

1.       Rosely Lacerda. Gramática do Sintagma Nominal Português em um Formalismo Computacional. (A Computational Grammar of the Portuguese Noun Phrase), Departamento de Literatura e Linguística, Universidade Federal de Alagoas, Maceió, AL, Brazil. 2001.

Master's Theses (sole supervisor):

1.       Klebson Silva. Ambiente de Análise e Depuração de Sistemas Multi-Agente: Aplicação ao Desenvolvimento de Time de Futebol de Robô (Analysing and degugging multi-agent systems based on object-oriented logic with application to RoboCup), CIn-UFPE, Recife, Brazil. 2004.

2.       Erivan Andrade. Indução de Programas Lógicos Orientados a Objetos. (Induction of object-oriented logic programs). CIn-UFPE, Recife, Brazil. 2003.

3.       Alexandre Damasceno. Desenvolver Agentes de Futebol com Objetos e Lógica (Developing soccer agents with objects and logic). CIn-UFPE, Recife, Brazil. 2003.

4.       Fabio Pereira. Um Componente de Mineração de Exceções em Cubos OLAP (A component for outlier mining in OLAP databases).  CIn-UFPE, Recife, Brazil. 2003.

5.       Marcelino Pereira. SKDQL: Uma Linguagem Declarativa de Especificação de Tarefas e Consultas de Descoberta de Conhecimento em Banco de Dados e sua Implementação (SKDQL, a declarative language for specifying knowledge discovery queries and tasks and its implementation). CIn-UFPE, Recife, Brazil. 2002.

6.       Franklin Ramalho. Avaliando o Uso de um Tesaurus no Processo de Expansão Automática de Consultas para Engenhos de Busca na Web (Evaluation of thesaurus-based query expansion for web search engines). CIn-UFPE, Recife, Brazil. 2000.

7.       Natasha Queiroz. DOODI: uma API para integracao de bancos de dados multidimensionais e sistemas dedutivos (DOODI: An API to connect OLAP and object-oriented deductive databases). CIn-UFPE, Recife, Brazil. 2000.

8.       Robson Fidalgo. JODI: uma API Java para OLAP (JODI: a Java API to access OLAP databases). CIn-UFPE, Recife, Brazil. 2000.

      Bachelor's Degree Graduation Projects (sole supervisor):

1.       Marcos Aurelio da Silva. CHORD: Constraint Handling Object-oriented Rules with Disjunctive bodies. CIn-UFPE, Recife, Brazil. 2006.

2.       Luiz Lacerda. Interface Visual de Depuração de Time de Futebol de Robô (A GUI for Simulated Robotic Soccer Team Debugging). CIn-UFPE, Recife, Brazil. 2004.

3.       André Novaes. Estudo de Viabilidade de uma Implementação Oracle9i de um SGBD Dedutivo Orientado a Objeto no Formalismo F-Logic (Feasibility of implementing a Frame Logic deductive object-oriented databases in Oracle9i). CIn-UFPE, Recife, Brazil. 2002

Software Development

From 1990 to 1996, I co-developed with Michael Elhadad the open-source, reusable SURGE (Systemic Unification Realization Grammar of English) syntactic realization component for Natural Language Generation (NLG). Written in the high-level, declarative, sorted feature logic language FUF (Functional Unification Facility), SURGE encapsulates a wide coverage of the English grammar that computationally formalizes and integrates the linguistic theories and grammars of Halliday, Quirk at al. and others. It consists of over 7500 lines of FUF code. After the release of version 2.0, it quickly became to most widely adopted syntactic realization tool in the worldwide NLG research community, being cited in about half of all NLG publications over the period 1995-2001. It is now cited in 233 research publications and 10,300 web pages. It was also successfully reused in several industrial systems (cf. technology transfer section). The availability of FUF compilers to Lisp, C and Java make SURGE efficient and available on many platforms.

Technology Transfer

As a research consultant at Bellcore, New Jersey, in 1993-1995, I participated to the successful transfer of the NLG technology to automate production of required business documentation: (a) textual summaries of telecommunication equipment installation plans that justify pricing decisions to regulatory agencies or judges during consumer lawsuits, and (b) textual summaries justifying business reengineering plans to management of consultancy client firms. In the first project, the PLANDoc NLG component was integrated to an expert system generating a base plan and an alternative scenario simulator allowing an engineer to test base plan variations. It automatically generated the textual summary from traces of these two other components. In the second project, the FLOWDoc NLG component was integrated to a workflow modeling tool to acquire the client's current business process and an optimization tool suite to discover better business processes. It automatically generated the textual summary from the workflows of the current and proposed reengineered business processes. These two systems were put in profitable day to day use at Bellcore. PLANDoc generated documents that were used in court as evidence in a million dollar range lawsuit won by Bellcore. FLOWDoc generated documents that convinced several Bellcore clients to adopt the proposed reengineering of their business process.

In 2003, I was hired by CESAR -- a financially self-sustained IT consulting and start-up farm NGO and the largest organization in downtown Recife's IT organization cluster (Porto Digital) -- to head a consulting team which mission was to define the strategic mission, legal status, organizational structure, funding strategies and initial projects of I3 (Instituto de Inovação em Informática), a CESAR spin-off dedicated to connect local universities, companies, government agencies and NGO to carry out economically promising IT innovation projects. I3 is now playing a key role in Porto Digital to foster technology transfer and create profitable synergies in the regional IT ecosystem.

References

[1] http://www.siggen.org/

[2] http://citeseer.ist.psu.edu/robin96empirically.html

[3] http://portal.acm.org/citation.cfm?id=972696&dl=GUIDE

[4] http://citeseer.ist.psu.edu/robin94revisionbased.html

[5] http://www.dwreview.com/OLAP/Introduction_OLAP.html

[6] http://www.planetmde.org/

[7] http://en.wikipedia.org/wiki/Recommendation_system

[8] http://www.w3.org/TR/voicexml20/

[9] http://www.robocup.org/

[10] http://www.omg.org/technology/documents/formal/ocl.htm

[11] http://flora.sourceforge.net/

[12] http://images.google.com/imgres?imgurl=http://www.ir2001.oulu.fi/images/gallery/large/701.jpg&imgrefurl=http://www.ir2001.oulu.fi/gallery/7/701/&h=473&w=712&sz=74&hl=en&start=2&tbnid=pP2n2kLMQqoAQM:&tbnh=93&tbnw=140&prev=/images?q=jacques+robin+IR

[13] http://www.aosd.net/

[14] http://www.omg.org/

[15] http://www.cs.kuleuven.ac.be/~dtai/projects/CHR/

[16] Bonner, A. and Kifer, M. Transaction Logic programming (or, a logic of procedural and declarative knowledge). Technical Report. CSRI-323, Computer Systems Research Institute, University of Toronto, Canada, 1995.

[17] Bonner, A. and Kifer, M. Concurrency and Communication in Transaction Logic. Proceedings of the Joint International Conference and Symposium on Logic Programming. Bonn, 1996. MIT Press.

[18] Kifer, M. Deductive and Object Data Languages: A Quest for Integration. In Lecture Notes in Computer Science 1013. Springer. 1995.

[19] Kifer, M., Lausen, G. and Wu, J . Logical Foundations of Object-Oriented and Frame-Based Languages. Journal of the ACM 42. 1995.

[20] Yang, G. A Model Theory for Nonmonotonic Multiple Value and Code Inheritance in Object-Oriented Knowledge Bases. PhD. Thesis, Computer Science Department, Stony Brook University of New York, 2002.

 

 

 

© Jacques Robin 2007. Last Updated: 06/03/2007