Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
1
Where to Start? Reference Chase
z Don’t be afraid to read journal papers z Don’t trap into the “Exponential Reference
z DB field is a fast-moving discipline: Chase” problem
z Latest techniques appear in conference/workshop
z More mature work appears in journal Paper to read
z Although longer than conference version, often in queue
easier to read
z Lots of examples, figures, descriptions, …
z Examples:
z ACM TODS, ACM TOIS, VLDB J., IEEE TKDE, ACM TOIT
z ACM Computing Survey, C. ACM, SIGMOD Record, …
Papers that you read
DB Seminar Talk, 2005, Dongwon Lee DB Seminar Talk, 2005, Dongwon Lee
DB Seminar Talk, 2005, Dongwon Lee DB Seminar Talk, 2005, Dongwon Lee
DB Seminar Talk, 2005, Dongwon Lee DB Seminar Talk, 2005, Dongwon Lee
2
Example: XML-Relational
Conversion Problem 3. DELTA Approach
z Arguably easiest…
Schem Cons Query View Trigg Secur Top-K Temp Spatia z Pick one paper of your interest
a traint ers ity oral l
z Read a lot – more than 10 times
XML
Æ
z Find limitations and Extend it by DELTA
Relati O O O O O O z Prove or demonstrate that
onal
(40+) (5+) zThe limitation that you pointed out is valid
Relati
zYour suggestion improved the problem by DELA
onal z The more well-known work you choose, the harder
Æ O O O O to improve, but the better for your reputation…
XML
z Eg, “E.F. Codd’s relational model is insufficient to handle
semi-structured model because…”
z The bigger the DELTA is, the better your paper
gets
DB Seminar Talk, 2005, Dongwon Lee DB Seminar Talk, 2005, Dongwon Lee
4. DROP Approach
Which DELTA to Choose (adopted from J. Widom’s slides)
z Pick the DELTA that is the most significant z Pick a simple but fundamental assumption
z Some criteria are: underlying traditional database systems
z Have practical values z DROP it
z Has motivational scenario as of NOW, or z Reconsider all aspects of data management
z Predicted to be useful in N years and query processing
z Many Ph.D. theses
z Non-trivial
z Prototype from scratch
z Hot topics:
z Streaming, XML, Sensor, …
From http://www-db.stanford.edu/~widom/stream.ppt
DB Seminar Talk, 2005, Dongwon Lee DB Seminar Talk, 2005, Dongwon Lee
3
Example: Two Stanford
Projects Where to Submit?
z The LORE Project z Top-down
z Dropped assumption: z Aim at the best conference in the field
“Data has a fixed schema declared in advance” z If rejected, go to next-tier conference or symposium
z If rejected, go to next…
z Semi-
Semi-structured data (→
(→ XML)
z Bottom-up
z The STREAM Project
z Aim at workshop
z Dropped assumption: z If accepted, work more and aim at better one (symposium
“First load data, then index it, then run queries” or 2nd-tier conference)
z Continuous data streams (+ continuous queries) z After making sure that the ideas mature enough, aim at the
best conference
From http://www-db.stanford.edu/~widom/stream.ppt
DB Seminar Talk, 2005, Dongwon Lee DB Seminar Talk, 2005, Dongwon Lee
4
Good Introduction How to Write an Intro
1. What’s the problem? 1. Start with 5 bullets
2. Why is it important?
z What’s the problem?
z Mention some application, existing problems
3. Why is it difficult? z Why is it interesting?
1. Ask some not-very-obvious questions or explain naïve z …
approach
4. What others did? 2. 1-2 sentence answer to each question
5. What’s my contribution? 3. Add more content
1. Contribution bullet list (paper organization)
6. Build some excitement/surprise 4. Spend enough time on intro
1. Keep reading! You will find something interesting later 1. Bullet points enough
7. Every word should be carefully picked
DB Seminar Talk, 2005, Dongwon Lee DB Seminar Talk, 2005, Dongwon Lee
Importance of Personal
Research Log Start Writing Early On…
z Maintain personal research log z Even if you feel you are NOT ready yet
z Sketch your research ideas into a writing z Your advisor will throw away your initial draft
z Update your ideas as time passes anyway
z Occasionally go back to old writings z Your initial submission will be rejected anyway
z Prepare a short review for each paper that you read z But you get
z Summary
z (good or bad) Experiences and learn from that
z Pros and cons
z Limitations or problems
z Writing sharpens your ideas and gives more ideas
z If needed, contact authors and ask questions z Writing can be improved only via writing
z Usually authors are willing to discuss with their readers
DB Seminar Talk, 2005, Dongwon Lee DB Seminar Talk, 2005, Dongwon Lee
5
Fabrication and Plagiarism dbworld
z “Prominent Physicist Fired for Faking Data • Be a member of dbworld newsgroup
Research: Bell Labs says scientist 'recklessly'
misrepresented work on microprocessors…” (2002, – http://www.cs.wisc.edu/dbworld/
LA Times)
z http://www.latimes.com/news/science/la-sci- – Free membership
physicist26sep26.story – Keep track of DB-related news
z “Constantinos V. Papadopoulos got caught
plagiarism at EUROPAR (1995)… 7 papers
published and 8 under submission… all plagiarized
from Technical Reports…”
z http://www.sics.se/europar95/plagiarism.html