Data Exploration I¶
Collaborative paper discussion¶

Mental Model for Reviewers.
The Data Science Pipeline & Hacker's Demo¶
Hack-time¶
Tip
To load and use a notebook in VS Code, follow steps 3 to 5 in 📘 Notebooks in VS Code.
Working with Data¶
- The coding quest continues. Open the notebook of the day in VS Code:
For next time¶
Everyone, remember:
- You can ask me for help on WhatsApp.
- You can and should all work on each of these tasks together.
- Don't wait until you're the hackers to practise code, nor until you're the writers to get involved in writing, nor until you're the reviewers to give feedback. You should all be involved at every stage.
Writing a scientific paper is not easy. It takes a lot of self-discipline and good communication to make it work. You have been making lots of progress and I'm confident you'll manage it!
Writers¶
- Integrate your main.typ file with the starter-journal-article.
  - I've integrated some initial code into the main.typ file already. You need to update the template.
  - Don't do this alone or only among the writers. Keep the others involved and explain to the reviewers and hackers how they can use a template.
- Add a title and a short abstract (fewer than 250 words) using the template placeholder.
  - In the abstract you need to answer 5 questions:
    - What are you doing?
    - Why does it matter?
    - What have others been doing on the topic?
    - What are you doing that's better?
    - What do you expect to find?
- Fix all missing Typst syntax citations using the @authorYEAR keyword.
  - In the main.typ, especially in the first four paragraphs, we seem to have shifted to manually written citations. This sounds a lot like ChatGPT. Make sure to use the correct Typst syntax.
  - We should use Typst formatting all the time!
  - Also remove any sentences like = Summary Group 1 or anything of the sort.
Deliverable:
- Updated main.typ and compiled PDF shared in the General Chat with the group.
- A 5-minute overall presentation walking through the current version of the paper in the main.typ file.
  - Don't talk about the structure (this is for the reviewers) or what you did in a specific part. We want to discuss the paper itself. What is it about? What are the current problems? How are we going to fix them?
Going the extra mile (Optional):
A figure has been integrated into the main.typ, but if you include a figure in a scientific paper, you should refer to it in the text and explain what it shows.
- Make sure to use the #figure function (see the first example here).
- Add a figure caption.
- Make a reference to that figure (again, see the first example here).
Hackers¶
- Create a notebook to explore relevant variables.
- Produce an .ipynb file that generates descriptive statistics for the quantities of interest (QOIs).
  - You will mostly use df['some_variable'].value_counts().
  - Start with some of the variables listed in this week's hackers notebook: variables = ["cntry", "prtdgcl", "clsprty", "lrscale", "prtclffr"]
  - Integrate other relevant SES variables (e.g. age, sex, education).
  - You can/should also take inspiration from previously published scientific papers.
- You should build your notebook starting from last week's notebook:
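As a rough illustration of the descriptive-statistics step above (this is not last week's notebook), here is a minimal sketch of how value_counts() could be looped over several variables. The DataFrame is a toy stand-in with made-up values, and the helper name describe_categorical is hypothetical; in your own notebook, df would be the survey data you already loaded.

```python
import pandas as pd

# Toy stand-in for the survey data. These values are made up for illustration
# only; replace this with the DataFrame loaded in your own notebook.
df = pd.DataFrame(
    {
        "cntry": ["DE", "DE", "FR", "FR", "FR", "IT"],
        "lrscale": [5, 3, 5, None, 8, 5],
        "clsprty": [1, 2, 1, 1, 2, None],
    }
)

# Subset of the variables named in the task list.
variables = ["cntry", "lrscale", "clsprty"]


def describe_categorical(frame, columns):
    """Hypothetical helper: one frequency table per column, missing values included."""
    return {col: frame[col].value_counts(dropna=False) for col in columns}


counts = describe_categorical(df, variables)
for name, table in counts.items():
    print(f"\n{name}")
    print(table)

# Shares instead of raw counts, sorted by scale value (missing values excluded).
lr_share = df["lrscale"].value_counts(normalize=True).sort_index()
print(lr_share)
```

Passing dropna=False keeps missing answers visible in each table, which is usually worth reporting in a descriptive section. Survey files sometimes store refusals or "don't know" answers as numeric codes rather than as missing values, so it may be worth checking the codebook before interpreting the counts.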
Going the extra mile (Optional):
Can you create a simple figure of the distribution of ideology using the pandas plotting function?
- See pandas.DataFrame.plot.
- Tip: you will have to install matplotlib:
  !pip install matplotlib
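One possible way to approach the figure, sketched under assumptions: ideology is taken to be the lrscale variable, the data below are made-up toy values, and the axis labels and title are placeholders. It uses the pandas plotting interface, which draws with matplotlib behind the scenes.

```python
import pandas as pd

# Toy values for illustration only; use the lrscale column of your own
# DataFrame in the real notebook.
df = pd.DataFrame({"lrscale": [0, 3, 5, 5, 5, 6, 8, 10, None]})

# Drop missing answers, count respondents per scale value, order by value.
lr_counts = df["lrscale"].dropna().astype(int).value_counts().sort_index()

# Series.plot returns a matplotlib Axes object that can be customised further.
ax = lr_counts.plot(kind="bar", title="Left-right self-placement (toy data)")
ax.set_xlabel("lrscale")
ax.set_ylabel("Number of respondents")
```

A bar chart of counts is used here rather than a histogram because the scale takes a small number of discrete values. In a notebook the figure should render below the cell; ax.figure.savefig("some_name.png") would write it to a file that can then be included in the paper.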
Reviewers¶
5-Minute Brief
- Use the “I like, I wish, I wonder” model to give constructive feedback.
- Identify the top 3 current risks in the paper.
- Always refer to the main.typ file.
Going the extra mile (Optional):
- Assist the Writers and Hackers with integrating any feedback received during class!