Hi! What can I help you with?
Q: How can I donate to support The New Atlantis?
A: You can donate to The New Atlantis by choosing an amount ($50, $100, $250) and reading a note from the editor on what it takes to sustain the work.
Q: How can I subscribe to The New Atlantis?
A: You can subscribe to The New Atlantis for early access to new articles and subscriber-only content. Options include print + digital for $34 or digital for $24. You can also renew an existing subscription.
Q: What is reinforcement learning from AI feedback (RLAIF)?
A: RLAIF, also known as "scalable oversight," involves using AI models themselves to provide feedback for training, rather than relying on human preferences. This approach is cheaper and potentially more effective.
Q: What is reinforcement learning from human feedback (RLHF)?
A: RLHF involves collecting human preferences for how a model should respond to prompts. This preference data is used to train a separate neural network called a reward model, which grades the language model’s outputs with a predicted “human satisfaction” score.
Q: What is Constitutional AI?
A: Constitutional AI is an approach by Anthropic that embeds human preferences in a set of written principles, the constitution. The AI model generates responses to prompts, critiques, and revises them based on these principles, and uses this process to further train itself.