December was a busy month reading all sorts of stuff!
#ai #benchmark
-
2024-12-05 - LiveBench
A platform for contamination-free benchmarking of large language models. -
2024-12-05 - LiveBench/LiveBench
Repository of LiveBench, offering resources for benchmarking LLMs. -
2024-12-20 - ARC Prize - What is ARC-AGI?
Explores benchmarks for testing AGI potential with ARC-AGI metrics. -
2024-12-20 - FrontierMath | Epoch AI
Details AI benchmarks focused on mathematical problem-solving capabilities. -
2024-12-20 - GPQA: The “Diamond Standard” for AI
A newsletter discussing high-standard benchmarks for advanced AI systems. -
2024-12-20 - Problemset - Codeforces
A collection of coding problems used for AI benchmarking in competitive programming. -
2024-12-20 - SWE-bench
A benchmark suite focused on evaluating software engineering capabilities in AI systems. -
2024-12-20 - [2311.12022] GPQA: A Graduate-Level Google-Proof Q&A Benchmark
Academic paper introducing a rigorous Q&A benchmark for AI.
#ai #applications
-
2024-12-22 - GitHub - curiousily/AI-Bootcamp
Self-paced tutorials on ML fundamentals and generative AI technologies. -
2024-12-21 - GitHub - virattt/ai-hedge-fund
Repository for building AI systems to operate hedge funds. -
2024-12-24 - GitHub - anti-work/shortest
QA platform leveraging natural language AI for automated testing. -
2024-12-24 - pipecat/examples/foundational/22d-natural-conversation-gemini-audio.py
Script for natural language conversation with Gemini AI via audio interface.
#code
-
2024-12-12 - GitHub - elisspace/weatherdisplay
Create an e-ink display for weather and tide tracking using a Raspberry Pi. -
2024-12-20 - ObjectDetect/detect.py at main
Python code for object detection implementations and examples. -
2024-12-26 - pipecat/examples/foundational/22d-natural-conversation-gemini-audio.py
Python script illustrating foundational natural conversation workflows.
#invest
-
2024-12-06 - Bilbel Capital - Media
Insights into Bilbel Capital’s investing strategies and media resources. -
2024-12-13 - Prosus
Information on global investment opportunities through Prosus.
#product
-
2024-12-11 - Big Ideas in Tech for 2025 | Andreessen Horowitz
Predictions for groundbreaking trends in the tech industry for 2025. -
2024-12-13 - Founder-Style Leadership - Silicon Valley Product Group
Discusses the unique leadership style embraced by startup founders.
#news
-
2024-12-10 - Ongoing Phishing and Malware Campaigns in December 2024
Summary of major cyber threats and campaigns reported in December. -
2024-12-11 - Prevent factual errors from LLM hallucinations
AWS introduces tools to minimize factual errors in AI outputs.