Skip to content
View yasumorishima's full-sized avatar

Block or report yasumorishima

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
yasumorishima/README.md

Yasunori Morishima (盛島康徳)

Manufacturing Engineer & Data Analyst with 18 years of experience, specializing in data analysis, open source contribution, and business automation. (製造業にて18年の経験を持つエンジニア。データ分析・OSS貢献・業務自動化を専門としています)

In Production: Minami Baseball OB — Full-stack alumni site with 10 active users (38 pages · 22 DB tables · 130 e2e tests · ¥0/mo running cost) | Docs

Findy Skill Deviation 2026-04-18時点: 69

🌍 Realtime Open Data

Japan Geohazard Monitor Persian Gulf Ship Tracker
31 geophysical data sources → ML earthquake prediction (AUC 0.754, CSEP Molchan 0.981) + real-time monitoring dashboard AIS vessel tracking across the Persian Gulf & Gulf of Oman with land mask filtering

Real-time API / WebSocket → SQLite → FastAPI + Leaflet.js (dark theme)All projects

Japan Geohazard Monitor — Earthquake prediction research: 76 features from 25+ sources (USGS, Earthdata, INTERMAGNET, NMDB, NOAA, IOC), walk-forward HistGBT + stacking, BigQuery data platform (216K rows). Weekly automated CI pipeline on GitHub Actions.


⚾ Baseball Analytics

Prediction Systems

Project Description Demo
NPB Season Prediction Bayesian ensemble (Marcel 35% + Stan/Ridge 40% + ML 25%) + Monte Carlo team simulation + 24 foreign player individual projections Live
NPB 2021 Backtest Could Bayesian model predict Yakult & Orix last→champion? 25 foreign players with FanGraphs data Analysis
MLB Win Probability Engine 3-engine ensemble WP (Normal + Empirical + LightGBM) + Gemini AI commentary Live
Baseball MLOps Pipeline Statcast × GCP MLOps: 5-model ensemble + BQML, weekly auto-retrained on BigQuery + Cloud Run Live
MLB Data Pipeline Shared BigQuery data platform — FanGraphs + Savant + Statcast for all baseball analytics projects
Prediction accuracy & details
System Key Metric Articles
NPB 2026 8-yr backtest wOBA MAE .0498, 97% prob. of beating Marcel. 10K Monte Carlo sims JP / EN
NPB 2021 Backtest MAE 10.7W — Yakult & Orix last→champion driven by JP player breakouts, not foreign players Repo
MLB WP Engine 3-engine ensemble, 367K+ play states (2015–2024), inverse-Brier weighted + Isotonic calibration Live
Baseball MLOps Batter wOBA MAE .0287 (Marcel: .0326) / Pitcher xFIP MAE 0.483 (Marcel: 0.558) Live

Biomechanics

Baseball Skeleton Analysis — 3D skeleton visualization from Driveline OpenBiomechanics C3D data

Pitching Skeleton (3D C3D) Hitting Skeleton (3D C3D)

Trunk rotation range vs pitch speed: r=0.425 (strongest). Contributed bug fix PR #384 to ezc3d. Article (JP) / Article (EN)

Apps (in development)

Amateur Baseball Scorebook OCR — Photo-to-stats pipeline for amateur team scorebooks. Moonshot: Claude Opus 4.7 Vision reading specialized Japanese scorebook notation (4-3, F7, diamond shading for basepath progression) end-to-end.

Statcast Analysis

6 analyses covering Japanese MLB pitchers and Ohtani batting data.
All analyses (6)
Analysis Key Finding Article
Kikuchi Slider Revolution (2019-2025) SL 17%→37% after Astros trade Zenn / DEV.to / Kaggle
Senga Ghost Fork (2023-2025) FO whiff rate 58%→39%, decline pre-injury Zenn / DEV.to / Kaggle
Imanaga 2nd Year (2024-2025) 3-pitch concentration (97%), 1st TTO xwOBA .505 Zenn / DEV.to / Kaggle
Darvish Evolution (2021-2025) SL/ST halved, CU became putaway pitch Zenn / DEV.to / Kaggle
Ohtani Spray Chart spraychart() one-liner vs matplotlib manual Zenn
Ohtani Heatmap Stadium drawing + hit density heatmap Zenn

🌐 Open Source Contributions

(64 PRs / 32 Merged) across 28 repositories. See [oss-contributions](https://github.com/yasumorishima/oss-contributions) for full details.
PR highlights (click to expand)
Repository PR Description
dfinity/icp-js-core #1270 Improve Candid decode error messages
dfinity/icp-js-core #1277 Deduplicate parallel fetchSubnetKeys
dfinity/pic-js #235 Add fetchCanisterLogs() method
line/line-bot-mcp-server #369 Add get_follower_ids tool
pyomeca/ezc3d #384 Fix __eq__ early return bug
optuna/optuna Hyperparameter optimization framework
pandas-dev/pandas Data analysis library
jldbc/pybaseball #498-504 Bug fixes & documentation
team-mirai — Civic Tech OSS (21 PRs (11 Merged / 2 Open / 8 Closed))

Civic tech projects for political transparency & citizen participation in Japan. Next.js / TypeScript / Supabase / Vitest.

Repository Highlights
action-board 48 unit tests, RPC tests, breadcrumb nav, cache fix + 5 more
marumie Category filter total
post-checker Timezone fix
fact-checker X API investigation + 5 PRs

📊 Data & Competitions

Kaggle

Notebooks Expert | 🥉 14 Bronze Notebook Medals

Active: Deep Past (Akkadian→English) / BirdCLEF+ 2026 (Bird Sound Recognition)

Bronze Medal Notebooks (14)
Notebook Topic
savant-extras Defense & Pitching Quality Defense metrics & pitching quality (savant-extras)
MLB Statcast Spray Charts for WBC 2026 WBC 2026 spray + pitch zone charts (baseball-field-viz)
March Machine Learning Mania 2026 NCAA tournament prediction (LightGBM)
NFL Geometric Rules Baseline Physics-based rules, No ML, RMSE 2.921
CAFA 6 Baseline Protein function prediction (PyTorch MLP)

All 14 notebooks →

Kaggle Datasets

10 published MLB datasets
Dataset Description
🥈 MLB Bat Tracking Leaderboard (2024-2025) 452 batters, 19 swing metrics
🥈 WBC 2026 Scouting 306 players, 20 countries
Other datasets (4)
Dataset Description
Baseball Savant Leaderboards (2024-2025) 15 leaderboards, 2 seasons combined
Japanese MLB Players Statcast (2015-2025) 34 Japanese MLB players, 174k pitches+hits
MLB Pitcher Arsenal Evolution (2020-2025) 4,253 pitcher-seasons, 111 metrics
MLB Statcast + Bat Tracking (2024-2025) Combined Statcast + bat tracking data

DrivenData

DrivenData Competitions — Automated pipeline: GitHub Actions + GPU training + GPU→CPU fallback. Currently competing in On Top of Pasketti (Children's ASR, $120K prize, Wav2Vec2 CTC).


🌐 Web & Apps

Minami Baseball OB — Alumni Association Site

Full-stack web app for a high school baseball alumni association (135 files, ~15900 LOC). Technical Documentation

PC (Light) Game Results

5-tier RBAC (Middleware + RLS), automated member pipeline (Form → GAS → Actions → Supabase), 681 match records (1955–present)

Architecture & features
  • 5-tier RBAC (guest → admin): Next.js Middleware + Supabase RLS — authorization at route, row, and component level
  • Automated member pipeline: Google Form → Apps Script → GitHub Actions auto-PR → Supabase role sync. Personal names never touch Git
  • Custom CMS: 8 editor pages + 5 inline edit, soft delete (7-day trash + auto-purge), change history, audit logs
  • 681 match records (1955–present): cross-source verification, generation-based grouping, per-game photo management
  • UX: Unsaved warning, Web Share + LINE fallback, Calendar registration, ripple feedback, Suspense skeleton UI, weather forecast (Open-Meteo, 10 venues), automated game detection (2 sources → auto-PR)
  • Security: RLS on all 19 tables, server-only admin, CODEOWNERS, branch protection, secret scanning, cookie consent, 60-min session timeout

Next.js 15 / TypeScript 5.8 / Tailwind CSS 4 / Supabase (PostgreSQL + Auth + Storage) / Vercel / GitHub Actions / Google Apps Script / GA4

Dashboards & Mobile

App Description Link
MLB Bat Tracking Dashboard Leaderboard, Player Comparison, Team Lineup Builder. Powered by savant-extras Live
WBC 2026 Scouting Dashboard 30 Statcast apps across 19 countries. Zone heatmaps, spray charts, pitch movement Live
Daily Diary Flutter mobile app, 5 languages, offline-first, AdMob Google Play
WBC 2026 Scouting Dashboard details (30 apps)

30 Statcast scouting apps across 19 countries (batters + pitchers). Zone heatmaps, spray charts, pitch movement, LHP/RHP splits. Auto-fetched via GitHub Actions. → USA Batters / Japan Pitchers / All 30 apps


📦 PyPI Packages

6 packages (click to expand)
Package Description
savant-extras 17 Baseball Savant leaderboards + date range support. Complements pybaseball
baseball-field-viz Statcast coordinate transform + field drawing + spray charts + pitch zone charts
kaggle-notebook-deploy Deploy Kaggle Notebooks via git push + GitHub Actions
kaggle-wandb-sync Sync W&B offline runs from Kaggle to W&B cloud
signate-deploy SIGNATE competition workflow via GitHub Actions
signate-wandb-sync Record SIGNATE scores to W&B runs

🔬 Learning Projects

Project Description
ICP Learning Project Persistent counter dApp on Internet Computer (Motoko, dfx CLI)
OpenClaw Twitter Bot Raspberry Pi 5 + OpenClaw + Gemini API auto-tweet bot (stopped) — Article (JP)
Past Projects
Project Description
GAS Calendar Tool Batch calendar event registration with senior-friendly mobile UI
Dune Analytics On-chain data analysis — JPYC Stablecoin Dashboard
selenium-to-playwright 🔒 Playwright browser automation: 20+ scripts (樂天/docomo/Yahoo/LINE/ICP) + night batch runner with auto GitHub Issues
Archived Projects Selenium automation, business workflow tools, etc.

🛠️ Tech Stack

Category Technologies
Data Analysis & ML Python, pandas, scikit-learn, LightGBM, XGBoost, CatBoost, PyTorch, matplotlib, seaborn, DuckDB, W&B
Data Platform Google BigQuery (8 datasets, 96+ tables — baseball, geohazard, ship tracking), BigQuery ML, Cloud Run, Grafana
Data Sources Baseball Savant (Statcast), pybaseball, USGS, NASA Earthdata, AIS
Web & Dashboards Streamlit, Next.js, TypeScript, Supabase, Vercel, shadcn/ui
Mobile App Flutter, Dart, Hive, Google AdMob
Automation & DevOps GitHub Actions, Google Apps Script, VBA, Power Query
Tools Claude Code, Kaggle, Google Colab, Excel, Looker Studio
Manufacturing Statistical Quality Control, Process Engineering

📈 Career

  • 2024 - Present: Quality Management @ Marubun Corporation (丸文株式会社)
  • 2008 - 2024: Semiconductor Manufacturing Process Engineer (半導体製造プロセスエンジニア)

🏆 Patents

Stencil mask and manufacturing method thereof (ステンシルマスク及びその製造方法)

  • Patent No: 6307851 (特許第6307851号)
  • Role: Inventor (発明者)
  • Assignee: Toppan Printing Co., Ltd. (凸版印刷株式会社)
  • Link: Google Patents (JP6307851B2)

📫 Contact & Blog

Pinned Loading

  1. kaggle-datasets kaggle-datasets Public

    Baseball-themed Kaggle datasets generated with pybaseball

    Python

  2. mlb-statcast-visualization mlb-statcast-visualization Public

    MLB Statcast data visualization with pybaseball - 3 methods to draw baseball fields (spraychart, matplotlib, sportypy)

    Python

  3. oss-contributions oss-contributions Public

    My open source contributions tracker

    Python

  4. wbc-scouting wbc-scouting Public

    Python