고객 사례
When Video Became the Signal: How GS SHOP Drove a 57.5% Lift in Ordering Customers
GS SHOP is one of South Korea's leading omnichannel commerce platforms, spanning TV live channels, data home shopping, and mobile live commerce. Its Short Pick experience presents product highlights in approximately 30-second clips, bringing content and commerce together in a personalized mobile shopping journey. Part of GS Retail. gsshop.com
Key Outcomes at a Glance
The Challenge
Most "video recommendation" systems are simply traditional product recommendation engines with a video clip attached. GS SHOP's Search & Recommendation team realized their existing system had no way to interpret what a video actually meant to a viewer.
The platform relies heavily on "Short Pick" clips—30-second highlights manually curated from one-hour live broadcasts. For producers, finding specific visual moments (such as a host placing kimchi on boiled pork) meant manually scrubbing through hours of footage, routinely consuming an entire afternoon.
Furthermore, when a customer stopped scrolling to watch a clip, the legacy collaborative filtering engine remained blind to the visual context. If a customer watched a video featuring running shoes, the system simply surfaced more running shoes based on product category tags. It couldn't detect the styling, the lifestyle setting, or the specific aesthetic context that might indicate a latent interest in cross-category items like windbreakers, leggings, or outdoor gear. The video was functioning as a passive thumbnail rather than an active data signal.
Before committing to an infrastructural overhaul, GS SHOP initiated a focused proof-of-concept using TwelveLabs Marengo on Amazon Bedrock.
They stress-tested the model in Korean using a highly specific, multi-layered action query:
"A shopping host placing a piece of kimchi on boiled pork and eating it with chopsticks."
The query required the model to simultaneously process complex cultural vocabulary, identify the subjects, and track continuous action across frames. Marengo successfully located and returned the exact scene from a raw, one-hour broadcast tape in seconds.
This successful validation shifted the project from a cautious experiment to a full-scale production rollout. Building proprietary models in-house was ruled out due to the high costs of data labeling, GPU infrastructure, and specialized machine learning headcount. Because TwelveLabs models were natively accessible as fully managed APIs via Amazon Bedrock, GS SHOP could deploy advanced video intelligence instantly within their existing AWS environment without changing their vendor contract infrastructure.
Rather than replacing their validated product-based recommendation engine, GS SHOP extended it by layering a new video-context signal on top.

The Four-Axis Appeal Point Framework
To ensure the video signal captured context rather than just repeating product tags, Claude filters out brand and product names, mapping the video descriptive data across four distinct axes.
1
Functionality
Core product properties (e.g., waterproof, comfort fit).
2
Style & Sentiment
Aesthetic and emotional tones (e.g., casual, premium, relaxed).
3
Use Situation
When and where the item is featured (e.g., commuting, camping, outdoor activity).
4
Real-life utility adjustments (e.g., easy storage, machine washable).
At recommendation time, a customer's real-time intent signals are combined with these video vectors and existing product data. A customer who clicks on a running shoe video is no longer trapped in a repetitive loop of more running shoes. Instead, the system recognizes the context of outdoor activity and lightweight practicality, seamlessly surfacing relevant items across entirely different categories—like athletic socks, windbreakers, or leggings.
The Results
+57.5%
+29.4%
+21.7%
6.3s → 8.0s
Reduced from 1–2 hours to mere seconds
Beyond the immediate metric lift, the new system protects the user experience by automatically flagging and filtering expired promotions or out-of-stock items after a live broadcast ends. The architecture also features built-in graceful degradation: if the video pipeline ever encounters a delay, the system seamlessly defaults back to standard product data without disrupting the user interface.
Conclusion
With production stabilized, GS SHOP is moving into the next phase: a data-driven content flywheel. By identifying which specific visual "Appeal Points" consistently drive customer clicks and conversions, the team plans to feed these successful parameters directly back into the creative process. Using generative video models on Amazon Bedrock, future Short Pick clips will be shaped directly by real-time audience performance data, fully closing the loop between video production and viewer intent.




