Speak any language.
In seconds.
No account.
Real-time translation. Real voices. Any room. Works with earbuds, smart glasses, bone conduction, lapel mics, smart rings, and more. Live captions, voice identity, emotion-aware output — and Mesh Rooms for group sessions with a QR code. Offline-ready. Free to start.
Launching soon on iOS & Android
"You're at the doctor."
"Your mom speaks only Spanish."
"The nurse speaks only English."
"You grab your phone."
"You speak. She hears it — instantly."
"She answers. You hear it too."
Puente just works.
Live Demo
Your voice. Their ear.
Tap the mic and speak. Puente transcribes, translates, and plays audio privately in the other person's earbud — all in under 2 seconds.
See it in action
See a real conversation.
This is exactly how Puente works. Push-to-talk. Translation in under 2 seconds. Conversation history saved automatically.
Real-world use
Built for the conversations that actually matter.
From Harris Health clinics to Houston ISD classrooms to construction sites in Katy. Profession Packs tune vocabulary for the exact context you're in.
Healthcare
Describe symptoms to an English-only nurse. Discuss a diagnosis. Explain medication instructions. Medical Pack adds clinical vocab, anatomy terms, and formal register.
Legal & Immigration
Immigration appointments, lease signings, rights explanations. Legal Pack preserves court terminology and formal register automatically.
Construction & Trades
Safety briefings, OSHA requirements, job-site task communication. The Trades Pack is tuned for foremen and bilingual crews, with a casual register and job-site vocabulary.
Built for the real world
5 modes.
Every situation covered.
From a quick counter conversation to a multilingual boardroom — pick the setup that fits, or let Puente detect it automatically.
Auto-detect
Fully hands-free
Puente's mic listens continuously. The moment you stop talking, silence triggers the translation — no button, no tap, nothing. Just speak naturally.
Phone only
Tabletop
Phone between you, no earbuds needed
Set the phone flat on a desk, counter, or table. Translation plays through the speaker — out loud for everyone present. Perfect for walk-up conversations, kiosks, and classrooms.
Phone only
Earbud
Private audio, one pair split between two
Each person wears one earbud. Left ear hears one language, right ear hears the other. Both of you get a private, simultaneous translation — no one else hears a thing.
One shared pair of earbuds
Smart Glasses
Hear it without looking down
Pair your Ray-Ban Meta, Xreal, or any supported smart glasses. Puente auto-detects the device and routes translation audio directly to your glasses speakers — eyes up, hands free, completely private.
Ray-Ban Meta · Xreal · Engo
Bone Conduction
Solo mode — hear through your cheekbones
Connect your Shokz or AfterShokz. Puente detects bone conduction headphones and switches to single-channel mode automatically — no left/right split, full translation delivered to the wearer. Ears stay open to your surroundings.
Shokz · AfterShokz
Remote
Two phones, any distance on Earth
Each person uses their own phone. Share a 6-digit room code — no app account needed on either end. Speak in your language, they hear you translated in theirs in real time. Built for video calls, telemedicine, and remote family.
Two phones (one each)
Group
Everyone speaks — everyone is heard
Place the phone in the center of the table. The ambient mic picks up all speakers and uses diarization to separate each voice — assigning a unique label and language to each person. Everyone sees the full conversation translated in their own language as a live caption feed. Built for meetings, multilingual dinners, and negotiations.
Phone in center (earbuds optional)
Talk to anyone.
109 languages. 8 offline. No account. Under 4 seconds to your first translation.
Made in Houston · Hecho en Houston
How It Works
Open. Share. Talk.
No account. No setup. No typing. Ready in under 4 seconds.
Open. No setup needed.
No account. No login. Puente opens right where you left off. Pick your two languages from 109 — or let Auto-detect handle it hands-free. Common phrases are pre-loaded in memory so your first translation is just as fast as your hundredth.
Pick your mode.
Choose how you want to communicate. Auto-detect listens hands-free — no buttons. Tabletop lays the phone flat for a group. Earbud gives each person private audio. Remote connects two phones anywhere on Earth with a 6-digit code.
Speak. Puente handles the rest.
In Auto-detect, just talk — Puente listens and translates automatically. In manual modes, hold to speak. Either way, translation plays in under 2 seconds in a voice matched to that speaker. Puente learns how each person talks and gets sharper every session. History saves automatically.
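For the curious, the hands-free trigger described above (silence after speech ends the turn) can be sketched as a simple energy-based endpointer. This is a minimal illustration, not Puente's actual detector: the class name, the RMS threshold, the 20 ms frame size, and the 600 ms silence window are all assumptions.

```python
from dataclasses import dataclass


@dataclass
class SilenceEndpointer:
    """Flags the end of an utterance after a run of quiet audio frames.

    All thresholds are illustrative assumptions, not Puente's real values.
    """
    energy_threshold: float = 0.02  # RMS below this counts as silence
    frame_ms: int = 20              # duration of one audio frame
    silence_ms: int = 600           # quiet time needed to end the utterance
    _speaking: bool = False
    _quiet_ms: int = 0

    def feed(self, rms: float) -> bool:
        """Consume one frame's RMS energy; return True when the utterance ends."""
        if rms >= self.energy_threshold:
            self._speaking = True
            self._quiet_ms = 0
            return False
        if not self._speaking:
            return False  # leading silence: nothing to translate yet
        self._quiet_ms += self.frame_ms
        if self._quiet_ms >= self.silence_ms:
            self._speaking = False
            self._quiet_ms = 0
            return True
        return False


def utterance_end_frames(rms_frames):
    """Return the frame indexes at which a translation would be triggered."""
    endpointer = SilenceEndpointer()
    return [i for i, rms in enumerate(rms_frames) if endpointer.feed(rms)]
```

Feeding it a stream of per-frame energy values yields one trigger per spoken turn, which is the moment a hands-free mode would hand the audio off for translation.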
Features
Built different. Verified.
Auto Voice Matching — it sounds like you.
The moment you speak, Puente quietly analyzes your pitch, rhythm, and vocal energy. It determines whether you sound more male, female, or somewhere in between — then picks the closest matching voice in the target language. A deep-voiced man gets a deep male translation. A high-energy woman gets a high-energy response. After the first confident read, your voice profile is saved — no guessing on the second conversation. You can also set it manually in Settings and Puente will never override your pick.
Fast enough to feel like a real conversation.
Translation plays back in under 2 seconds — and common phrases are pre-loaded so the very first one feels instant. When you're offline, 8 languages keep working on-device with no signal at all (English, Spanish, French, German, Portuguese, Italian, Chinese, Japanese). Hospital basement, construction site, middle of nowhere: it works.
Pull out a speaker. It just works.
Connect any Bluetooth audio device (a JBL speaker, AirPods, whatever you have) and Puente automatically routes the translation audio to it. No button to press, no setting to change. On Android it wakes up as soon as the speaker connects. On iOS it switches the moment audio routes. Just pull it out and start talking.
Trained on your field, not just the dictionary.
Generic apps translate words. Profession Packs teach Puente the vocabulary your job actually uses — clinical terms, legal language, OSHA safety words, restaurant service phrases. 9 packs available (Medical, Legal, Trades, Restaurant, Finance, Education, Childcare, Emergency, Biblical). Emergency Pack includes a free 3-day trial. From $0.99.
One app. Every situation.
Counter conversation, doctor's office, job site, video call, family dinner — Puente has a mode for each one. Auto-detect listens hands-free. Tabletop plays out loud for a group. Earbud mode gives each person private audio. Remote connects two phones from anywhere. Group mode handles up to 8 people at once, tracking every speaker separately.
The other person hears what you meant — not just what you said.
Word-for-word translation often sounds wrong or loses the meaning entirely. Puente builds a running picture of the whole conversation — what you're talking about, the tone, who's speaking — and uses that to make every translation sound natural on the other end. Idioms get localized. Slang gets adapted. The register matches the situation.
The right tone without thinking about it.
A doctor visit sounds different from a restaurant order. Puente knows that. Set it to Auto, Formal, or Casual — or just activate a Profession Pack and it sets the right register automatically. Medical conversations come out formal and precise. Service industry conversations come out natural and friendly. You focus on talking, not on settings.
Your conversations stay yours.
No account. No login. No email required — ever. Puente doesn't store your audio anywhere. Nothing is sold, nothing is tracked. When you're offline, everything stays on your device. Even in cloud mode, audio is processed and immediately discarded. Your history lives only on your phone and you can delete it anytime.
Smart glasses, bone conduction, lapel mics — just works.
Smart glasses (Ray-Ban Meta, Xreal, Engo) auto-connect via A2DP and route translation audio to the glasses' built-in speakers, so you stay eyes up and hands free. Bone conduction headphones (Shokz, AfterShokz) are detected automatically and switch to single-channel solo mode. Professional lapel mics (Røde, DJI, Hollyland, Deity, Sennheiser) are auto-selected as the highest-priority input with no configuration needed. Connect your Apple Watch or Wear OS watch to trigger recording with a button press using Push-to-Talk mode.
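As a rough sketch of the auto-configuration described above, device detection can be thought of as a lookup from device kind to mode, plus a rule that a lapel mic always wins as the input. The mode names follow the hardware sections of this page; the device-kind strings, the function names, and the "phone speaker" fallback are hypothetical and only illustrate the idea.

```python
# Mode names come from the hardware sections of this page; the device-kind
# strings and the "phone speaker" fallback are hypothetical.
OUTPUT_MODE = {
    "earbuds": "share (L/R split)",
    "smart_glasses": "glasses",
    "bone_conduction": "solo (mono)",
    "bluetooth_speaker": "tabletop",
}


def pick_output_mode(device_kind: str) -> str:
    """Map a detected audio device kind to the mode the app would switch to."""
    return OUTPUT_MODE.get(device_kind, "phone speaker")


def pick_input(connected_inputs: list[str]) -> str:
    """Prefer a lapel mic whenever one is connected, else the phone's own mic."""
    return "lapel_mic" if "lapel_mic" in connected_inputs else "phone_mic"
```

How a real device is classified into one of these kinds (for example from its Bluetooth profile or name) is outside this sketch.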
Empathy Engine — your tone, translated.
Voice matching handles the who. The Empathy Engine handles the how. It reads six emotional dimensions in your voice in real time — pause density, vocal tremor, onset sharpness, dynamic range, rhythm regularity, and sustained vowel ratio — to build a full emotional fingerprint. Then it shapes the translated voice to match: slowing down when you're being deliberate, adding weight when you're emotionally present, staying smooth when you're calm. The person on the other end doesn't just hear your words translated. They hear your tone translated. If you're telling someone something important, the translated voice carries that weight.
Live Captions — see it as it's spoken.
A real-time caption bar floats on screen as translations arrive — showing exactly what was said and what it means, simultaneously. The speaker's name appears if known. Works on your phone screen today, with AR glasses caption output coming next. Set it to phone-only, glasses-only, or both from Settings.
Voice Identity — know who said what.
Puente learns who's speaking using voice fingerprinting and directional audio cues. In group conversations, each speaker gets their own color-coded lane in the transcript — so you always know who said what, not just what was said. Speaker attribution is live and automatic, with no setup required.
Emotion-Aware Translation — tone travels too.
Beyond the six vocal dimensions of the Empathy Engine, you can manually lock a translation register: Auto, Formal, Casual, or Domain-specific for medical, legal, and trades contexts. The output tone adapts to match, and a badge on each translation card confirms which register was used, as reported back by Puente's translation backend (the Worker).
Voice Passthrough — your voice, their language.
When enabled, translated audio sounds like the original speaker — not a generic voice. Puente uses AI voice cloning with explicit two-step user consent. If cloning fails or times out, it falls back to natural TTS automatically. Your voice, in their language.
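The fallback behavior above can be pictured as a "try the cloned voice, but never make the listener wait" pattern. The sketch below is an assumption-laden illustration: `clone_voice` and `standard_tts` are hypothetical callables standing in for whatever engines are actually used, and the 1.5-second timeout is invented for the example.

```python
from concurrent.futures import ThreadPoolExecutor


def synthesize(text, clone_voice, standard_tts, consent_given, timeout_s=1.5):
    """Use the cloned voice when allowed; otherwise fall back to standard TTS.

    clone_voice and standard_tts are hypothetical callables (text -> audio).
    """
    if not consent_given:
        return standard_tts(text)  # cloning is opt-in only
    pool = ThreadPoolExecutor(max_workers=1)
    future = pool.submit(clone_voice, text)
    try:
        return future.result(timeout=timeout_s)
    except Exception:
        # Covers both a timeout and any failure raised inside clone_voice.
        return standard_tts(text)
    finally:
        pool.shutdown(wait=False)  # do not block playback on a slow clone
```

The design choice worth noting is that the fallback is automatic: the speaker never sees an error, only a different voice.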
Mesh Rooms — one QR code. Every language.
Start a live translation room and share a QR code. Anyone who scans it opens puente.chat/r/[roomID], joins, and sees captions in their own language in real time. No accounts, no setup. The host's local translation runs unaffected even if the room fails. Rooms auto-expire after 4 hours. Perfect for meetings, classrooms, and events with mixed-language groups.
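To make the room lifecycle concrete, here is a small sketch of a room record with a join link and the 4-hour expiry mentioned above. The `MeshRoom` class, its field names, and the `https://` scheme are assumptions for illustration; only the URL shape and the 4-hour limit come from this page.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone

ROOM_TTL = timedelta(hours=4)  # rooms auto-expire after 4 hours


@dataclass(frozen=True)
class MeshRoom:
    room_id: str
    created_at: datetime

    @property
    def join_url(self) -> str:
        # URL shape from the page: puente.chat/r/[roomID]; scheme is assumed.
        return f"https://puente.chat/r/{self.room_id}"

    def is_expired(self, now: datetime) -> bool:
        """True once the room has been open for the full time-to-live."""
        return now - self.created_at >= ROOM_TTL


def new_room(room_id: str) -> MeshRoom:
    """Create a room stamped with the current UTC time."""
    return MeshRoom(room_id, datetime.now(timezone.utc))
```

The `join_url` string is what a QR code would encode, and `is_expired` is the check a server could run before letting a new guest in.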
Voice Intelligence
Puente learns your voice.
Then mirrors it back.
Most apps give everyone the same robotic voice. Puente treats your voice as a full human signal — not just a source of text.
The right voice. Every time.
The moment you speak, Puente quietly analyzes your pitch, rhythm, and vocal energy. It figures out whether you sound more male, female, or somewhere in between — and picks the closest matching voice in the target language.
If you're a deep-voiced man speaking Spanish, the English translation comes out in a deep male voice. If you're a high-energy woman speaking English, the Spanish response matches that energy. After the first confident read, your voice profile is stored — the second conversation, it already knows you.
Your tone, translated.
Voice matching handles the who. The Empathy Engine handles the how. It reads six real-time emotional dimensions in your voice and shapes the translated output to match — so the person on the other end hears not just your words, but your weight.
"Language barriers aren't just about words — they're about connection. A doctor explaining a diagnosis, a parent calming a scared child, a business conversation where trust is everything: the emotional texture of how something is said matters as much as what is said."
Puente is the only translation app that treats your voice as a full human signal.
Why Puente
Why Puente wins.
| Feature | Puente | Google Translate | iTranslate | Apple Translate |
|---|---|---|---|---|
| Context-aware rewrites (idioms, tone, register) | ✓ | — | — | — |
| Continuous voice conversation (no stops) | ✓ | One-shot | One-shot | One-shot |
| Auto voice matching (pitch, tone, gender) | ✓ | — | — | — |
| Empathy Engine (tone & emotion translated) | ✓ | — | — | — |
| Learns your voice profile over time | ✓ | — | — | — |
| Works offline (8 languages) | ✓ | Setup req. | — | ✓ |
| Bluetooth speaker auto-switching | ✓ | — | — | — |
| Dual-earbud BLE private audio | ✓ | — | — | — |
| Remote mode (2 phones, 6-digit code, any distance) | ✓ | — | — | — |
| Group mode (multi-speaker diarization, live captions) | ✓ | — | — | — |
| Profession Packs (9 domains) | ✓ | — | — | — |
| AI conversation summaries | ✓ | — | — | — |
| Zero login, zero account | ✓ | — | — | ✓ |
| One-time price (no subscription) | $9.99 | Free | $9.99/mo | Free |
| Languages supported | 109 | 100+ | 100+ | ~20 |
Profession Packs
Your job has its own language.
Now your translator does too.
9 specialist vocabulary packs. Each one trained on real domain terminology — not generic dictionary words. One-time purchase. Works offline.
Medical
Clinical vocab, medications, anatomy, symptom & diagnosis language. Formal register.
Legal
Court terms, rights language, immigration vocab, contract phrases. Never paraphrased.
Trades
Construction, plumbing, electrical, HVAC. OSHA safety terms. Job-site casual register.
Restaurant
Front-of-house and kitchen vocab, allergen terms, menu language. Both registers.
Finance
Loans, accounts, credit, APR, taxes. Built for Houston families navigating banking.
Education
Parent-teacher conferences, IEP terms, grade vocab. Tuned for Houston ISD.
Childcare
Pediatric terms, daycare, WIC, school enrollment. Warm register for families.
Biblical
Catholic Mass, liturgical Spanish, scripture register, sacramental terms. Elevated formal "usted."
Emergency
911 dispatch, first responder commands, emergency symptom descriptions. Speed and clarity first.
All 9 packs are one-time purchases. No subscription. Available inside the app.
Pricing
No subscription. Ever.
5 free translations every day. Or own Pro forever for $9.99 — one time, no subscription, ever. Need access for just a day? $1.99. Rare and regional languages are included in Pro. No account required for any tier.
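Free: 5 translations every day. No account needed to start.
Pro ($9.99, one time): unlocked via in-app purchase after download.
Day Pass ($1.99): perfect for a single appointment.
Profession Packs (from $0.99): 9 packs total, purchased inside the app.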
Our Story
Built in Houston.
Built for Houston.
Houston is the most linguistically diverse city in America. 44% of residents speak a language other than English at home, and you hear it everywhere: taquerias, Harris Health clinics, Houston ISD classrooms, construction sites in Katy.
We built Puente because we live this every day. The app is tuned for the exact contexts Houston demands — medical vocabulary, legal register, job-site Spanish, bilingual family conversations.
For Organizations
One code. Every device.
No IT headache.
Hospitals, schools, law offices, and government agencies use Puente with bulk activation codes. One code activates Pro on any device in your organization — no user accounts, no data leaving your network, no IT complexity.
Activation codes
One code activates Pro on any device. Distribute to your team in minutes — no enrollment, no SSO, no accounts.
HIPAA-aligned privacy
No conversation data is stored on any server. In cloud mode, audio is processed and immediately discarded. In offline mode, nothing leaves the device.
Zero IT friction
No MDM enrollment. No SSO setup. No user accounts to manage. Download, enter code, translate.
Organization Pricing
Clinic / Small Org
$49/mo
or $499/yr — save 15%
Medical offices, restaurants, schools. Up to 10 staff. One code activates all devices.
Enterprise
$149/mo
or $1,499/yr — save 16%
Hospitals, school districts, multi-location. 10+ staff. Priority support.
Custom volume pricing available for large deployments.
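The annual discounts quoted above follow from comparing twelve monthly payments with the yearly price. A quick check, using only the prices listed on this page (the function name is just for illustration):

```python
def annual_savings_pct(monthly: float, yearly: float) -> float:
    """Percent saved by paying yearly instead of twelve monthly payments."""
    full_year = monthly * 12
    return (full_year - yearly) / full_year * 100


for plan, monthly, yearly in [
    ("Clinic / Small Org", 49, 499),
    ("Enterprise", 149, 1499),
]:
    print(f"{plan}: {annual_savings_pct(monthly, yearly):.1f}% saved")
```

Twelve months of the Clinic plan come to $588 against $499 yearly, and twelve months of Enterprise come to $1,788 against $1,499, which rounds to the 15% and 16% shown.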
Hardware
Works with what you already own.
Puente auto-detects connected devices and configures the right mode — no setup required. New devices are added with every update.
Earbuds Share mode — L/R split
One pair split between two people. Both people wear one earbud each.
| Brand | Models |
|---|---|
| Apple | AirPods (all gen), AirPods Pro 1/2/3, AirPods Max |
| Samsung | Galaxy Buds, Buds+, Buds Pro, Buds2, Buds3 |
| Google | Pixel Buds, Pixel Buds A-Series |
| Jabra | Elite, Evolve, Enhance series |
| Sony | WF-1000XM5, WF-C700N, WF-SP900, WH-1000XM5 |
| Bose | QuietComfort Earbuds, QC45, Sport Earbuds, Ultra Open Earbuds |
| Beats | Studio Buds, Fit Pro, Powerbeats |
| Anker Soundcore | Liberty 3 Pro, Liberty 4, P3 |
| Nothing | Ear (1), Ear (2), CMF Buds, CMF Buds Pro |
| Oladance | OWS Pro (open-ear) |
| JLab | Go Air, Epic, Studio |
| Skullcandy | Indy, Dime, Rail |
| EarFun | Air, Free series |
| Sennheiser | CX True Wireless, Momentum TW |
| Puente | Puente Buds (first-party, coming soon) |
Smart Glasses Glasses mode — eyes-up
Auto-detected via A2DP. Translation plays through glasses speakers.
| Brand | Models |
|---|---|
| Ray-Ban Meta | Smart Glasses (all gen), Wayfarer, Headliner |
| Meta | Meta Glasses (standalone) |
| Xreal | Xreal Air, Air 2, Air 2 Pro, Air 2 Ultra |
| ActiveLook / Engo | Engo 2 cycling glasses |
| Even Realities | G1 smart glasses |
| Puente | Puente Glasses (first-party, future) |
Bone Conduction Solo mode — mono channel
Auto-detected. L/R split UI hidden. Full session delivered to the wearer.
| Brand | Models |
|---|---|
| Shokz | OpenRun, OpenRun Pro, OpenRun Pro 2, OpenFit, OpenFit Air, OpenMove, OpenSwim, OpenSwim Pro |
| AfterShokz | Aeropex, Trekz Air (legacy branding) |
| Puente | Puente Mono (first-party, coming soon) |
Lapel / Clip-On Mics Auto-priority input
Automatically selected as highest-priority input. No configuration needed.
| Brand | Models |
|---|---|
| Røde | Wireless GO, Wireless GO II, Wireless ME, Lavalier GO |
| DJI | DJI Mic, DJI Mic Mini, DJI Mic 2 |
| Hollyland | Lark M1, Lark C1, Lark M2 |
| Deity | Pocket Wireless |
| Sennheiser | XSW-D Lavalier, MKE 400 |
| Puente | Puente Clip (first-party, coming soon) |
Smartwatch PTT Push-to-Talk trigger
Physical watch button starts/stops recording. Momentary (hold) or toggle (tap) mode.
| Platform | Devices |
|---|---|
| Apple Watch | All models (Series 1+) paired to iPhone |
| Wear OS | Samsung Galaxy Watch 4/5/6/7, Google Pixel Watch 1/2/3, Fossil, TicWatch |
Requires the companion Puente Watch app — coming soon.
Smart Rings Ring gesture — hands-free PTT
Tap, long-press, or swipe your ring to trigger translation without touching your phone.
Full gesture support (tap · long press · double tap · swipe)
| Ring | Tap | Long | Double | Swipe |
|---|---|---|---|---|
| Colmi R02 / R06 / R10 | ✓ | ✓ | ✓ | ✓ |
| Circular Slim / Slim 2 | ✓ | ✓ | ✓ | ✓ |
| BOHE Smart Ring | ✓ | ✓ | ✓ | ✓ |
| Putere Flex | ✓ | ✓ | ✓ | ✓ |
| Nimb | ✓ | ✓ | ✓ | ✓ |
| Motiv (legacy) | ✓ | ✓ | ✓ | ✓ |
BLE wake only (tap to open — no gesture discrimination)
| Ring | Wake |
|---|---|
| Oura Ring Gen 3 / Gen 4 | ✓ |
| Ultrahuman Ring AIR | ✓ |
| Evie Ring | ✓ |
| Amazfit Helio Ring | ✓ |
| Noise Pulse 2 Max | ✓ |
Oura, Ultrahuman, and similar health rings can open the app on tap but can't do long-press-to-record or swipe volume — those gestures live inside their proprietary companion SDK, not on an open GATT characteristic.
Bluetooth Speakers Tabletop mode
Auto-detected at connect time. Both people hear the translation out loud. No earbuds required.
| Brand | Models |
|---|---|
| JBL | Clip 4, Clip 5 (validated test device) |
| Puente | Puente Speaker (first-party, coming soon) |
Any Bluetooth A2DP speaker that shows up as a trusted device will trigger tabletop mode.
FAQ
Questions people ask
Is there a subscription?
No. Pro is a one-time $9.99 purchase — you own it forever, including all rare and regional languages. There’s also a $1.99 Day Pass for 24 hours of full access. Profession Packs are one-time purchases from $0.99.
What languages does it support?
109 languages via cloud AI (DeepL Voice + Google Translate fallback). 8 languages work fully offline with no signal: English, Spanish, French, German, Portuguese, Italian, Chinese, and Japanese — powered by on-device Whisper AI.
Does it work without internet?
Yes. On-device Whisper AI keeps 8 languages running with zero signal: hospital basements, rural job sites, airplane mode. When you're back online, all 109 languages become available again automatically.
What happens to my conversations?
They stay on your device. Puente does not store audio on any server — it's processed and discarded. Your transcripts and history live only on your phone. No account, no login, no data sold, no ads.
Can I use it for a group of people?
Yes. Group mode (Pro) places the phone in the center of the table. The ambient mic picks up all speakers, separates each voice using diarization, and assigns each person a unique label and language. Everyone sees live captions in their own language. Up to 8 people, one phone.
How accurate is it?
Puente uses the DeepL Voice API, among the highest-accuracy engines available, with Google Translate as an automatic fallback. For specialized fields, Profession Packs add domain vocabulary so medical terms, legal language, and trade jargon are translated correctly from the first sentence. The AI also tracks the conversation as it unfolds and rewrites translations so they sound like natural speech rather than word-for-word output.
Can my organization or clinic use it?
Yes. Puente offers bulk activation codes for hospitals, schools, law offices, and government agencies. One code activates Pro on any device. No user accounts, no MDM enrollment, no IT setup required. Contact us for bulk pricing.
Download free.
Start talking.
5 free translations every day. Own Pro forever for $9.99. No account. iOS & Android.