fix: Fuzzy completion with Unicode chars #887

ysthakur · 2025-03-13T01:20:27Z

This PR fixes a panic that happens when trying to display fuzzy-matched suggestions. Previously, #886 fixed this for prefix matching. Closes nushell/nushell#12680 and nushell/nushell#15302.

This is temporary, until we start highlighting fuzzy-matched suggestions properly (underlining only the graphemes that were matched, rather than underlining a prefix of the suggestion).

I chose to operate on graphemes rather than characters because that's what Nucleo operates on and because they seem to correspond better to a block of text shown on the screen. First we get the width of the shortest string, and then for every suggestion, we highlight the suggestion up to the last grapheme that matches or exceeds this width.

src/menu/columnar_menu.rs

ysthakur · 2025-03-13T16:40:53Z

Temporarily converting this to a draft because I don't think this checks for suggestions shorter than the text being completed. Reported here: reubeno/brush#406.

ysthakur · 2025-03-14T22:10:44Z

Just threw in a test to make sure that suggestions shorter than the text being matched don't cause a panic. Should be ready to go now.

blindFS · 2025-03-15T04:06:42Z

I'd like to bring another (probably) related issue nushell/nushell#4595 into the sight here. @uek-1 would know it better.

ysthakur · 2025-03-15T06:28:21Z

> I'd like to bring another (probably) related issue nushell/nushell#4595 into the sight here. @uek-1 would know it better.

@blindFS I'm not sure I understand. Isn't handling quotes in partial completions a related but separate matter?

blindFS · 2025-03-15T06:37:23Z

> > I'd like to bring another (probably) related issue nushell/nushell#4595 into the sight here. @uek-1 would know it better.
>
> @blindFS I'm not sure I understand. Isn't handling quotes in partial completions a related but separate matter?

Yes, related but separate, just in case we need some top-down redesign to address them at the same time.

To be clear, that issue is mainly about common prefix completion (which works if both items are not quoted). I mean we should probably strip the quotes when extracting the common prefix.
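
A minimal sketch of the "strip the quotes when extracting the common prefix" idea, using only std. The names `strip_quotes` and `common_prefix_unquoted` are hypothetical and are not Reedline or Nushell APIs; the sketch assumes each suggestion is wrapped in at most one pair of quotes and ignores escaping inside them.

```rust
/// Strip one leading quote (`"`, `'`, or backtick) and, if present,
/// the same quote character at the end.
fn strip_quotes(s: &str) -> &str {
    for q in ['"', '\'', '`'] {
        if let Some(rest) = s.strip_prefix(q) {
            return rest.strip_suffix(q).unwrap_or(rest);
        }
    }
    s
}

/// Longest common prefix of the unquoted suggestions, compared char by char.
/// (Hypothetical helper; a grapheme-aware version would compare grapheme clusters.)
fn common_prefix_unquoted<'a>(suggestions: &[&'a str]) -> &'a str {
    let mut iter = suggestions.iter().copied().map(strip_quotes);
    let Some(first) = iter.next() else { return "" };
    let mut end = first.len();
    for other in iter {
        // Byte length of the shared prefix; summing `len_utf8` keeps it on a char boundary.
        let shared: usize = first
            .chars()
            .zip(other.chars())
            .take_while(|(a, b)| a == b)
            .map(|(a, _)| a.len_utf8())
            .sum();
        end = end.min(shared);
    }
    &first[..end]
}
```

With this, a quoted `foo bar` and an unquoted `foobar` share the prefix `foo`, which is the case described above as not working today.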

sholderbach

Thanks for tackling this complexity.

I don't quite follow why this code has to use the vague oracle of UnicodeWidth and can't simply use the grapheme/byte indices etc. which are the relevant unit for where we actually perform the highlighting/string splitting etc.

sholderbach · 2025-03-15T15:04:31Z

src/menu/menu_functions.rs

+pub fn split_suggestion(sugg: &str, match_width: usize) -> (&str, &str) {
+    let mut match_end = sugg.len();
+    let mut curr_width = 0;
+    for (offset, grapheme) in sugg.grapheme_indices(true) {
+        if curr_width >= match_width {
+            match_end = offset;
+            break;
+        }
+        // Strip quotes from the beginning
+        if offset == 0 && (grapheme == "`" || grapheme == "'" || grapheme == "\"") {
+            continue;
+        }
+        curr_width += grapheme.width();


I guess with this code doing operations on Unicode width would not panic.

But I have a question if this misbehaves if the user tries to complete starting from " etc. or " being a particular suggestion? (Is this relying on a particular behavior of our Completer implementations to take out all quotes?)

blindFS · 2025-03-15

> I guess with this code doing operations on Unicode width would not panic.
>
> But I have a question if this misbehaves if the user tries to complete starting from " etc. or " being a particular suggestion? (Is this relying on a particular behavior of our Completer implementations to take out all quotes?)

True, that's one of the reasons why this is just a temporary fix.

ysthakur · 2025-03-16

Yeah, this won't work well if a quote is part of what the user types. As blindFS said, it's a temporary fix. I was planning to allow a separate display value on Suggestions, so that when showing suggestions for files, the displayed values could be unquoted and unescaped. Then Reedline would no longer have to strip quotes.

This won't solve the problem of getting a common prefix that accounts for quotes, though, so that's going to need some more thought. One obvious thing I can think of is pushing the burden of finding a common prefix onto completers.

This current quote stripping is really for Nushell, and I assume it won't hurt other users of Reedline either.

ysthakur · 2025-03-16T19:54:22Z

> I don't quite follow why this code has to use the vague oracle of UnicodeWidth and can't simply use the grapheme/byte indices etc. which are the relevant unit for where we actually perform the highlighting/string splitting etc.

I just used width so that when displaying, the underlined sections would be (around) the same width. But yeah, looking at graphemes makes more sense. I've updated it now.
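
For illustration, a rough std-only sketch of splitting by count rather than by display width. It is not the code in this PR: the name `split_by_char_count` is hypothetical, and it counts `char`s via `char_indices` as a stand-in for graphemes so that it needs no external crate. A grapheme-aware version would iterate with `grapheme_indices(true)` from `unicode-segmentation`, as the diff hunk above does.

```rust
/// Split `sugg` after its first `match_len` characters, not counting one leading quote.
/// Returns (highlighted part, remainder). Hypothetical helper for illustration only.
fn split_by_char_count(sugg: &str, match_len: usize) -> (&str, &str) {
    let mut counted = 0;
    for (offset, ch) in sugg.char_indices() {
        if counted >= match_len {
            // `offset` comes from `char_indices`, so it is always a valid split point.
            return sugg.split_at(offset);
        }
        // A leading quote stays in the highlighted part but is not counted.
        if offset == 0 && matches!(ch, '`' | '\'' | '"') {
            continue;
        }
        counted += 1;
    }
    // The suggestion is shorter than the typed text: highlight all of it.
    (sugg, "")
}
```

Because every split offset comes from the iterator, the split can never land inside a multi-byte character, and a suggestion shorter than the typed text falls through to the final return instead of indexing out of range.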

fix: consider graphemes when split match str

b62ad02

ysthakur mentioned this pull request Mar 13, 2025

fix: columnar_menu create_string with quoted suggestions #886

Merged

blindFS reviewed Mar 13, 2025

src/menu/columnar_menu.rs (outdated, resolved)

Rename shortest_base_string to match_width

c86a1ba

ysthakur marked this pull request as draft March 13, 2025 16:39

sholderbach mentioned this pull request Mar 13, 2025

Fuzzy autocompletion of files bugs nushell/nushell#12680

Open

Test suggestions shorter than typed

6606347

ysthakur marked this pull request as ready for review March 14, 2025 22:09

ysthakur requested a review from sholderbach March 14, 2025 22:11

sholderbach reviewed Mar 15, 2025


ysthakur added 2 commits March 16, 2025 15:38

Use number of graphemes rather than width

bedb982

Test empty match

2bf4299
